Notice that we're talking about de-identified patient data here. There is a utility/privacy trade-off when using data containing private information: on the one hand, the patient's personal information must absolutely be protected, on the other hand, many processes could benefit from the information in such a data collection even without the knowledge of which data point belongs to which individual.
If de-identification is done right, it would be a bit of a stretch to talk about "surveillance", because that's the whole point of de-identification: remove any information from the records that allows a third party to identify the underlying person from whom the data originated. Note especially that this includes inference attacks: not only should any occurrences of names be removed or masked, but so should any information that would allow an informed attacker to re-infer that identity, i.e., to cross-link the patient data back to a specific person.
The elephant in the room, however, is the "If" at the beginning of the previous paragraph. As I see it, the problem lies not in wanting to build functionality that actually uses the collected information, but in whether appropriate privacy prerequisites were put in place beforehand.
HIPAA's Safe Harbor rules already consider that too much information, and k-anonymity (under the Expert Determination route) can hardly be applied if you need to provide the full zip code and have a data set that will grow and shrink over time.
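The k-anonymity property the comment refers to is mechanical to check: a release is k-anonymous if every combination of quasi-identifier values is shared by at least k records. A minimal sketch (the records and field names here are hypothetical, not from any real registry) shows why full zip codes break it — they produce equivalence classes of one:

```python
from collections import Counter

def is_k_anonymous(rows, quasi_identifiers, k):
    """True if every quasi-identifier combination appears in at least k rows."""
    counts = Counter(
        tuple(row[q] for q in quasi_identifiers) for row in rows
    )
    return min(counts.values()) >= k

# Hypothetical records: with a full ZIP, one resident forms a class of one.
rows = [
    {"zip": "90210", "age_group": "30-39", "sex": "F"},
    {"zip": "90210", "age_group": "30-39", "sex": "F"},
    {"zip": "10001", "age_group": "60-69", "sex": "M"},  # unique -> re-identifiable
]
print(is_k_anonymous(rows, ["zip", "age_group", "sex"], k=2))  # False
```

And a growing/shrinking data set makes it worse: a release that is k-anonymous today can silently stop being so after the next batch of inserts or deletions, so the check would have to be re-run against every published snapshot.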
It takes about 33 bits of entropy to narrow down to a single individual worldwide, and about 28 bits within the USA (log2 of roughly 330 million people).
UID, gender, age group, zip code and city, plus of course your medication habits, are probably enough to de-anonymize with a reasonable amount of confidence. Say age group is one of 8; age plus gender is ~5 bits of entropy. Zip plus city is ~8 bits. So that's 15 bits left on a good day.

Throw in any off-the-shelf targeted marketing data (usually worth 10-25 bits, iirc) and you might as well use the SSN as the patient ID.
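The arithmetic above can be checked directly. This sketch assumes uniformly distributed attributes (real distributions leak somewhat less, the comment's figures are deliberately rough, and its 5-bit age+gender estimate is a touch generous — a uniform count gives log2(8 × 2) = 4 bits):

```python
import math

def bits(n_possibilities):
    """Bits of entropy revealed by a uniformly distributed attribute."""
    return math.log2(n_possibilities)

us_population = 330_000_000
target = bits(us_population)            # ~28.3 bits to single out one US resident

age_gender = bits(8) + bits(2)          # 8 age groups x 2 sexes -> 4 bits
zip_city = 8                            # rough figure for ZIP + city
remaining = target - age_gender - zip_city

print(round(target, 1), round(remaining, 1))  # -> 28.3 16.3
```

With ~16 bits left, a marketing-data side channel worth 10-25 bits closes the gap on its own, which is the comment's point.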
As others noted, it is de-identified in name only. There is clearly sufficient information to make this something like a ROT13 analogue.
> If de-identification is done right
There is the rub, indeed. As a general rule, I don't trust de-identification. People mostly seem to reason poorly about how datasets can be merged and this has repeatedly failed.
Worse, I have seen it proposed to shut people up about privacy in situations where the proposer knew full well it would fail. De-identification was merely a prop in a con.
I would suggest that, if sensitive de-identified data is to be used by government, it first go through a public challenge round. Let the public take a shot at re-identifying it; that would build confidence, and it would help suppress a little conspiratorial nonsense too, something we could use right now.
They should put their money where their mouth is and release the de-identified info of the high ranking DEA personnel. If they're so confident it's de-identified, it shouldn't be a problem. If that's a problem for them, the rest of us should definitely not trust it.
That's the rub, isn't it? How can anyone think US intel/LEO agencies will settle for de-identified anything? Their definition of anonymous seems to be "we didn't look at it yet."
The issue is that once they target an individual as potentially abusing their prescriptions, it's only a matter of time before they seek a warrant to properly ID and raid that person. Guilty or innocent, those raids never go well for anyone, or their dogs.
States already have similar registries that are not de-identified. Police or other local officials can review these registries almost at will. My provider requires patients to sign an expansive privacy waiver.
The US medical profession completely rolled over and sold out their patients. I can't figure out why, unless it is part of a deal to avoid being pursued or prosecuted for their part in creating the opioid crisis.