Pandas extension that measures privacy risk

Steve Nyemba 0b16ce94cc notebooks 6 лет назад
notebooks 0b16ce94cc notebooks 6 лет назад
src 43cbd12a1f misc updates ... 6 лет назад
README.md 47f94974c9 bug fix and adding usage 6 лет назад

README.md

deid-risk

The code below extends a data-frame by adding it the ability to compute de-identification risk (marketer, prosecutor). Because data-frames can connect to any database/file it will be the responsibility of the user to load the dataset into a data-frame.

Basic examples that illustrate usage of the the framework are in the notebook folder. The example is derived from http://ehelthinformation.ca

Dependencies:

numpy 
pandas

Limitations:

@TODO:    
    - Add support for journalist risk