Introduced
The risk of Inferred Sensitive Data is introduced in several components. It's
inherent to models due to their non-deterministic nature and is amplified by
inadequate data handling practices that fail to filter sensitive information.
It can also be due to training processes that neglect to evaluate the model's
potential for sensitive inferences.
Exposed
This risk is exposed within the model when it generates a response containing
inferred sensitive data that it shouldn't.
Mitigated
Mitigation is multi-pronged: filtering model outputs to prevent revealing
inferred sensitive data, rigorously testing the model during training, tuning,
and evaluation to prevent sensitive inferences, and proactively removing or
labeling data that could lead to such inferences during sourcing, filtering,
and processing before training.