We benchmarked three open-source PII detectors (Microsoft Presidio, GLiNER, and OpenAI's Privacy Filter) on detection accuracy and throughput across two datasets and six entity types. OPF leads out-of-the-box on a GPU, Presidio wins on CPU and pattern-rich entities, GLiNER is the most flexible to configure.
A structured breakdown of where privacy risk lives across the four phases of the AI lifecycle, and what genuinely changes with generative AI versus traditional ML.