AIBullisharXiv – CS AI · 10h ago6/10
🧠
GLiNER2-PII: A Multilingual Model for Personally Identifiable Information Extraction
Researchers have developed GLiNER2-PII, a compact 0.3B-parameter multilingual model for detecting personally identifiable information across 42 entity types at character-level precision. Trained on a synthetic corpus of 4,910 annotated texts to overcome privacy constraints in real data collection, the model outperforms existing systems including OpenAI's Privacy Filter on benchmark evaluations and is now publicly available on Hugging Face.
🏢 OpenAI🏢 Hugging Face