AI · Bullish · arXiv CS AI · 14h ago · 7/10
Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation
Researchers propose RPSG, a method for generating synthetic data from private text with large language models under differential privacy. The approach uses private seeds and applies formal privacy mechanisms during candidate selection, producing high-fidelity synthetic data with stronger privacy guarantees than existing methods.
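The summary does not say which privacy mechanism RPSG applies during candidate selection. As a rough illustration of what differentially private selection can look like, the sketch below uses the standard exponential mechanism, which is an assumption here and not necessarily the paper's method. The function names, the utility scores, and the sensitivity value are all hypothetical.

```python
import math
import random


def exponential_mechanism_probs(scores, epsilon, sensitivity=1.0):
    """Selection probabilities under the exponential mechanism.

    Candidate i is chosen with probability proportional to
    exp(epsilon * score_i / (2 * sensitivity)).
    `scores` are hypothetical utilities computed from private data;
    `sensitivity` is how much one private record can change any score.
    """
    if epsilon <= 0 or sensitivity <= 0:
        raise ValueError("epsilon and sensitivity must be positive")
    scale = epsilon / (2.0 * sensitivity)
    # Subtract the max score for numerical stability; the ratios are unchanged.
    max_score = max(scores)
    weights = [math.exp(scale * (s - max_score)) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def select_candidate(candidates, scores, epsilon, sensitivity=1.0, rng=None):
    """Sample one candidate according to the exponential mechanism."""
    rng = rng or random.Random()
    probs = exponential_mechanism_probs(scores, epsilon, sensitivity)
    return rng.choices(candidates, weights=probs, k=1)[0]


if __name__ == "__main__":
    # Hypothetical setup: each score counts how many private seeds
    # a synthetic candidate resembles (so one record shifts a score by <= 1).
    candidates = ["candidate A", "candidate B", "candidate C"]
    scores = [3, 1, 0]
    print(exponential_mechanism_probs(scores, epsilon=1.0))
    print(select_candidate(candidates, scores, epsilon=1.0))
```

A smaller `epsilon` flattens the probabilities toward uniform (more privacy, less fidelity), while a larger one concentrates selection on the highest-scoring candidate.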