AIBearishArs Technica – AI · Feb 206/107
🧠
Microsoft deletes blog telling users to train AI on pirated Harry Potter books
Microsoft deleted a blog post that instructed users to train AI models using a dataset containing pirated Harry Potter books. The company acknowledged the Harry Potter dataset was "mistakenly" marked as public domain, raising questions about data sourcing practices for AI training.