Meta's Controversial Use of Pirated Books for AI Training
Via Wired
Summary
A judge has unredacted court documents revealing that Meta allegedly used the pirated "shadow library" LibGen to train its AI models, despite internal warnings that doing so could undermine the company’s position with regulators.
The filings suggest that top executives, including Mark Zuckerberg, were aware of the dataset's illicit origins and that Meta may have even distributed pirated works by "seeding" torrent files during the training process.