With the rapid development of generative AI technology, technology companies have also been faced with copyright disputes. After several AI companies were accused of using copyrighted works without permission, Apple was recently sued by two authors.Filing a class action lawsuit, accusing it of using pirated e-book resources in the process of training AI systems, thereby infringing the copyright of relevant authors.
The main plaintiffs in this lawsuit are American authors Grady Hendrix and Jennifer Roberson, both of whom have published numerous well-known works. The lawsuit alleges that Apple uses a dataset in its AI training process that includes a large number of unauthorized pirated e-books containing copyrighted works, without obtaining the original authors' consent or paying licensing fees. Apple then uses this content in AI model training and in its "Apple Intelligence" service.
The plaintiffs directly accuse Apple of copying and using copyrighted works to train AI models that compete with the originals, thereby diluting the market value of these works. They argue that this not only deprives authors of control over their creations but also allows Apple to reap substantial commercial benefits through illegal means. Due to the sheer volume of works involved in the pirated content, the case is currently seeking class action status, potentially covering thousands of authors.
It’s worth noting that this isn’t the first copyright lawsuit related to generative AI. In the past few months, AI companies including OpenAI and Anthropic have been hit with similar lawsuits.
Among them, OpenAI is embroiled in litigation with the New York Times and the oldest nonprofit news organization in the United States, while Anthropic reached a $15 billion settlement with a class action lawsuit involving 50 authors earlier this year. The authors involved in the lawsuit will receive approximately $3000 in compensation for each work.
Compared to other AI companies, Apple has always emphasized that its AI training process prioritizes privacy and data protection. However, this latest accusation highlights another layer of controversy: the reliance on the legality of data sources for AI model training.
If the court determines that Apple has indeed used unauthorized pirated content, it may not only affect its AI development strategy, but will also further deepen the outside world's attention on "large-scale content infringement" in the industry.
Apple has yet to officially respond to the lawsuit, but the industry generally believes that as the commercialization of generative AI accelerates, establishing a more transparent data licensing and compensation mechanism will become a core dispute between tech giants and creators. The development of this case may also serve as an important reference for future copyright regulations in the AI industry.



