Apple is facing another class-action lawsuit after being accused of using pirated book content to train its artificial intelligence models. This lawsuit names two neuroscience professors, Susana Martinez-Conde and Stephen Macknik, from the State University of New York Downstate Health Sciences.AllegationsApple used their registered works without authorization and used pirated content, including the works of the two, through "shadow libraries" and web crawler software to train its artificial intelligence models.
Prior to this lawsuit, Apple was sued by another group of authors a month ago, who also accused Apple of using published works to train its Apple Intelligence artificial intelligence model without consent.
In addition to Apple, technology companies including OpenAI are also facing similar lawsuits. For example, the New York Times has also faced similar lawsuits.Suing OpenAIAlthough the application of artificial intelligence technology is still in its early stages of development, related lawsuits have begun to produce results that may become precedents. For example, Anthropic reached a settlement with 50 authors involved in the lawsuit for US$15 billion earlier this year, which also involved copyright-related disputes.
Apple is once again facing a copyright dispute over the use of AI training data, indicating that the current development of AI technology still faces issues related to the legality of obtaining training data. Especially as more and more content creators and publishers begin to pay attention to the use of their content for AI training, technology companies will inevitably have to pay more attention to the legality of the use of training data.
Judging from the fact that Anthropic reached a settlement of up to US$15 billion and involved 50 authors, it is clear that such lawsuits may bring huge compensation pressure to technology companies and may even affect their development layout of artificial intelligence technology.
Therefore, it is expected that more technology companies will begin to focus on the legal use of training data and may cooperate with more content creators and publishers to obtain the data needed for training by signing licensing agreements, thereby avoiding greater litigation risks in the future.
However, whether such development will increase the cost of artificial intelligence technology training and thus affect the application and development of artificial intelligence technology may also be a project worthy of market attention.



