Legal fights over how artificial intelligence systems are built have moved from technical debate into the courtroom, putting major technology firms under growing scrutiny as judges test whether longstanding copyright rules can be stretched to cover machine learning. Companies including Google and Apple have mounted defences portraying their use of copyrighted works as transformative and essential to AI development, arguments now being examined ahead of hearings in the Northern District of California. According to reporting, those defences are colliding with a string of recent judicial decisions that offer contrasting views on fair use and on the legality of data acquisition methods.

A notable milestone came when a federal judge concluded that one AI developer’s use of millions of legally acquired books to train a chatbot was “exceedingly transformative,” a finding that treated the training process itself as potentially protected fair use. The same company, however, remains exposed to separate litigation over whether it obtained other texts from illicit “shadow libraries,” underscoring the difference between how material is used and how it is sourced. Industry observers say that outcome shows courts will parse both transformation and provenance when assessing liability.

At the same time, other cases have reached very different outcomes. A high-profile suit brought by a group of authors against a major social media company was dismissed by a federal judge who said the plaintiffs had not properly framed their claims, a ruling that stopped short of endorsing the defendant's practices but signalled that plaintiffs must craft clearer legal theories to prevail. That decision, and others like it, illustrates how litigation strategy and record-building can be decisive even where the underlying legal questions remain unsettled.

Courts have not been uniformly sympathetic to broad fair-use claims. In a separate matter, a judge sided with a publisher after finding that a defunct legal research firm had used proprietary database content without permission to train its model, rejecting the defendant’s fair-use defence. Legal experts warn that such rulings could become precedents limiting the scope of permissible training practices, particularly where high-value, subscription-based content is involved.

Those mixed outcomes have prompted calls from publishers, creators and some policymakers for clearer rules or licensing regimes that address both the transformative potential of AI and the economics of content sourcing. Commentators have pointed out that favourable rulings for AI makers often rest on case-specific facts and may leave creators with leverage to seek compensation, while rulings against firms emphasise the importance of lawful acquisition. Analysts also note the commercial effect: if companies must pay to license large bodies of text, the cost dynamics of model training will shift.

As litigation proceeds in California and elsewhere, the outcomes will shape not only industry practices but also the bargaining position of authors and publishers. Legal scholars and market participants are watching whether courts will converge on a coherent standard for when model training is transformative and when it crosses the line into infringement, and whether lawmakers will step in to set uniform rules. For companies facing parallel suits, the immediate task is twofold: defend their current data practices in court and prepare for a future in which some form of licensing or stricter provenance requirements may become the norm.


Source: Noah Wire Services