Encyclopædia Britannica and its Merriam‑Webster subsidiary have taken OpenAI to federal court in Manhattan, accusing the company of copying nearly 100,000 of their entries without permission to train its large language models and claiming the practice has siphoned off readers and tarnished the encyclopaedia’s reputation. According to the complaint, the AI outputs produce summaries that too closely resemble Britannica’s work and therefore undercut visits to its sites. (Sources: Encyclopædia Britannica filings and reporting.) Industry observers say the case sits at the heart of a broader legal contest over whether AI systems’ outputs are “transformative” enough to qualify as fair use. According to reporting on parallel litigation, plaintiffs argue that when models replicate the distinctive substance or structure of copyrighted works, that use cannot be characterised as transformative and may harm the original creator’s market. The outcome could reshape permissible training practices for commercial AI developers. (Sources: Encyclopædia Britannica filings; coverage of related suits.) The Britannica suit joins a growing list of high‑profile actions alleging unauthorised ingestion of proprietary material. The New York Times and several regional newspapers secured judicial permission to press copyright claims against OpenAI and Microsoft, contending that generative responses have reproduced reporting verbatim and threatened news business models. Similarly, media outlets including The Intercept, Raw Story and AlterNet have lodged claims alleging large‑scale copying of journalism. These cases underscore shared concerns about downstream economic effects on content producers. (Sources: Encyclopædia Britannica filings; Associated Press coverage.) Beyond news organisations, companies that compile and curate specialised databases have also sued AI firms. Nielsen’s Gracenote alleges OpenAI misused copyrighted metadata and the unique relational framework that underpins its commercial offerings, while groups of authors have targeted other technology providers over the use of books and other written works. According to reporting, those suits emphasise not only verbatim copying but the appropriation of curated structures and identifiers that embody commercial value. (Sources: Axios; The Guardian.) Entertainment and creative industries are pursuing comparable claims. Major studios have accused generative image platforms of enabling near‑replication of protected characters, arguing that such capabilities threaten licensing markets and the value of original artistic works. Those actions challenge the extent to which training on large, mixed‑source datasets can be justified under fair use and raise questions about whether current practice preserves or depletes the incentives that sustain creative production. (Sources: The Associated Press; Time.) Legal experts say the Britannica case could become a bellwether because it focuses on encyclopaedic content long considered core reference material and because the complaint links alleged copying to measurable commercial harm. If a court finds that OpenAI’s use fails the transformative test or that AI outputs supplant the market for the originals, the decision may force companies to alter training datasets, licensing arrangements or the way AI systems present sourced information. Conversely, a ruling for OpenAI would reinforce broader latitude for model builders but would likely invite new debates about attribution, remediation and compensation for rights holders. (Sources: Encyclopædia Britannica filings; Axios; Associated Press.)

Source Reference Map

Inspired by headline at: [1]

Sources by paragraph:

Source: Noah Wire Services