Authors Sue Adobe Over Alleged Use of Pirated Books in AI Training

Adobe Faces Legal Heat Over AI Training Practices

Software powerhouse Adobe finds itself embroiled in controversy as authors allege the company used pirated books to train its artificial intelligence systems. The proposed class-action lawsuit could have far-reaching implications for how tech firms develop AI models.

The Heart of the Lawsuit

Elizabeth Lyon, an Oregon-based writer, has filed suit claiming Adobe used her copyrighted material without authorization to train its lightweight language model, SlimLM. According to court documents, Lyon isn't alone: potentially thousands of authors may have had their works used similarly.

The crux of the complaint centers on Adobe's alleged use of SlimPajama-627B, an open-source dataset containing the notorious Books3 collection. This controversial trove includes approximately 191,000 e-books believed to have been scraped without proper licensing.

A Growing Industry Controversy

Adobe isn't the first tech company to face such allegations. Industry heavyweights including Apple, Salesforce, and Anthropic have previously encountered legal challenges regarding their use of similar datasets containing Books3 content.

The SlimLM model at issue specializes in document assistance tasks and is optimized for mobile devices. The lawsuit raises the question of whether the copyrighted materials allegedly used in its development were properly licensed.

What This Means for Creators and Tech Firms

The lawsuit arrives amid mounting concerns from creative professionals about AI companies using protected works without compensation or consent. Authors argue these practices undermine their livelihoods while giving tech firms an unfair advantage.

Adobe hasn't yet publicly addressed the allegations. However, legal experts suggest the case could produce a landmark decision shaping how courts view the sourcing of AI training data.

The outcome may force technology companies to rethink how they acquire materials for machine learning projects while potentially opening new revenue streams for content creators.

Key Points:

  • Legal Action: Proposed class-action lawsuit targets Adobe's AI training practices
  • Allegations: Claims involve unauthorized use of copyrighted books from the Books3 dataset
  • Model Impact: The SlimLM model at issue focuses on mobile document assistance
  • Industry Trend: Similar lawsuits emerging against major tech companies
  • Future Implications: Case could establish important precedents for AI development
