Adobe Under Fire: Class-Action Lawsuit Alleges Misuse of Authors' Work for AI Training

Adobe, a prominent technology company, is facing a proposed class-action lawsuit alleging that it utilized pirated books, including copyrighted works by author Elizabeth Lyon, to train its artificial intelligence model, SlimLM. The lawsuit, filed on behalf of Lyon, claims that Adobe's small language model, designed for document assistance tasks on mobile devices, was pre-trained on SlimPajama-627B. This dataset, described by Adobe as a "deduplicated, multi-corpora, open-source dataset," was released by Cerebras in June 2023.
According to Lyon, who specializes in non-fiction writing guidebooks, some of her copyrighted works were incorporated into a pretraining dataset used by Adobe. The lawsuit, initially reported by Reuters, asserts that Lyon's writing was part of a processed subset of a manipulated dataset that formed the foundation of Adobe's program. Specifically, it states, "The SlimPajama dataset was created by copying and manipulating the RedPajama dataset (including copying Books3). Thus, because it is a derivative copy of the RedPajama dataset, SlimPajama contains the Books3 dataset, including the copyrighted works of Plaintiff and the Class members."
"Books3," a vast collection comprising 191,000 books, has become a recurring point of legal contention within the tech community due to its alleged use in training generative AI systems. Similarly, the RedPajama dataset has been implicated in multiple litigations. This legal challenge against Adobe is part of a growing trend of copyright infringement lawsuits targeting the tech industry's use of massive datasets for AI training, many of which allegedly contain pirated materials.
The issue of copyrighted content in AI training data has led to numerous legal battles. For instance, in September, Apple faced a lawsuit claiming it used copyrighted material to train its Apple Intelligence model, specifically mentioning the RedPajama dataset and accusing the company of copying protected works without consent or compensation. A similar lawsuit was filed against Salesforce in October, also citing the use of RedPajama for training purposes. These cases highlight a pervasive challenge for the tech industry, as AI algorithms rely on extensive datasets, and the provenance of some of these materials is increasingly being scrutinized.
A notable precedent occurred in September when Anthropic agreed to a $1.5 billion settlement with several authors who had accused the company of using pirated versions of their work to train its chatbot, Claude. This settlement was widely regarded as a significant development in the ongoing legal discourse surrounding copyrighted material in AI training data, underscoring the legal and ethical complexities inherent in the development and deployment of advanced AI technologies.
You may also like...
WNBA Forges New Path: Historic CBA Ratified, Set to Transform League and Player Fortunes

The WNBA embarks on a new era following the unanimous ratification of a seven-year collective bargaining agreement, prom...
Salah's Next Chapter: Liverpool Legend's Legacy Debated as Transfer Saga Heats Up

Mohamed Salah is set to leave Liverpool at the end of the season, marking the conclusion of an extraordinary era at Anfi...
Ryan Gosling Teams Up With 'Everything Everywhere All at Once' Directors Daniels for Major Sci-Fi Film

Ryan Gosling is confirmed to star in the highly anticipated new film from The Daniels, the acclaimed directors behind <i...
Netflix Unleashes Mind-Bending Horror Sensation 'Something Very Bad Is Going to Happen', Capturing Binge Watchers

Netflix's new horror series 'Something Very Bad Is Going to Happen,' created by Haley Z. Boston, explores the deep-seate...
iHeartRadio Music Awards 2026: Taylor Swift Reigns, Miley Cyrus Innovates, & A Star-Studded Debut Rocks The Red Carpet!

The 2026 iHeartRadio Music Awards celebrated major artists, with Miley Cyrus receiving the Innovator Award for her fearl...
BTS Takes Over 'Tonight Show' With '2.0' Performance, Sparks Fan Frenzy For Free Stream!

K-pop superstars BTS made a grand return with their new album ARIRANG, shattering streaming records and dominating globa...
Crunchyroll's 12-Part Anime Sensation: Proves Manga's Power and Teases Season 2!

The recent anime adaptations of "Hana-Kimi" and "Tamon's B-Side" showcase the intricate art of bringing beloved manga to...
NCIS Hits Explosive 500 Episodes: Stars Unveil Secrets to Enduring Success!

NCIS celebrates its 500th episode, with actors Wilmer Valderrama and Emily Wickersham and showrunner Steven D. Binder di...





