Reddit Unleashes Legal Fury: AI Company Perplexity Sued for 'Industrial-Scale' Scraping of User Data
Social media platform Reddit has initiated a lawsuit against artificial intelligence company Perplexity AI and three other entities, alleging their involvement in an “industrial-scale, unlawful” economy to “scrape” the comments of millions of Reddit users for commercial gain. Filed in a New York federal court, Reddit’s lawsuit specifically targets San Francisco-based Perplexity, which produces an AI chatbot and an “answer engine” competing with services like Google and ChatGPT in online search. Also named in the suit are Lithuanian data-scraping company Oxylabs UAB, a web domain known as AWMProxy, which Reddit describes as a “former Russian botnet,” and Texas-based startup SerpApi, which reportedly lists Perplexity as a customer on its website.
This legal action marks the second such lawsuit filed by Reddit against a major AI company, following a previous suit against Anthropic in June. However, the current lawsuit is notable for confronting not only an AI company but also the less visible services within the AI industry that are relied upon to acquire online writings for training AI chatbots. Ben Lee, Reddit’s chief legal officer, stated that “Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material,” adding that “Reddit is a prime target because it’s one of the largest and most dynamic collections of human conversation ever created.”
The lawsuit alleges unfair competition and unjust enrichment against the involved companies, and further claims that some of them have violated U.S. copyright laws. Perplexity, in its initial response, stated it had not yet received the lawsuit but affirmed its commitment to “always fight vigorously for users’ rights to freely and fairly access public knowledge,” emphasizing its “principled and responsible” approach to providing factual answers with accurate AI. Ryan Schafer, customer success director for SerpApi, expressed strong disagreement with Reddit’s allegations and an intent to vigorously defend the company in court. Oxylabs, through its chief governance and strategy officer Denas Grybauskas, conveyed that the company was “shocked and disappointed” and would “not hesitate to defend itself against these allegations,” further stating that “no company should claim ownership of public data that does not belong to them,” suggesting the lawsuit might be “just an attempt to sell the same public data at an inflated price.” AWMProxy could not be reached for comment.
Reddit characterizes the companies it is suing as akin to “would-be bank robbers” who, unable to access a bank vault, instead break into an armored truck. The lawsuit contends that these entities are not only evading Reddit’s own anti-scraping measures but are also “circumventing Google’s controls and scraping Reddit content directly from Google’s search engine results.” Lee further elaborated that because direct scraping of Reddit is difficult, the defendants “mask their identities, hide their locations, and disguise their web scrapers to steal Reddit content from Google Search.” He specifically accused Perplexity of being a “willing customer of at least one of these scrapers, choosing to buy stolen data rather than enter into a lawful agreement with Reddit itself.” This argument parallels Reddit’s claims in its lawsuit against Anthropic, where it alleged the company ignored requests to cease using its content; that case was moved to federal court and has a hearing scheduled for January.
Websites such as Wikipedia and Reddit, along with digitized books and news articles, represent deep troves of written materials essential for teaching AI assistants the intricate patterns of human language. Recognizing the value of this data, Reddit has previously established licensing agreements with major AI companies, including Google and OpenAI. These agreements involve payments for the right to train AI systems on the vast public commentary contributed by Reddit’s more than 100 million daily users. Such licensing deals were crucial in helping the 20-year-old online platform raise capital in anticipation of its successful Wall Street debut as a publicly traded company last year.
You may also like...
Consumerism vs Culture: Is Africa Trading Values for Trendy Lifestyles?
Is Africa trading its cultural values for trendy lifestyles? Explore how consumerism, foreign brands, and social media p...
The War on Boys: Are African Male Being Left Behind in Gender Conversations
Why are African boys and men often left out of gender empowerment programs? Explore how emotional suppression, lack of m...
Pay Slip, Motivation Slips: The Silent Crisis Among the Working Class
Across Nigeria, millions of workers are trapped in jobs that pay just enough to survive but too little to live. Beneath ...
Premier League's Unsung Heroes: Bournemouth, Sunderland, and Tottenham Shockingly Exceed Expectations

This Premier League season sees teams like Bournemouth, Sunderland, and Tottenham exceeding expectations. Under Thomas F...
El Clasico Fury: Yamal Controversy and Refereeing Blunders Ignite Post-Match Debates
)
Real Madrid secured a 2-1 El Clasico victory over Barcelona amidst significant controversy surrounding a late penalty de...
Kate Hudson & Hugh Jackman Eyeing Oscars with ‘Song Sung Blue’ Breakthrough

"Song Sung Blue" tells the true story of Mike and Claire Sarina, who find love and acclaim as a Neil Diamond tribute ban...
Pennywise Unleashes Horror: 'It: Welcome to Derry' Premiere Shocks Viewers

HBO Max's "IT: Welcome to Derry" plunges audiences into 1962 Derry, Maine, exploring the origins of Pennywise with a bru...
Zimbabwe's Ammara Brown Stages 'Spectacular' Comeback, New Album Imminent!

Ammara Brown triumphantly returned to the music scene at the Castle Lager Braai Festival after a year-long maternity bre...



