Tech

Hacker News user alleges AI-driven content scraping undermines original creators

A recent critique on Hacker News argues that artificial intelligence companies engage in large-scale unauthorised plagiarism by training on data without consent and selling outputs without compensating original authors.

Author

Owen Mercer

Markets and Finance Editor

Published

Draft

Source: Hacker News · original

Artificial Intelligence Media Research

Related coverage

Explore Artificial Intelligence coverage Explore Media coverage Explore Research coverage More from the Tech desk

Tech

No image available

E-commerce tutorial writer claims search algorithms favour AI-generated copies over source material

A post on Hacker News has reignited debate over the ethics of artificial intelligence training data, with a user describing the current industry model as "unauthorised plagiarism at a bigger scale." The author, who specialises in writing e-commerce tutorials, argues that AI companies ingest internet data without seeking author consent and subsequently sell the resulting outputs without providing compensation to the original creators.

The user details a specific instance of alleged infringement involving competitor websites. After noticing that other sites had replicated their e-commerce tutorials, the author identified the source of the content by observing that the copied articles retained exact link text pointing back to their original website. The user asserts that these copycat authors did not bother to remove the attribution, a oversight that revealed the automated nature of the content generation.

Frustration was directed at search engine dynamics, with the user claiming that Google’s algorithm ranked these unoriginal, AI-generated pages higher than their original work. The post characterises the business model of AI providers as "lazy and greedy," citing the lack of revenue sharing for content creators and the resale of processed results to third parties who then monetise the generated text.

The incident highlights broader tensions between content creators and AI developers regarding consent, attribution, and the definition of fair use. While the user attributes the replication to AI tools, the broader context suggests ongoing uncertainty about whether training on public internet data constitutes infringement or if the copied content was manually replicated by human operators.

This critique emerges against a backdrop of shifting legal and personnel landscapes within the artificial intelligence sector. Recent developments, including the dismissal of a lawsuit involving key industry figures and notable researcher movements between major labs, underscore the complex regulatory environment in which these technologies operate.

The debate underscores the challenges faced by individual creators in a digital ecosystem where algorithmic prioritisation can inadvertently boost low-quality or duplicated content. As AI capabilities expand, the friction between automated data processing and intellectual property rights is likely to remain a central issue for investors and policymakers monitoring the media and technology sectors.

Hacker News user alleges AI-driven content scraping undermines original creators

More from Tech

Apple to roll out manual EQ controls for AirPods in iOS 27 update

Apple rolls out visionOS 27, integrating AI-driven Siri into Vision Pro headset

Apple Overhauls Siri with Google Gemini Partnership and Standalone App at WWDC 2026