ArXiv Cracks Down: Researchers Banned for AI Over-Reliance

ArXiv is tightening its regulations on the use of large language models in scientific papers, implementing a new policy to ensure research integrity. Authors found with unchecked LLM-generated content, such as hallucinated references, will face a one-year ban and a requirement for prior peer-review acceptance. This initiative emphasizes authors' full responsibility for their paper's content, regardless of generation method.

Uche Emeka • AI • 1 month ago • 2 minute read •

Key Points

• ArXiv is implementing stricter policies to combat the careless application of large language models (LLMs) in submitted research papers.

• Authors will face a one-year ban from ArXiv if their submissions contain incontrovertible evidence of unchecked LLM generation, such as hallucinated references.

• The new policy emphasizes that authors must assume full responsibility for their paper's content, regardless of how it was generated by an LLM.

ArXiv Cracks Down: Researchers Banned for AI Over-Reliance

ArXiv, a widely recognized open repository for preprint scientific research, is implementing stricter measures to address the careless application of large language models (LLMs) in submitted papers. As a primary channel for research circulation in fields such as computer science and mathematics, ArXiv serves as a crucial data source for scientific trends, even though its papers are posted prior to peer review.

The organization has previously taken steps to combat the increasing influx of low-quality, AI-generated submissions, including mandating endorsements from established authors for first-time posters. Furthermore, after more than two decades under Cornell's stewardship, ArXiv is transitioning into an independent nonprofit entity, a move expected to enhance its funding capabilities to tackle challenges like AI-generated 'slop'.

In its latest initiative, Thomas Dietterich, who chairs ArXiv’s computer science section, announced a significant policy update. He stated that if a submission contains "incontrovertible evidence that the authors did not check the results of LLM generation," it undermines the trustworthiness of the entire paper. Such evidence could encompass issues like "hallucinated references" or direct comments to or from the LLM within the submission.

Authors found to be in violation of this policy will face a stringent penalty: a one-year ban from ArXiv. Following this ban, any subsequent submissions to the platform will be required to first gain acceptance from a reputable peer-reviewed venue. It is crucial to note that this policy does not constitute an outright prohibition on the use of LLMs in research. Instead, it strongly emphasizes that authors must assume "full responsibility" for the content of their papers, regardless of how that content was generated. This means researchers are accountable for any "inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content" that might be directly copied or generated by an LLM.

Dietterich elaborated that this will operate as a "one-strike" rule. However, its implementation involves a careful process: moderators must first flag the issue, and then section chairs must confirm the incontrovertible evidence before any penalty is imposed. Authors will also retain the right to appeal these decisions. This policy comes at a time when recent peer-reviewed research indicates a concerning rise in fabricated citations within biomedical research, a trend largely attributed to the use of LLMs, highlighting a broader challenge in scientific integrity.