Scientists decry conference’s use of hidden prompts to snare AI peer reviews
Organizers of a prominent neuroscience conference are facing pushback on social media after adding hidden prompts to their papers to catch peer reviewers who are using generative artificial intelligence (AI) to referee papers.
The 40th Annual Conference on Neural Information Processing Systems (NeurIPS)—which is slated to take place in Sydney, Australia, in December 2026—bans peer reviewers from uploading papers they referee to AI chatbots, as the practice breaches confidentiality. Reviewers can still use AI chatbots for background research purposes, according to the policy outlined in the conference’s handbook.
To enforce the policy and catch illicit AI use in peer review, the event’s organizers have included deliberately concealed instructions for large language models (LLMs) in papers sent out for peer review.
The instructions tell an LLM to use telltale phrases—such as “This work addresses the central challenge” and “The claims of the paper”—in a peer-review report. Some researchers have already been caught trying to sneak secret messages into their papers in a bid to game AI tools into giving them favorable referee reports. Many publishers ban the use of AI in peer review.
Multiple researchers refereeing papers for NeurIPS have taken to social media to express their concerns about the indirect prompt injections inserted into papers.
“Designing a trap that presumes bad faith corrodes the relationship the whole system depends on,” Sören Auer, a computer scientist at Leibniz University Hannover, wrote on LinkedIn. “You do not build a healthy reviewing culture by treating your reviewers as suspects.”
But others see merits in the approach. A similar prompt-injection effort has caught hundreds of reviewers misusing LLMs in submissions for next week’s 43rd International Conference on Machine Learning (ICML 2026) in Seoul, South Korea, according to Nihar Shah, a computer scientist at Carnegie Mellon University and scientific integrity chair of that conference.
In a statement to The Transmitter, the NeurIPS organizing committee says it can’t discuss injected prompts in detail “without eroding the effectiveness of this intervention.”
A uer told The Transmitter he was assigned eight NeurIPS papers to review. He says he sometimes converts PDF files into Microsoft Word documents when carrying out peer review, which renders some prompts visible.
Auer says he initially rejected the first paper he was reviewing because he thought the prompts had been inserted by the study’s authors. But he removed the flag after discovering hidden prompts in a second paper and seeing researchers discussing this issue on a Reddit thread.
It’s possible that more papers are being rejected because referees don’t know that prompts were inserted by conference organizers, he says. “I personally think it’s not good to prohibit the use of AI,” Auer adds. “We should rather, of course, have a discussion on how to use it.”
The NeurIPS committee has been replying directly to any reviewer who has noticed the hidden prompts, informing them not to penalize individual papers, according to the statement.
Like Auer, Sara Atito, an AI researcher at the University of Surrey, told The Transmitter she spotted the same prompt in all four papers she reviewed for NeurIPS. She says she also found it in the version of her own paper that NeurIPS organizers created before sending the paper out for peer review.
Atito calls hidden prompts a “poor mechanism,” arguing that it may filter out some problematic submissions but won’t solve the bigger problems with peer review. “We put too much blame on reviewers because they are the visible point of failure,” she says.
But Shah says hidden prompts are “viable and feasible.” Shah led a similar effort at ICML 2026 by injecting hidden prompts into all submitted papers.
By doing so, Shah says, he and his team identified hundreds of referees who were using AI when they weren’t supposed to, leading to their reviews being rejected. ICML 2026 desk-rejected just under 500 papers over violations of its LLM review policy—about 2 percent of the total number of submissions the conference received this year.
Researchers expressed “overwhelming support” for the strategy, says Shah, who adds that he shared the methodology with the NeurIPS team. “I have been working on conference peer review for several years, and I have hardly seen such strong support for anything,” he says. “People were really tired of reviewers copy-pasting AI-generated reviews without putting any effort.”
Related Stories
AI News
World Cup 2026: Has football fever taken hold in the US?
22 minutes ago
AI News
Cape Verde: All to know before FIFA World Cup knockout against Argentina
22 minutes ago
AI News
Medal Suite Unveiled for World Athletics U20 Championships Oregon 26
22 minutes ago
AI News
Ocean surface temperatures hit record high as world enters ‘uncharted territory,’ scientists warn
23 minutes ago
AI News
Trump accused of ‘disgusting’ greed after earning over $2bn since return to office
23 minutes ago
AI News
News Live: Watch Times Now Live Streaming | Live News India, World, Politics, Sports News
23 minutes ago
AI News
Fire risk forces St. John's to postpone Canada Day fireworks show
24 minutes ago
AI News
Experts confirm tornado touched down in Winnipeg's Whyte Ridge area Monday
24 minutes ago