Peer-reviewed by human experts: AI failed in key steps to generate a scoping review on the neural mechanisms of cross-education
- Open Access
- 24.12.2025
- Invited Review
Abstract
Background
The prompt
Box 1
-
Generate a comprehensive review that maps the current evidence, identifies major gaps and limitations, and outlines potential directions for future research on the neural adaptations that mediate cross-education of strength.
-
Focus on accuracy, take as much time as you need to.
-
Neural mechanisms mediating cross-education of voluntary force: a scoping review mapping main gaps, limitations and future directions for research
-
Cross education
-
Interlimb transfer
-
Cross-training
-
Contralateral training effect
-
Contralateral strength training effect
-
Contralateral effect
-
Use only peer-reviewed scientific articles sourced from PubMed, Google Scholar, Scopus, and Web of Science.
-
Ensure that the review captures the most up-to-date and relevant studies, using the synonyms provided to broaden
-
Abstract: A succinct summary of the review’s scope, key findings, and implications.
-
Introduction: Overview of the phenomenon of cross-education, its significance in physiology, and the rationale for a scoping review.
-
Methodology: Detailed description of the search strategy, including databases used, search terms (including all synonyms), inclusion/exclusion criteria, and any quality assessment methods.
-
Results: Comprehensive mapping of the current evidence. Summarize key findings on the neural mechanisms involved in cross-education.
-
Discussion: Critical analysis of the identified gaps and limitations in current literature. Discuss discrepancies, methodological constraints, and areas where evidence is lacking.
-
Future Directions: Propose specific research directions and methodological improvements that could address the identified gaps.
-
Conclusion: Summarize the major insights and the implications of the findings for future research.
-
Use an academic and analytical tone. Provide a balanced discussion supported by recent and high-quality scientific evidence. Ensure clarity, precision, and appropriate citation of all sources.
-
Be critical and integrative in your synthesis of literature. Adhere strictly to the scientific rigor expected in a peer-reviewed review.
Peer-reviewing the LLM-generated article
Section of AI-Review | Reviewer 1 | Reviewer 2 | Reviewer 3 | Reviewer 4 |
|---|---|---|---|---|
Abstract | Content is correct but has grammatical and phrasing errors. | Too many abbreviations. | Doesn’t resemble a scoping review abstract (lacks study counts, evidence quality). | Readable but too many unexplained abbreviations |
Reports underexplored findings as consolidated knowledge. | Inaccurate references to neurophysiological protocols. | Lacks hierarchy; minor findings are given the same weight as major ones. | ||
Insufficiently detailed to guide future research. | ||||
Introduction | Comprehensive, but citations are problematic and don’t always support the statements. Warrants careful checking of all references. | Suboptimal or inappropriate referencing (relies on reviews, not primary sources). | Fails to detail training methods. | References are not original sources. |
Uses single references for long statements. | ||||
Lacks specificity (e.g., doesn’t specify upper/lower limbs). | Misses key papers from highly reputable, subscription-based journals. | Cites only open-access journals, missing seminal studies from subscription-based ones. | ||
Rationale & Objectives | Aims are disconnected from the review’s practical application focus. | Strongly criticizes this section. | No negative or positive remarks on this subsection | Critical failure: Did not articulate specific research questions, a defining stage of scoping reviews. |
Aims are overly broad, vague, and disconnected from the background. | ||||
Fails to define key terms like "controversies." | ||||
Methods | References cited to support the methodology are unsuitable. | Incorrect or inappropriately applied references. | Falsely claims adherence to PRISMA-ScR guidelines. | No PRISMA flow chart. |
Doesn’t adhere to seminal scoping review frameworks. | Contradicts its own criteria by relying on secondary sources (reviews) while claiming to exclude them. | Lacks transparency; makes false claims about database searches and reviewer calibration. | ||
Arbitrary literature cut-off date, missing recent key studies. | Suggests a major flaw is the AI’s inability to access pay-walled academic databases. | |||
Results | Fails to report key data (e.g., number of studies). | Lacks detail, structure, and a PRISMA flow diagram. | Notes the AI provides a “template” for a review ("insert diagram here"). | Includes irrelevant studies (motor skills instead of force). |
Raises concerns of plagiarism/copyright infringement. | Mixes results with discussion and interpretation, which is inappropriate. | Referencing is mostly inadequate or wrong. | Total absence of quantitative data (e.g., number of studies per technique). | |
Praised some summaries (e.g., on TMS) but notes the AI used a preliminary, uncorrected version of a cited paper. | Contains blatant plagiarism (e.g., "insert flow diagram here"). | Attributes findings to incorrect sources. | Mixes results with discussion, making it hard to follow. | |
No summary tables to structure the findings. | Elaborates on topics using studies unrelated to cross-education. | |||
Discussion | Highly repetitive and redundant. | Repetitive and disconnected from the results; introduces new topics not previously covered. | Repetitive yet did a decent job identifying some key research gaps. | Repetitive; issues stem from poor objectives and results. |
Discusses methodological limitations but fails to connect them to the review’s focus on neural adaptations. | Tone is interpretive, not descriptive as required for a scoping review. | Noted the use of generic, non-substantive statements that could apply to any research field. | Disconnected from the evidence presented. | |
Recommendations are overly ambitious and impractical. | Lack of hierarchization; minor/inconsistent findings are given disproportionate coverage. | |||
Structure lacks coherence. | ||||
References | Countless inaccuracies and errors in the background and, overall, in the whole manuscript. | All references need to be reviewed for accuracy and position within the text, demanding hours and hours of human time. | Numerous issues with accuracy, duplication of the same source that was retrieved from different platforms (e.g., pre-print server, journal source, repositories). Reliance on review articles | Critical failure: inaccuracies permeating the whole bibliography, including all the call-outs within the text, threatening the review’s trustworthiness and reliability. The LLM is below-the-bar in managing scientific bibliography. |