Author: Denis Avetisyan
A new framework leverages the power of artificial intelligence to automatically verify research citations by deeply analyzing the supporting evidence within full-text sources.

SemanticCite employs full-text analysis and efficient language models to provide nuanced citation verification and improve research integrity.
Accurate citations are fundamental to scientific communication, yet increasingly challenged by semantic errors and the rise of AI-generated content. To address this, we introduce SemanticCite: Citation Verification with AI-Powered Full-Text Analysis and Evidence-Based Reasoning, a novel framework that automatically verifies citations by analyzing source text and providing nuanced, evidence-based assessments. Our system achieves competitive performance with significantly smaller language models, offering a scalable solution for maintaining research integrity and streamlining peer review. Will this approach foster greater trust in scientific literature and enable more effective quality control for AI-generated research?
Unraveling the Citation Labyrinth
The bedrock of credible research rests upon the meticulous verification of citations, a historically labor-intensive undertaking. For decades, scholars have painstakingly confirmed that sources cited within academic papers genuinely support the claims being made, a process demanding significant time and expertise. This manual effort involves locating the original source, comparing it to the cited material, and assessing the accuracy of the representation – ensuring that the author’s interpretation aligns with the source’s actual content. While essential for maintaining scientific integrity, this traditional approach struggles to keep pace with the exponential growth of published research, creating a critical bottleneck in the dissemination of reliable knowledge and highlighting the urgent need for innovative verification strategies.
Reliance on abstract screening for citation verification presents significant limitations in ensuring research integrity. While seemingly efficient, this method often overlooks crucial contextual details embedded within the full source material – nuances in methodology, specific data interpretations, or even retractions not yet widely disseminated. Consequently, assessments based solely on abstracts can yield inaccurate conclusions regarding the cited work’s validity or relevance, potentially perpetuating flawed research or misrepresenting an author’s intent. This is particularly problematic given the increasing complexity of scientific literature and the subtle ways in which errors or biases can manifest, demanding a more comprehensive evaluation than abstract screening alone can provide.
The relentless expansion of scholarly literature, coupled with the rising tide of AI-generated text, presents a formidable challenge to maintaining the integrity of academic citations. Previously manageable through manual review, the sheer volume of new publications now overwhelms traditional verification methods, increasing the risk of inaccurate or fabricated references slipping through the cracks. This proliferation isn’t merely quantitative; AI’s ability to generate convincingly formatted, yet entirely fictitious, citations introduces a qualitatively new level of difficulty. Consequently, there is an urgent need for automated tools capable of not only identifying potential discrepancies but also contextualizing citations within the broader research landscape, ensuring that claims are supported by genuine sources and preventing the spread of misinformation within the scientific community.

Dissecting the Source: A Full-Text Approach
SemanticCite addresses the limitations of traditional citation verification methods, which typically rely on abstract screening or limited keyword searches. Instead of these approaches, SemanticCite performs a comprehensive, full-text analysis of the cited source document. This involves processing the entirety of the source text to identify passages that directly support or contradict claims made by the citing publication. By examining the complete document content, the system aims to provide a more accurate and reliable assessment of citation validity than methods that only consider abstracts or limited metadata, enabling detection of instances where citations are misrepresented, lack supporting evidence, or are entirely fabricated.
SemanticCite categorizes the level of support a citation receives from its source document using a four-class taxonomy. Citations are classified as Supported when the source text directly confirms the claim made by the citing sentence. Partially Supported indicates the source confirms a related, but not identical, claim, or only supports a portion of the citing sentence. Unsupported denotes a clear contradiction or absence of evidence for the claim within the source. Finally, Uncertain is assigned when the source document is inaccessible, ambiguous, or lacks sufficient information to determine support, requiring manual review.
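The four-class taxonomy above maps naturally onto a small data model. The sketch below is illustrative only: the class and field names (`SupportLevel`, `CitationAssessment`, `needs_manual_review`) are hypothetical, not taken from the SemanticCite codebase.

```python
from dataclasses import dataclass, field
from enum import Enum

class SupportLevel(Enum):
    SUPPORTED = "supported"                      # source directly confirms the claim
    PARTIALLY_SUPPORTED = "partially_supported"  # related or partial confirmation only
    UNSUPPORTED = "unsupported"                  # contradiction, or no evidence found
    UNCERTAIN = "uncertain"                      # source inaccessible or ambiguous

@dataclass
class CitationAssessment:
    citing_sentence: str
    evidence_passages: list = field(default_factory=list)
    label: SupportLevel = SupportLevel.UNCERTAIN

    @property
    def needs_manual_review(self) -> bool:
        # Per the taxonomy, only the Uncertain class is routed to a human.
        return self.label is SupportLevel.UNCERTAIN
```

Keeping "Uncertain" as an explicit class, rather than silently guessing, is what lets the system route hard cases to manual review instead of overstating its confidence.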
SemanticCite’s retrieval strategy combines dense semantic search and sparse BM25 matching to maximize evidence identification within cited source documents. Dense semantic search, utilizing pre-trained language models, identifies passages with similar meaning to the citing sentence, even with lexical differences. Complementing this, BM25, a sparse retrieval method based on term frequency and inverse document frequency, efficiently identifies passages containing keywords from the citing sentence. This hybrid approach leverages the strengths of both techniques: semantic search captures nuanced relationships while BM25 provides high recall for exact matches, improving the accuracy and comprehensiveness of citation support assessment.

The Machine’s Logic: Models and Methods
SemanticCite utilizes large language models from the Qwen3 family to perform fine-grained analysis of citation contexts. These models, chosen for their performance in natural language understanding, are adapted through a fine-tuning process to specifically identify the relationships between citing and cited text. This adaptation enables SemanticCite to move beyond simple keyword matching and assess the semantic meaning of citations, allowing for a more nuanced understanding of how sources are used and referenced within academic writing. The Qwen3 models provide the foundational intelligence for interpreting citation intent and evaluating the validity of evidence presented in support of claims.
QLoRA, or Quantized Low-Rank Adaptation, is employed to mitigate the substantial computational demands typically associated with fine-tuning large language models. This technique freezes the pre-trained model weights and introduces a small number of trainable low-rank matrices to adapt the model to the specific citation context task. By quantizing the pre-trained model to 4-bit precision, QLoRA significantly reduces the memory footprint and computational resources required for training, enabling effective fine-tuning of powerful models – such as the Qwen3 series – even on hardware with limited resources. This approach maintains performance comparable to full fine-tuning while drastically decreasing the required GPU memory, facilitating research and deployment in resource-constrained environments.
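The low-rank adaptation at the heart of QLoRA is easy to show in miniature. The sketch below demonstrates only the LoRA decomposition (frozen weight plus a scaled trainable low-rank update); the 4-bit NF4 quantisation and dequantisation of the frozen weights, which QLoRA adds on top, is omitted, and all names here are illustrative rather than drawn from the paper's code.

```python
import random

def matmul(X, W):
    """Naive matrix multiply for small illustrative matrices."""
    return [[sum(x * w for x, w in zip(row, col)) for col in zip(*W)] for row in X]

class LoRALinear:
    """y = x @ W + (alpha / r) * x @ A @ B, with W frozen and only A, B trained.

    Illustrative only: real QLoRA also stores W in 4-bit precision and
    dequantises it on the fly; that step is not modelled here.
    """
    def __init__(self, W, r=2, alpha=4):
        d_in, d_out = len(W), len(W[0])
        self.W = W                       # frozen pre-trained weight
        self.scale = alpha / r
        # A starts near zero, B at exactly zero, so the adapted layer
        # initially reproduces the pre-trained layer bit for bit.
        self.A = [[random.gauss(0, 0.01) for _ in range(r)] for _ in range(d_in)]
        self.B = [[0.0] * d_out for _ in range(r)]

    def forward(self, X):
        base = matmul(X, self.W)                    # frozen path
        delta = matmul(matmul(X, self.A), self.B)   # low-rank trainable path
        return [[b + self.scale * d for b, d in zip(rb, rd)]
                for rb, rd in zip(base, delta)]
```

The memory win comes from the parameter counts: for a `d_in x d_out` weight, the adapter trains only `r * (d_in + d_out)` values, a tiny fraction of the full matrix when `r` is small.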
Weighted Accuracy serves as the primary evaluation metric for SemanticCite, addressing the limitations of standard accuracy by factoring in the degree of misclassification. The system employs a four-class taxonomy for citation assessment; misclassifications are not treated equally. Instead, penalties are applied based on the semantic distance between the predicted and actual classes, with larger penalties assigned for more significant errors. This approach provides a more granular and informative assessment of model performance than simple accuracy calculations. Utilizing the Qwen3 4B model, SemanticCite achieved a peak Weighted Accuracy of 83.64% on the evaluation dataset, demonstrating the effectiveness of this metric and the model’s ability to discern nuanced differences in citation context.
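A distance-weighted accuracy of this kind can be sketched briefly. The paper's exact penalty matrix is not reproduced here; the version below assumes an ordinal layout of the four classes and a linear penalty on ordinal distance, which is one common way to discount near-misses less than gross errors.

```python
# Assumed ordering from strongest to weakest support; the real penalty
# matrix used by SemanticCite may differ.
LABELS = ["supported", "partially_supported", "unsupported", "uncertain"]

def weighted_accuracy(y_true, y_pred, labels=LABELS):
    """Accuracy that grants partial credit 1 - |i - j| / (k - 1) for a
    prediction j ordinal steps away from the true class i."""
    k = len(labels)
    idx = {label: i for i, label in enumerate(labels)}
    credit = sum(1 - abs(idx[t] - idx[p]) / (k - 1)
                 for t, p in zip(y_true, y_pred))
    return credit / len(y_true)
```

Under this scheme, mistaking "supported" for "partially supported" costs one third of a point, while confusing the two extremes costs the full point, so a model that errs only on borderline cases scores visibly better than one that makes gross errors at the same raw accuracy.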
SemanticCite’s use of the Qwen3 4B model results in a text generation character similarity of 90.01%. This metric quantifies the overlap in characters between the generated text and the reference text, indicating a high degree of fidelity in the system’s evidence retrieval and assessment capabilities. The achieved character similarity demonstrates the model’s ability to accurately synthesize information from retrieved evidence and generate text that closely aligns with established sources, confirming the quality of its citation support process.
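The article does not specify which character-similarity formula is used, so the helper below is a plausible stand-in: Python's `difflib.SequenceMatcher.ratio`, a standard character-level overlap measure defined as twice the matched character count divided by the total length of both strings.

```python
from difflib import SequenceMatcher

def char_similarity(generated: str, reference: str) -> float:
    """Character-level similarity in [0, 1] via difflib's matching-blocks
    ratio; the paper's exact metric definition may differ."""
    return SequenceMatcher(None, generated, reference).ratio()
```

For example, `"abcd"` versus `"abce"` shares the three-character block `"abc"`, giving a ratio of 2 * 3 / 8 = 0.75.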
SemanticCite’s evidence retrieval process integrates Dense Semantic Search and BM25 Matching to enhance both the breadth and precision of identified supporting evidence. Dense Semantic Search utilizes vector embeddings to identify conceptually similar passages, while BM25 Matching, a traditional keyword-based approach, ensures recall of relevant documents based on lexical overlap. This combined approach demonstrably improves citation support assessment, maintaining a weighted accuracy of 75.15% even when utilizing the Qwen3 1.7B model, which represents a resource-constrained deployment scenario. The sustained performance indicates the robustness of the combined search strategy in delivering accurate results despite limitations in model size and computational resources.

Beyond Verification: Impact and Future Directions
The process of peer review is often hindered by the time-consuming task of verifying cited sources, a crucial step to ensure the accuracy and reliability of published research. SemanticCite addresses this bottleneck by automating the initial stages of citation verification, leveraging natural language processing to confirm that claims made in a manuscript are, in fact, supported by the cited references. This automation doesn’t replace human judgment; rather, it flags potential discrepancies, allowing reviewers to focus their expertise on assessing the validity of arguments and the novelty of findings. By reducing the burden of tedious fact-checking, SemanticCite promises to accelerate the peer review process, potentially shortening publication timelines and increasing the overall efficiency of scholarly communication, ultimately benefiting both researchers and the public who rely on vetted scientific information.
Beyond its application in academic literature, SemanticCite presents a novel solution for evaluating the veracity of claims made within AI-generated content. As large language models become increasingly integrated into information dissemination, the potential for fabricated or unsupported statements rises significantly. This framework offers a means to automatically cross-reference assertions made by these models with supporting evidence, flagging inconsistencies or a lack of substantiation. By verifying the factual basis of AI outputs, SemanticCite addresses a critical need for trust and reliability in a landscape where distinguishing between genuine and artificial information becomes ever more challenging, ultimately bolstering the responsible deployment of artificial intelligence technologies.
SemanticCite’s developers recognized that maximizing the impact of their citation verification framework required more than just functional efficacy; it demanded broad accessibility and collaborative refinement. Consequently, the framework has been released as an open-source resource, freely available to researchers, developers, and institutions worldwide. This deliberate choice fosters reproducibility by allowing independent verification of the framework’s performance and encourages community-driven improvements, adapting SemanticCite to a wider range of research contexts and accelerating innovation in automated citation analysis. By removing barriers to entry and promoting collaborative development, the open-source release aims to establish SemanticCite as a standard tool for ensuring the integrity and reliability of scholarly communication.
Ongoing development of SemanticCite prioritizes addressing the nuances of scholarly communication beyond simple citation matching. Future iterations aim to decipher more intricate citation relationships, such as those involving rebuttals, extensions, or contrasting viewpoints, thereby offering a richer understanding of the research landscape. The framework’s expansion will also encompass a broader range of academic disciplines, moving past the current focus on computer science to incorporate the unique citation styles and complexities of fields like medicine, law, and the humanities. This interdisciplinary approach will necessitate the incorporation of domain-specific knowledge and the development of adaptable algorithms capable of accurately interpreting varied research methodologies and reporting standards, ultimately bolstering the reliability of automated citation verification across all areas of scholarly inquiry.
The pursuit of automated citation verification, as demonstrated by SemanticCite, inherently involves a controlled dismantling of established assumptions. The system doesn’t merely accept citations at face value; it actively probes for supporting evidence within full-text sources, effectively testing the boundaries of claimed knowledge. This echoes Brian Kernighan’s sentiment: “Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it.” SemanticCite, in its rigorous analysis, ‘debugs’ the scholarly landscape, identifying discrepancies and ensuring the integrity of research by meticulously verifying each claim against its source material. The framework’s ability to achieve competitive results with smaller models further highlights the efficiency gained through this deliberate, analytical approach.
What Lies Ahead?
SemanticCite represents a necessary, if incremental, step towards treating the scholarly record with the scrutiny it deserves. The system functions, demonstrably, but the true challenge isn’t merely detecting unsupported claims; it’s understanding why such claims proliferate. This framework, like any attempt to automate reasoning, operates on the assumption that the ‘code’ governing scientific discourse is legible, that patterns of support and contradiction can be reliably extracted. The persistent presence of false positives and negatives suggests that the code remains largely obfuscated, or perhaps, that the language of science isn’t entirely rule-based.
Future iterations should prioritize not simply verification, but the identification of how a citation fails: is it a misinterpretation, a fabrication, or a genuine ambiguity in the source material? Furthermore, the reliance on full-text analysis, while powerful, introduces vulnerabilities to manipulated or AI-generated content. The system essentially trusts the text it’s analyzing, a precarious position when the very nature of truth is under assault.
Ultimately, this work underscores a fundamental truth: reality is open source, and the principles governing it are there to be discovered. But reading the code requires more than computational power; it demands a deeper understanding of the biases, assumptions, and inherent uncertainties embedded within the human systems that generate knowledge. The next phase isn’t about building better fact-checkers; it’s about reverse-engineering the motivations behind the ‘bugs’ in the system.
Original article: https://arxiv.org/pdf/2511.16198.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/