International Journal of Innovative Science and Research Technology

LegalMind: A Multi-Agent Legal Reasoning Framework Leveraging Fine-Tuning of Large Language Models and Retrieval-Augmented Generation to Reduce Hallucinations

Authors : Soham Sachin Shelar; Dr. Manisha Bharati

Volume/Issue : Volume 11 - 2026, Issue 5 - May

Google Scholar : https://tinyurl.com/55f6drj9

Scribd : https://tinyurl.com/yfrhvcss

DOI : https://doi.org/10.38124/ijisrt/26May1714

Abstract : The Indian legal system generates an enormous volume of judgments every year across thousands of courts, yet access to structured legal research remains limited for a large portion of the population. Existing large language models (LLMs), while capable of fluent natural language generation, are prone to hallucination: they fabricate statutory provisions, invent case citations, and produce reasoning that is not grounded in actual evidence. These failures are especially dangerous in the legal domain, where an incorrect citation can invalidate an entire argument. This paper presents LegalMind AI, a multi-agent legal reasoning system designed specifically for Indian law.

Keywords : Multi-Agent AI; Legal Reasoning; Retrieval-Augmented Generation; QLoRA Fine-Tuning; Indian Law; Natural Language Inference; Hallucination Detection; Mistral-7B; FAISS; IL-TUR Dataset; DeBERTa; Streamlit.
