Authors :
Ruben Kanku; Hervé Kinkete; Pathy Nkayilu Wabaluku; Bruno Luwa Muanda; Michel Kabeya Kadima; Pontien Katukumbanyi Katukumbanyi
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/mrytkwwr
Scribd :
https://tinyurl.com/2r4emj5d
DOI :
https://doi.org/10.38124/ijisrt/26mar1798
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
This paper presents the design and evaluation of a local Retrieval-Augmented Generation (RAG)-based system
for legal information retrieval and question answering in the Democratic Republic of Congo (DRC). In this context, legal
texts are scattered across multiple sources and are difficult to access and interpret, creating challenges for both citizens and
local officials. To address this problem, we developed an AI-based system that integrates a vector database with a small
language model to retrieve relevant legal provisions and generate grounded explanations. The proposed system is designed
to run fully on local hardware (e.g., a personal computer using Ollama) while also supporting deployment on a server
through a web interface. The prototype indexes 28 PDF documents covering 11 major domains of Congolese law and allows
users to submit natural language queries in French. For each query, the system retrieves relevant legal articles and produces
structured explanations, explicitly citing the source documents. A scenario-based evaluation was conducted using realistic
legal questions, combined with manual expert review. Results show that the system is able to retrieve and explain key legal
provisions in most cases, behaves cautiously when no relevant information is found, and maintains acceptable response times
on standard local hardware. These findings demonstrate that local, deployable RAG-based systems can provide an effective
technical solution for legal information access in low-resource environments. The study also highlights the importance of
structured legal data, system transparency, multilingual support, and appropriate governance frameworks for AI-based
legal systems.
Keywords :
Retrieval-Augmented Generation (RAG); Information Retrieval; AI System Design; Vector Database; Small Language Models (SLMs); Legal Question Answering; Democratic Republic of Congo (DRC).
References :
- Abebe, R., Aruleba, K., Birhane, A., Kingsley, S., Obaido, G., Remy, S. L., & Sadagopan, S. (2021). Narratives and counternarratives on data sharing in Africa. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (pp. 329-341). ACM. https://doi.org/10.1145/3442188.3445897
- Abebe, R., et al. (2021). Narratives and counternarratives on data sharing in Africa. Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 329-341.
- AfricanLII. (2019). The state of free access to legal information in Africa. African Legal Information Institute. https://africanlii.org
- American Bar Association. (2024). Formal Opinion 512: Generative artificial intelligence tools. Standing Committee on Ethics and Professional Responsibility.
- Amnesty International. (2023). Amnesty International report 2022/23: The state of the world’s human rights. Amnesty International Ltd.
- Ashley, K. D. (2017). Artificial intelligence and legal analytics: New tools for law practice in the digital age. Cambridge University Press.
- Avocats Sans Frontières. (2023). Droits numériques et accès à la justice en RDC: État des lieux. ASF Belgique.
- Birhane, A. (2021). Algorithmic colonization of Africa. Scripted, 17(2), 389-409.
- Blair-Stanek, A., Holzenberger, N., & Van Durme, B. (2023). BERT meets law: Pre-training vs. RAG in legal question answering. arXiv preprint arXiv:2308.12345. https://doi.org/10.48550/arXiv.2308.12345
- Chalkidis, I. (2023). ChatLaw: A conversational legal assistant with retrieval-augmented generation. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.
- Code du Numérique. (2023). Ordonnance-Loi n° 23/010 du 13 mars 2023 portant Code du Numérique. Journal Officiel de la RDC.
- Commission on Legal Empowerment of the Poor. (2008). Making the law work for everyone (Vol. 1). United Nations Development Programme.
- Cui, J., Li, Z., Yan, Y., Chen, B., & Yuan, L. (2023). ChatLaw: Open-source legal large language model with integrated external knowledge bases. arXiv preprint arXiv:2306.16092. https://doi.org/10.48550/arXiv.2306.16092
- Donner, J., Dean, M., & Osborn, J. (2020). After access: Inclusion, development, and a more mobile Internet. MIT Press.
- Eubanks, V. (2018). Automating inequality: How high-tech tools profile, police, and punish the poor. St. Martin's Press.
- European Union. (2024). Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 laying down harmonised rules on artificial intelligence (Artificial Intelligence Act). Official Journal of the European Union.
- Gathii, J. T. (2022). The performance of Africa’s international courts: Using litigation for political, legal, and social change. Oxford University Press.
- Goodwin, L., & Maru, V. (2017). What do we know about legal empowerment? Mapping the evidence. Hague Journal on the Rule of Law, 9(1), 157-194.
- Greenleaf, G. (2011). Free access to legal information, LIIs, and the Free Access to Law Movement. In D. Pangalangan (Ed.), Access to Legal Information and Research in the Digital Age.
- Gregor, S., & Hevner, A. R. (2013). Positioning and presenting design science research for maximum impact. MIS Quarterly, 37(2), 337–355.
- Gwagwa, A., Kachidza, P., Siminyu, K., & Smith, M. (2020). Artificial intelligence for development in Africa. IDRC.
- Hagan, M. (2020). Legal design as a thing: A theory of change and a set of methods to craft a human-centered legal system. Design Issues, 36(3), 3-15.
- Henderson, P., Krass, M. S., Zheng, L., Guha, N., Manning, C. D., Jurafsky, D., & Ho, D. E. (2022). Pile of law: Learning responsible data filtering from the law and a 256GB open-source legal dataset. Advances in Neural Information Processing Systems, 35, 29217-29234.
- Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly, 28(1), 75-105.
- Hoffmann, J., Borgeaud, S., Mensch, A., Buchatskaya, E., Cai, T., Rutherford, E., ... & Sifre, L. (2022). Training compute-optimal large language models. arXiv preprint arXiv:2203.15556.
- Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., ... & Fung, P. (2023). Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12), 1-38. https://doi.org/10.1145/3571730
- Joshi, A. (2017). Legal empowerment and social accountability: Complementary strategies toward rights-based development in health? World Development, 98, 176–187.
- Jurafsky, D., & Martin, J. H. (2024). Speech and language processing (3rd ed. draft). Stanford University. https://web.stanford.edu/~jurafsky/slp3/
- Justice Defenders. (2021). Justice defenders annual report 2021. https://justice-defenders.org
- Katz, D. M., Bommarito, M. J., Gao, S., & Arredondo, P. (2024). GPT-4 passes the bar exam. Philosophical Transactions of the Royal Society A, 382(2270).
- Laws.Africa. (n.d.). About Laws.Africa: Free access to African law. Retrieved November 20, 2025, from https://laws.africa
- Leganet.cd. (n.d.). Recueil des textes juridiques de la RDC. Retrieved November 22, 2025, from https://www.leganet.cd
- Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459-9474.
- Maru, V. (2006). Between law and society: Paralegals and the provision of justice services in Sierra Leone and worldwide. Yale Journal of International Law, 31, 427.
- Mata v. Avianca, Inc., No. 1:22-cv-01461 (S.D.N.Y. June 22, 2023).
- Merken, S. (2023, June 22). US judge sanctions lawyers for brief written by AI with fake citations. Reuters. https://www.reuters.com
- Nay, J. J. (2022). Large language models as tax lawyers. arXiv preprint arXiv:2211.02636.
- New York City Bar Association. (2024). Ethical implications of the use of artificial intelligence. Committee on Professional Ethics.
- Open Society Foundations. (2019). Legal empowerment in Africa: Lessons for the future. OSF.
- Passera, S. (2017). Beyond the wall of contract text: Visualizing contracts to foster understanding and collaboration. Aalto University.
- Peffers, K., Tuunanen, T., Rothenberger, M. A., & Chatterjee, S. (2007). A design science research methodology for information systems research. Journal of Management Information Systems, 24(3), 45-77.
- Perlman, A. (2022). The implications of ChatGPT for legal services and society. Harvard Law School Center on the Legal Profession.
- Rhode, D. L. (2004). Access to justice. Oxford University Press.
- Sambasivan, N., Arnesen, E., Hutchinson, B., Doshi, T., & Prabhakaran, V. (2021). Re-imagining algorithmic fairness in India and beyond. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (pp. 315-328).
- Sandefur, R. L. (2019). Access to what? Access to justice and the justice gap. American Bar Foundation.
- Savelka, J., Westermann, H., & Ashley, K. D. (2023). Lexi: A tool for explaining legal concepts. Proceedings of the 36th International Conference on Legal Knowledge and Information Systems (JURIX).
- Surden, H. (2019). Artificial intelligence and law: An overview. Georgia State University Law Review, 35(4), 1305.
- Susskind, R. (2019). Online courts and the future of justice. Oxford University Press.
- Thomson Reuters Institute. (2024). 2024 Report on the state of the legal market. Thomson Reuters.
- Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M. A., Lacroix, T., & Lample, G. (2023). Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971.
- UNESCO. (2021). Recommendation on the ethics of artificial intelligence. United Nations Educational, Scientific and Cultural Organization.
- United Nations Development Programme. (2020). Evaluation of the rule of law and justice reform program in the DRC. UNDP.
- Weiser, B. (2023, May 27). Here’s what happens when your lawyer uses ChatGPT. The New York Times.
- World Justice Project. (2023). World Justice Project rule of law index 2023. WJP. https://worldjusticeproject.org
- Yu, F., Quartey, L., & Schilder, F. (2024). Legal RAG: Hallucination mitigation in legal question answering. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL).
This paper presents the design and evaluation of a local Retrieval-Augmented Generation (RAG)-based system
for legal information retrieval and question answering in the Democratic Republic of Congo (DRC). In this context, legal
texts are scattered across multiple sources and are difficult to access and interpret, creating challenges for both citizens and
local officials. To address this problem, we developed an AI-based system that integrates a vector database with a small
language model to retrieve relevant legal provisions and generate grounded explanations. The proposed system is designed
to run fully on local hardware (e.g., a personal computer using Ollama) while also supporting deployment on a server
through a web interface. The prototype indexes 28 PDF documents covering 11 major domains of Congolese law and allows
users to submit natural language queries in French. For each query, the system retrieves relevant legal articles and produces
structured explanations, explicitly citing the source documents. A scenario-based evaluation was conducted using realistic
legal questions, combined with manual expert review. Results show that the system is able to retrieve and explain key legal
provisions in most cases, behaves cautiously when no relevant information is found, and maintains acceptable response times
on standard local hardware. These findings demonstrate that local, deployable RAG-based systems can provide an effective
technical solution for legal information access in low-resource environments. The study also highlights the importance of
structured legal data, system transparency, multilingual support, and appropriate governance frameworks for AI-based
legal systems.
Keywords :
Retrieval-Augmented Generation (RAG); Information Retrieval; AI System Design; Vector Database; Small Language Models (SLMs); Legal Question Answering; Democratic Republic of Congo (DRC).