Skip to main content Skip to navigation

Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks

Project Overview

The document examines the utilization of a generative AI framework in educational chatbot systems designed to assist students preparing for the Graduate Aptitude Test in Engineering (GATE). It emphasizes the integration of large language models (LLMs) and Retrieval-Augmented Generation (RAG) techniques to enhance the chatbot's ability to provide accurate answers and contextual explanations. Through comprehensive evaluation using various performance metrics, the framework demonstrated notable improvements in response quality and retrieval accuracy, showcasing its effectiveness in educational settings. Additionally, the document addresses challenges encountered during implementation, including data processing issues and inherent model limitations, highlighting the potential of generative AI to transform learning experiences by offering tailored support and fostering better understanding among students. The findings suggest that such AI-driven tools can significantly enhance the educational landscape, providing personalized assistance and contributing to more effective exam preparation strategies.

Key Applications

GATE question-answering framework using LLMs and RAG

Context: This framework is designed for students preparing for the GATE exam, providing explanations for solutions and assisting in study preparation.

Implementation: The framework integrates LLMs and embedding models to retrieve and explain GATE solutions. It utilizes a two-stage pipeline for retrieving relevant data and generating context-aware responses.

Outcomes: The framework demonstrated improved retrieval accuracy and response quality, enhancing students' learning efficiency and reducing the time needed for information access.

Challenges: Limitations included data processing complexities, model limitations in retaining infrequent information, and the need for ongoing updates to the models.

Implementation Barriers

Technical Barrier

Challenges in data processing, particularly in extracting complex mathematical content from PDFs.

Proposed Solutions: Explored various data extraction tools and techniques, ultimately identifying Mathpix as a reliable solution for intricate equations.

Model Limitations

LLMs can only provide information based on the training data available at the time and may struggle with infrequent data.

Proposed Solutions: Utilizing knowledge grounding through RAG to improve accuracy and reduce hallucinations.

Project Team

Umar Ali Khan

Researcher

Ekram Khan

Researcher

Fiza Khan

Researcher

Athar Ali Moinuddin

Researcher

Contact Information

For information about the paper, please contact the authors.

Authors: Umar Ali Khan, Ekram Khan, Fiza Khan, Athar Ali Moinuddin

Source Publication: View Original PaperLink opens in a new window

Project Contact: Dr. Jianhua Yang

LLM Model Version: gpt-4o-mini-2024-07-18

Analysis Provider: Openai

Let us know you agree to cookies