RAG-Retrieval is an open-source framework for building and training retrieval systems used in retrieval-augmented generation pipelines. Retrieval-augmented generation combines large language models with external knowledge retrieval to improve factual accuracy and domain-specific reasoning. This repository provides end-to-end infrastructure for training retrieval models, performing inference, and distilling embedding models for improved performance. It includes implementations of modern embedding architectures designed to map documents and queries into vector spaces for efficient similarity search. The framework also supports reranking models that refine retrieved results using large language models or lightweight transformer architectures. Additional training techniques such as preference-based supervised fine-tuning and embedding distillation are included to improve retrieval quality.

Features

  • End-to-end training pipeline for retrieval-augmented generation systems
  • Embedding model training for vector-based document retrieval
  • Support for reranker models that improve retrieval relevance
  • Preference-based supervised fine-tuning for retrieval optimization
  • Embedding model distillation for efficient deployment
  • Integration with large language models for RAG pipelines

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow RAG-Retrieval

RAG-Retrieval Web Site

Other Useful Business Software
Collect! is a highly configurable debt collection software Icon
Collect! is a highly configurable debt collection software

Everything that matters to debt collection, all in one solution.

The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of RAG-Retrieval!

Additional Project Details

Programming Language

Python

Related Categories

Python Large Language Models (LLM)

Registered

2026-03-09