Re-Ranker Module

ColBERT Framework

Source: https://www.semanticscholar.org/paper/ColBERT%3A-Efficient-and-Effective-Passage-Search-via-Khattab-Zaharia/60b8ad6177230ad5402af409a6edb5af441baeb4

This directory contains code for retrieving data from Solr, Watson Discovery, and Elasticsearch and applying a re-ranking algorithm to improve the relevance of the search results. The examples use the ColBERT-based DrDecr re-ranker model, but you can replace it with any other re-ranking model, such as a perplexity-based ranking model.
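To make the scoring idea concrete, here is a minimal sketch of the late-interaction (MaxSim) score described in the ColBERT paper: each query token is matched to its most similar document token, and those maxima are summed. The token embeddings below are made-up 2-dimensional vectors, and `normalize` and `maxsim_score` are hypothetical helper names; this is not the DrDecr model or the code shipped in this directory.

```python
import math

def normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)

def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction score: for each query token, take the similarity of its
    best-matching document token, then sum those maxima over the query."""
    q = [normalize(v) for v in query_vecs]
    d = [normalize(v) for v in doc_vecs]
    return sum(
        max(sum(a * b for a, b in zip(qv, dv)) for dv in d)
        for qv in q
    )

# Toy token embeddings, invented purely for illustration.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 2.0]]   # has a match for both query tokens
doc_b = [[1.0, 0.0], [3.0, 0.0]]   # has a match for the first query token only

score_a = maxsim_score(query, doc_a)
score_b = maxsim_score(query, doc_b)
print(score_a, score_b)
```

With these toy vectors, `doc_a` scores higher than `doc_b` because it covers both query tokens. A real re-ranker would obtain the token embeddings from a trained encoder.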

Read our blog post for a more detailed explanation: https://medium.com/towards-generative-ai/improving-rag-retrieval-augmented-generation-answer-quality-with-re-ranker-55a19931325

Directory Content:

Elasticsearch

The Elasticsearch retriever supports various models to improve the relevance of the returned documents.
1. es_retriever_reranker.ipynb: This Jupyter notebook walks through retrieving data from the Elasticsearch index and applying the ColBERT-based DrDecr re-ranker, a deep learning model.
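As a rough illustration of the retrieval half of that flow, the sketch below builds a `match` query body for the Elasticsearch search API and pulls passages out of a response. The field name `content`, the helper names, and the sample response are assumptions for illustration; the notebook may structure this differently. No cluster is contacted here, so the response is a hand-written stand-in that follows the standard `hits.hits` layout.

```python
def build_match_query(field, text, size=10):
    """Build a search request body that runs a `match` query on one field."""
    return {"size": size, "query": {"match": {field: text}}}

def extract_passages(response, field):
    """Return (passage text, retriever score) pairs from a search response."""
    return [
        (hit["_source"][field], hit["_score"])
        for hit in response["hits"]["hits"]
    ]

# "content" is a hypothetical field name.
body = build_match_query("content", "what is late interaction?", size=2)

# Hand-written stand-in for what a search call would return.
sample_response = {
    "hits": {"hits": [
        {"_score": 7.1, "_source": {"content": "ColBERT uses late interaction."}},
        {"_score": 5.4, "_source": {"content": "BM25 is a lexical ranking function."}},
    ]}
}

passages = extract_passages(sample_response, "content")
print(body)
print(passages)
```

The extracted passages are what would then be handed to the re-ranker, which reorders them using its own scores instead of the retriever's.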

Solr

Solr retrieval works best for very long documents, e.g. books with hundreds of pages.
1. solr_retriever_reranker.ipynb: This Jupyter notebook walks through retrieving data from the Solr index and applying the ColBERT re-ranker, a deep learning model.
2. solr_retriever_reranker.py: This Python script performs the same retrieval from Solr and applies the ColBERT re-ranker without a notebook.
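For orientation, a Solr retrieval step can be sketched with the standard `/select` request handler, as below. The host, the core name `books`, the field name `content`, and both helper names are assumptions, not values taken from the script. The response is a hand-written stand-in with Solr's usual `response.docs` layout, so nothing is sent over the network.

```python
from urllib.parse import urlencode

def build_solr_select_url(base_url, core, query, rows=10):
    """Build a URL for Solr's /select handler that asks for JSON output."""
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{base_url}/solr/{core}/select?{params}"

def extract_docs(response, field):
    """Collect one field from every returned document that has it."""
    return [doc[field] for doc in response["response"]["docs"] if field in doc]

# Hypothetical host, core, and field names.
url = build_solr_select_url("http://localhost:8983", "books", "content:reranking", rows=5)

# Hand-written stand-in for a parsed JSON response.
sample_response = {
    "response": {
        "numFound": 2,
        "docs": [
            {"id": "1", "content": "Chapter on neural re-ranking."},
            {"id": "2", "title": "A document without the content field"},
        ],
    }
}

texts = extract_docs(sample_response, "content")
print(url)
print(texts)
```

Documents that lack the requested field are skipped, so the re-ranker is not handed empty passages.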

Watson Discovery

Re-ranker.ipynb: This Jupyter notebook outlines how to retrieve data from Watson Discovery and then apply the ColBERT re-ranker.

Reranker.py:

A reusable script that ranks documents and returns the one that most closely matches a given query.
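The general shape of such a script might look like the sketch below: score every candidate against the query, sort, and return the top match. The `rerank` function and the `token_overlap` scorer are hypothetical stand-ins, not the script's actual interface. The scorer is passed in as a parameter so that a neural model, such as the DrDecr re-ranker, could be swapped in for the toy word-overlap score used here.

```python
def token_overlap(query, document):
    """Toy relevance score: fraction of query words that appear in the document."""
    q_tokens = set(query.lower().split())
    d_tokens = set(document.lower().split())
    return len(q_tokens & d_tokens) / len(q_tokens) if q_tokens else 0.0

def rerank(query, documents, score_fn=token_overlap, top_k=1):
    """Score each document against the query and return the top_k
    (document, score) pairs, best first."""
    scored = [(score_fn(query, doc), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(doc, score) for score, doc in scored[:top_k]]

docs = [
    "Solr is a search platform",
    "ColBERT is a neural re-ranking model",
    "Elasticsearch stores documents in an index",
]

best = rerank("neural re-ranking model", docs, top_k=1)
print(best)
```

Keeping the scorer separate from the sorting logic means the same wrapper works regardless of which retriever produced the candidate documents.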

These scripts and notebooks serve as guides. They demonstrate how to retrieve data from different sources and improve search results by applying a re-ranking algorithm. The algorithm considers how closely each document matches the query, as well as the quality of the data, to surface the best possible results.

Getting Started

1. Clone this repository.
2. Modify config.yaml to update the re-ranker model, if required.
3. Run reranker.py to see the re-ranker module in action.

Reference

DrDecr Re-ranking model
