Density2R

Official implementation of Density2R, an efficient zero-shot document re-ranking method for information retrieval, RAG, and LLM-based reranking, published at IEEE BigData 2025.

Density2R is a lightweight zero-shot document re-ranking method that uses embedding density over LLM parametric knowledge to reduce token cost and latency for RAG and information retrieval pipelines.

Abstract

The advent of transformers has substantially advanced the task of document re-ranking within the field of Information Retrieval (IR). While pre-trained transformers have enabled the development of numerous supervised re-rankers, the unsupervised domain has largely been dominated by Large Language Model (LLM)-based approaches. However, despite their effectiveness, these models suffer from several critical limitations. Supervised re-rankers rely on large datasets of relevance judgments, an expensive and scarce resource, which further leads to poor generalizability. At the same time, unsupervised, zero-shot LLM-based methods suffer from limited context length, high token cost, and significant inference latency. To address these, we propose Density2R, a lightweight zero-shot re-ranker that leverages parametric knowledge from LLM to rank documents via density estimators. Unlike contemporary unsupervised methods, it only requires a handful of tokens to represent the query-dependent parametric information. The proposed approach further incorporates a selective re-ranking strategy that operates only on relevant candidates from the initial retriever, thereby reducing latency. Additionally, we extend the Density2R into a multi-stage pipeline, enabling it to boost the efficiency of re-rankers proposed in contemporary literature. Extensive evaluation on the TREC DL19, DL20, and the BEIR benchmark demonstrates the effectiveness of the proposed approach while maintaining a low-latency footprint.

Keywords

Document re-ranking, passage re-ranking, reranker, efficient reranking, zero-shot reranking, LLM reranking, RAG reranking, information retrieval, BEIR, TREC DL, density estimation, parametric knowledge.

Paper

IEEE Xplore: Density2R: Efficient Document Re-Ranking via Embedding Density Over Parametric Knowledge of Large Language Models
DOI: 10.1109/BigData66926.2025.11401587
Venue: IEEE BigData 2025

⭐ Citation

If you find this work helpful in your research, please consider citing our work.

@INPROCEEDINGS {11401587,
author = { Zaoad, Md Shahir and Zawad, Niamat and Khan, Latifur and Ranade, Priyanka and Krogman, Richard },
booktitle = { 2025 IEEE International Conference on Big Data (BigData) },
title = {{ Density2R: Efficient Document Re-Ranking via Embedding Density Over Parametric Knowledge of Large Language Models }},
year = {2025},
volume = {},
ISSN = {},
pages = {3142-3151},
abstract = { The advent of transformers has substantially advanced the task of document re-ranking within the field of Information Retrieval (IR). While pre-trained transformers have enabled the development of numerous supervised re-rankers, the unsupervised domain has largely been dominated by Large Language Model (LLM)-based approaches. However, despite their effectiveness, these models suffer from several critical limitations. Supervised re-rankers rely on large datasets of relevance judgments, an expensive and scarce resource, which further leads to poor generalizability. At the same time, unsupervised, zero-shot LLM-based methods suffer from limited context length, high token cost, and significant inference latency. To address these, we propose Density2R, a lightweight zero-shot re-ranker that leverages parametric knowledge from LLM to rank documents via density estimators. Unlike contemporary unsupervised methods, it only requires a handful of tokens to represent the query-dependent parametric information. The proposed approach further incorporates a selective re-ranking strategy that operates only on relevant candidates from the initial retriever, thereby reducing latency. Additionally, we extend the Density2R into a multi-stage pipeline, enabling it to boost the efficiency of re-rankers proposed in contemporary literature. Extensive evaluation on the TREC DL19, DL20, and the BEIR benchmark demonstrates the effectiveness of the proposed approach while maintaining a low-latency footprint. },
keywords = {Uncertainty;Costs;Filtering;Large language models;Computational modeling;Pipelines;Estimation;Transformers;Information retrieval;Low latency communication},
doi = {10.1109/BigData66926.2025.11401587},
url = {https://doi.ieeecomputersociety.org/10.1109/BigData66926.2025.11401587},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
month =Dec}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Results		Results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Two_stage_MonoBERT.py		Two_stage_MonoBERT.py
Two_stage_MonoT5.py		Two_stage_MonoT5.py
d2R_KDE.py		d2R_KDE.py
d2R_KDE_Llama.py		d2R_KDE_Llama.py
d2R_RitM.py		d2R_RitM.py
d2R_evaluation.py		d2R_evaluation.py
env.yml		env.yml
gar_evaluation.py		gar_evaluation.py
index_builder.py		index_builder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Density2R

Abstract

Keywords

Paper

⭐ Citation

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Density2R

Abstract

Keywords

Paper

⭐ Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages