Abstract:We propose ListT5, a novel reranking approach based on Fusion-in-Decoder (FiD) that handles multiple candidate passages at both train and inference time. We also introduce an efficient inference framework for listwise ranking based on m-ary tournament sort with output caching. We evaluate and compare our model on the BEIR benchmark for zero-shot retrieval task, demonstrating that ListT5 (1) outperforms the state-of-the-art RankT5 baseline with a notable +1.3 gain in the average NDCG@10 score, (2) has an efficiency comparable to pointwise ranking models and surpasses the efficiency of previous listwise ranking models, and (3) overcomes the lost-in-the-middle problem of previous listwise rerankers. Our code, model checkpoints, and the evaluation framework are fully open-sourced at \url{<a class="link-external link-https" href="https://github.com/soyoung97/ListT5" rel="external noopener nofollow">this https URL</a>}.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve several key challenges in zero - shot retrieval tasks, specifically including: 1. **Improve zero - shot retrieval performance**: - The paper proposes a new method named LISTT5, which is based on the Fusion - in - Decoder (FiD) architecture for listwise reranking. This method performs excellently in zero - shot retrieval tasks. In particular, in the BEIR benchmark test, LISTT5 outperforms the existing state - of - the - art models (such as RankT5), achieving a significant + 1.3 improvement in the NDCG@10 metric. 2. **Overcome the limitations of existing methods**: - **Efficiency problem**: Existing listwise reranking methods (such as DuoT5) usually have a high time complexity (O(n²)). LISTT5 reduces the complexity to O(n + k log n) by introducing the m - ary tournament sort algorithm, thus improving the computational efficiency. - **Position bias problem**: Large language models (LLMs) are prone to the "lost - in - the - middle problem" when dealing with long inputs, that is, insufficient understanding of the information in the middle of the input sequence. LISTT5 effectively alleviates this problem and reduces the influence of position bias by using the FiD architecture. 3. **Provide more robust reranking ability**: - LISTT5 not only outperforms pointwise and pairwise reranking models in performance but also shows stronger robustness when facing changes in input order. Experimental results show that LISTT5 performs more stably under different initial rankings, reducing the sensitivity to the order returned by the first - stage retriever. 4. **Achieve efficient inference**: - The paper also proposes an efficient inference framework, which uses output caching and tournament tree structures to further improve the inference speed, making LISTT5 more practical in actual applications. ### Summary In general, this paper solves the problems of low computational efficiency, serious position bias, and sensitivity to initial ranking in zero - shot retrieval tasks by proposing the LISTT5 model, providing a new solution for improving zero - shot retrieval performance.

ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval

FIRST: Faster Improved Listwise Reranking with Single Token Decoding

Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval

Contribution-Based Multi-Stream Feature Distance Fusion Method With <inline-formula> <tex-math notation="LaTeX">${k}$ </tex-math></inline-formula>-Distribution Re-Ranking for Person Re-Identification

Analyzing the Effectiveness of Listwise Reranking with Positional Invariance on Temporal Generalizability

An Early FIRST Reproduction and Improvements to Single-Token Decoding for Fast Listwise Reranking

RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!

Ranked List Loss for Deep Metric Learning

Zero-Shot Listwise Document Reranking with a Large Language Model

InRanker: Distilled Rankers for Zero-shot Information Retrieval

Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers

Efficient course recommendations with T5-based ranking and summarization

Integrating listwise ranking into pairwise-based image-text retrieval

Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models

Efficient Title Reranker for Fast and Improved Knowledge-Intense NLP

Few-shot Prompting for Pairwise Ranking: An Effective Non-Parametric Retrieval Model

An Optimization Framework for Merging Multiple Result Lists

LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking

Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers

RankDNN: Learning to Rank for Few-shot Learning