ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval

Soyoung Yoon,Eunbi Choi,Jiyeon Kim,Hyeongu Yun,Yireun Kim,Seung-won Hwang
2024-06-07
Abstract:We propose ListT5, a novel reranking approach based on Fusion-in-Decoder (FiD) that handles multiple candidate passages at both train and inference time. We also introduce an efficient inference framework for listwise ranking based on m-ary tournament sort with output caching. We evaluate and compare our model on the BEIR benchmark for zero-shot retrieval task, demonstrating that ListT5 (1) outperforms the state-of-the-art RankT5 baseline with a notable +1.3 gain in the average NDCG@10 score, (2) has an efficiency comparable to pointwise ranking models and surpasses the efficiency of previous listwise ranking models, and (3) overcomes the lost-in-the-middle problem of previous listwise rerankers. Our code, model checkpoints, and the evaluation framework are fully open-sourced at \url{<a class="link-external link-https" href="https://github.com/soyoung97/ListT5" rel="external noopener nofollow">this https URL</a>}.
Information Retrieval
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key challenges in zero - shot retrieval tasks, specifically including: 1. **Improve zero - shot retrieval performance**: - The paper proposes a new method named LISTT5, which is based on the Fusion - in - Decoder (FiD) architecture for listwise reranking. This method performs excellently in zero - shot retrieval tasks. In particular, in the BEIR benchmark test, LISTT5 outperforms the existing state - of - the - art models (such as RankT5), achieving a significant + 1.3 improvement in the NDCG@10 metric. 2. **Overcome the limitations of existing methods**: - **Efficiency problem**: Existing listwise reranking methods (such as DuoT5) usually have a high time complexity (O(n²)). LISTT5 reduces the complexity to O(n + k log n) by introducing the m - ary tournament sort algorithm, thus improving the computational efficiency. - **Position bias problem**: Large language models (LLMs) are prone to the "lost - in - the - middle problem" when dealing with long inputs, that is, insufficient understanding of the information in the middle of the input sequence. LISTT5 effectively alleviates this problem and reduces the influence of position bias by using the FiD architecture. 3. **Provide more robust reranking ability**: - LISTT5 not only outperforms pointwise and pairwise reranking models in performance but also shows stronger robustness when facing changes in input order. Experimental results show that LISTT5 performs more stably under different initial rankings, reducing the sensitivity to the order returned by the first - stage retriever. 4. **Achieve efficient inference**: - The paper also proposes an efficient inference framework, which uses output caching and tournament tree structures to further improve the inference speed, making LISTT5 more practical in actual applications. ### Summary In general, this paper solves the problems of low computational efficiency, serious position bias, and sensitivity to initial ranking in zero - shot retrieval tasks by proposing the LISTT5 model, providing a new solution for improving zero - shot retrieval performance.