Abstract:As the heart of a search engine, the ranking system plays a crucial role in satisfying users' information demands. More recently, neural rankers fine-tuned from pre-trained language models (PLMs) establish state-of-the-art ranking effectiveness. However, it is nontrivial to directly apply these PLM-based rankers to the large-scale web search system due to the following challenging issues: (1) the prohibitively expensive computations of massive neural PLMs, especially for long texts in the web document, prohibit their deployments in an online ranking system that demands extremely low latency; (2) the discrepancy between existing ranking-agnostic pre-training objectives and the ad-hoc retrieval scenarios that demand comprehensive relevance modeling is another main barrier for improving the online ranking system; (3) a real-world search engine typically involves a committee of ranking components, and thus the compatibility of the individually fine-tuned ranking model is critical for a cooperative ranking system. In this work, we contribute a series of successfully applied techniques in tackling these exposed issues when deploying the state-of-the-art Chinese pre-trained language model, i.e., ERNIE, in the online search engine system. We first articulate a novel practice to cost-efficiently summarize the web document and contextualize the resultant summary content with the query using a cheap yet powerful Pyramid-ERNIE architecture. Then we endow an innovative paradigm to finely exploit the large-scale noisy and biased post-click behavioral data for relevance-oriented pre-training. We also propose a human-anchored fine-tuning strategy tailored for the online ranking system, aiming to stabilize the ranking signals across various online components. Extensive offline and online experimental results show that the proposed techniques significantly boost the search engine's performance.

Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech Dataset

A Comparative Study of Text Retrieval Models on DaReCzech

CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking

Some Like It Small: Czech Semantic Embedding Models for Industry Applications

Pretrained Language Model based Web Search Ranking: From Relevance to Satisfaction

Pre-trained Language Model based Ranking in Baidu Search

Long short-term search session-based document re-ranking model

Best Practices for Distilling Large Language Models into BERT for Web Search Ranking

Pre-trained Language Model for Web-scale Retrieval in Baidu Search

Composite Re-Ranking for Efficient Document Search with BERT

Cross-lingual Information Retrieval with BERT

A Dataset and Strong Baselines for Classification of Czech News Texts

Interpretable & Time-Budget-Constrained Contextualization for Re-Ranking

BioASQ 11B: Integrating Domain Specific Vocabulary to BERT-based Model for Biomedical Document Ranking

DeText: A Deep Text Ranking Framework with BERT

SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search

BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search

Search Result Reranking with Visual and Structure Information Sources

Relevance Estimation with Multiple Information Sources on Search Engine Result Pages.

Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of Czech

Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches