Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

Cash Costello,Eugene Yang,Dawn Lawrie,James Mayfield
DOI: https://doi.org/10.48550/arXiv.2201.09996
2022-01-25
Abstract:While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many language pairs, to be scalable to large document collections, and to support reproducible experiments driven by a configuration file. We include Patapsco results on standard CLIR collections using multiple settings.
Information Retrieval
What problem does this paper attempt to address?