pathfinder: A Semantic Framework for Literature Review and Knowledge Discovery in Astronomy
Kartheik G. Iyer,Mikaeel Yunus,Charles O'Neill,Christine Ye,Alina Hyk,Kiera McCormick,Ioana Ciuca,John F. Wu,Alberto Accomazzi,Simone Astarita,Rishabh Chakrabarty,Jesse Cranney,Anjalie Field,Tirthankar Ghosal,Michele Ginolfi,Marc Huertas-Company,Maja Jablonska,Sandor Kruk,Huiling Liu,Gabriel Marchidan,Rohit Mistry,J.P. Naiman,J. E. G. Peek,Mugdha Polimera,Sergio J. Rodriguez,Kevin Schawinski,Sanjib Sharma,Michael J. Smith,Yuan-Sen Ting,Mike Walmsley
2024-08-03
Abstract:The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present Pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords. Utilizing state-of-the-art large language models (LLMs) and a corpus of 350,000 peer-reviewed papers from the Astrophysics Data System (ADS), Pathfinder offers an innovative approach to scientific inquiry and literature exploration. Our framework couples advanced retrieval techniques with LLM-based synthesis to search astronomical literature by semantic context as a complement to currently existing methods that use keywords or citation graphs. It addresses complexities of jargon, named entities, and temporal aspects through time-based and citation-based weighting schemes. We demonstrate the tool's versatility through case studies, showcasing its application in various research scenarios. The system's performance is evaluated using custom benchmarks, including single-paper and multi-paper tasks. Beyond literature review, Pathfinder offers unique capabilities for reformatting answers in ways that are accessible to various audiences (e.g. in a different language or as simplified text), visualizing research landscapes, and tracking the impact of observatories and methodologies. This tool represents a significant advancement in applying AI to astronomical research, aiding researchers at all career stages in navigating modern astronomy literature.
Instrumentation and Methods for Astrophysics,Digital Libraries,Information Retrieval