The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces
Kyle Lo,Joseph Chee Chang,Andrew Head,Jonathan Bragg,Amy X. Zhang,Cassidy Trier,Chloe Anastasiades,Tal August,Russell Authur,Danielle Bragg,Erin Bransom,Isabel Cachola,Stefan Candra,Yoganand Chandrasekhar,Yen-Sung Chen,Evie Yu-Yen Cheng,Yvonne Chou,Doug Downey,Rob Evans,Raymond Fok,Fangzhou Hu,Regan Huff,Dongyeop Kang,Tae Soo Kim,Rodney Kinney,Aniket Kittur,Hyeonsu Kang,Egor Klevak,Bailey Kuehl,Michael Langan,Matt Latzke,Jaron Lochner,Kelsey MacMillan,Eric Marsh,Tyler Murray,Aakanksha Naik,Ngoc-Uyen Nguyen,Srishti Palani,Soya Park,Caroline Paulic,Napol Rachatasumrit,Smita Rao,Paul Sayre,Zejiang Shen,Pao Siangliulue,Luca Soldaini,Huy Tran,Madeleine van Zuylen,Lucy Lu Wang,Christopher Wilhelm,Caroline Wu,Jiangjiang Yang,Angele Zamarron,Marti A. Hearst,Daniel S. Weld
2023-04-23
Abstract:Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has changed little in decades. The PDF format for sharing research papers is widely used due to its portability, but it has significant downsides including: static content, poor accessibility for low-vision readers, and difficulty reading on mobile devices. This paper explores the question "Can recent advances in AI and HCI power intelligent, interactive, and accessible reading interfaces -- even for legacy PDFs?" We describe the Semantic Reader Project, a collaborative effort across multiple institutions to explore automatic creation of dynamic reading interfaces for research papers. Through this project, we've developed ten research prototype interfaces and conducted usability studies with more than 300 participants and real-world users showing improved reading experiences for scholars. We've also released a production reading interface for research papers that will incorporate the best features as they mature. We structure this paper around challenges scholars and the public face when reading research papers -- Discovery, Efficiency, Comprehension, Synthesis, and Accessibility -- and present an overview of our progress and remaining open challenges.
Human-Computer Interaction,Artificial Intelligence,Computation and Language