The Computing Research Repository: Promoting the Rapid Dissemination and Archiving of Computer Science Research

Joseph Y. Halpern, Carl Lagoze
DOI: https://doi.org/10.48550/arXiv.cs/9812020
1998-12-22
Abstract: We describe the Computing Research Repository (CoRR), a new electronic archive for rapid dissemination and archiving of computer science research results. CoRR was initiated in September 1998 through the cooperation of ACM, the LANL (Los Alamos National Laboratory) e-Print archive, and NCSTRL (Networked Computer Science Technical Reference Library). Through its implementation of the Dienst protocol, CoRR combines the open and extensible architecture of NCSTRL with the reliable access and well-established management practices of the LANL XXX e-Print repository. This architecture will allow integration with other e-Print archives and provides a foundation for a future broad-based scholarly digital library. We describe the decisions that were made in creating CoRR, the architecture of the CoRR/NCSTRL interoperation, and issues that have arisen during the operation of CoRR.
Digital Libraries
What problem does this paper attempt to address?
The problem this paper attempts to solve is the rapid dissemination and reliable archiving of computer science research results. Specifically, the authors ask how to combine the advantages of centralized and distributed electronic publishing systems into a platform that disseminates research results rapidly while also providing high-quality cataloging, indexing, and long-term archiving.

### Main problems

1. **The tension between rapid dissemination and high-quality archiving**:
   - Before the Internet, researchers disseminated results rapidly through printed materials such as technical reports and conference papers. With the development of the Internet, FTP and the websites of individuals and organizations made research results instantly accessible worldwide. However, these methods often lack the reliability and the high-quality cataloging, indexing, and archiving that the journal publishing process provides.
2. **The deficiencies of existing systems**:
   - Existing systems such as electronic journals, bibliographic servers, intelligent crawlers, federated architectures, and independent e-Print repositories have succeeded in some respects, but they are limited in addressing long-term archiving, format conversion, reliability, and scalability.
3. **The trade-off between centralized and distributed systems**:
   - Centralized systems (such as the XXX e-Print archive at Los Alamos National Laboratory) usually offer high reliability and good management practices but lack flexibility and interoperability, while distributed systems (such as NCSTRL) are more flexible but have problems with management and reliability.

### The goals of the paper

To address the above problems, the authors propose establishing the Computing Research Repository (CoRR). CoRR aims to combine the stability of the XXX e-Print archive with the open interface of NCSTRL, providing a platform that can both disseminate research results rapidly and ensure high-quality cataloging, indexing, and long-term archiving.

### Key points of the solution

- **Integrating the advantages of XXX and NCSTRL**: By implementing the Dienst protocol, CoRR combines the stability of XXX with the open interface of NCSTRL to achieve unified cross-library search and access.
- **Supporting multiple submission formats**: It accepts multiple formats such as TeX/LaTeX/AMSTeX, HTML + GIF, PDF, and PostScript, ensuring the long-term availability and readability of documents.
- **Copyright and long-term archiving**: CoRR does not force authors to transfer copyright and promises to preserve submitted papers permanently, allowing authors to update versions without deleting old versions.

Through these measures, CoRR hopes to become a reliable and scalable platform for publishing and archiving research results in the field of computer science.
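
The idea of unified cross-library search can be illustrated with a small sketch. This is not the Dienst protocol itself, whose requests and message formats are not described in this summary. It only assumes that a centralized archive and a distributed server both expose the same search interface. All names here (`Record`, `SearchableRepository`, `InMemoryRepository`, `federated_search`) and the sample records are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Iterable, Protocol


@dataclass(frozen=True)
class Record:
    """A minimal bibliographic record (hypothetical schema)."""
    identifier: str
    title: str
    source: str


class SearchableRepository(Protocol):
    """Hypothetical shared interface, standing in for a common protocol."""
    name: str

    def search(self, term: str) -> list[Record]:
        ...


@dataclass
class InMemoryRepository:
    """Toy repository; a real one would answer queries over the network."""
    name: str
    records: list[Record] = field(default_factory=list)

    def search(self, term: str) -> list[Record]:
        # Case-insensitive substring match on the title only.
        needle = term.lower()
        return [r for r in self.records if needle in r.title.lower()]


def federated_search(
    repositories: Iterable[SearchableRepository], term: str
) -> list[Record]:
    """Query every repository and merge hits, dropping duplicate identifiers."""
    seen: set[str] = set()
    merged: list[Record] = []
    for repo in repositories:
        for record in repo.search(term):
            if record.identifier not in seen:
                seen.add(record.identifier)
                merged.append(record)
    return merged


if __name__ == "__main__":
    central = InMemoryRepository(
        "central-archive",
        [Record("central:1", "Digital library protocols", "central-archive")],
    )
    distributed = InMemoryRepository(
        "department-server",
        [Record("dept:7", "Library interoperability", "department-server")],
    )
    for hit in federated_search([central, distributed], "library"):
        print(hit.source, hit.identifier, hit.title)
```

Because the search function depends only on the shared interface, a further archive could be added to the list without changing the search code, which is the kind of extensibility the summary attributes to the open interface.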