Cloud-Based Big Data Management and Analytics for Scholarly Resources: Current Trends, Challenges and Scope for Future Research

Samiya Khan, Kashish A. Shakil, Mansaf Alam
DOI: https://doi.org/10.48550/arXiv.1606.01808
2016-06-07
Abstract: With the shifting focus of organizations and governments towards digitization of academic and technical documents, there has been an increasing need to use this reserve of scholarly documents for developing applications that can facilitate and aid in better management of research. In addition to this, the evolving nature of research problems has made them essentially interdisciplinary. As a result, there is a growing need for scholarly applications like collaborator discovery, expert finding and research recommendation systems. This research paper reviews the current trends and identifies the challenges existing in the architecture, services and applications of big scholarly data platforms, with a specific focus on directions for future research.
Digital Libraries, Computers and Society
What problem does this paper attempt to address?
This paper addresses how the growing repository of digitized academic and technical documents can be used to develop applications that facilitate research management. Because research questions are increasingly interdisciplinary, demand is also growing for applications such as collaborator discovery, expert finding, and research recommendation systems. The paper therefore reviews current trends in cloud-based big data management and analytics for scholarly resources, identifies the challenges in existing architectures, services, and applications, and explores directions for future research.

Specifically, the paper focuses on the following aspects:

1. **Characteristics of Academic Big Data**: Volume, Variety, and Velocity, and how these characteristics affect data management, processing, and resource allocation.
2. **Data Acquisition and Integration**: How to acquire and integrate different types of academic data from multiple sources while ensuring data integrity and accuracy.
3. **Information Extraction**: How to extract useful information from raw academic data, such as metadata, author information, citations, and chapter content, to improve data quality and the accuracy of analysis results.
4. **Technical and Non-Technical Challenges**: The technical challenges in data management, analysis, and visualization, as well as the non-technical challenges of managing and adopting these solutions.

By covering these areas, the paper aims to provide direction for future research on cloud-based management and analysis of academic big data.