Bangladesh Agricultural Knowledge Graph: Enabling Semantic Integration and Data-driven Analysis--Full Version

Rudra Pratap Deb Nath,Tithi Rani Das,Tonmoy Chandro Das,S.M. Shafkat Raihan
2024-03-19
Abstract:In Bangladesh, agriculture is a crucial driver for addressing Sustainable Development Goal 1 (No Poverty) and 2 (Zero Hunger), playing a fundamental role in the economy and people's livelihoods. To enhance the sustainability and resilience of the agriculture industry through data-driven insights, the Bangladesh Bureau of Statistics and other organizations consistently collect and publish agricultural data on the Web. Nevertheless, the current datasets encounter various challenges: 1) they are presented in an unsustainable, static, read-only, and aggregated format, 2) they do not conform to the Findability, Accessibility, Interoperability, and Reusability (FAIR) principles, and 3) they do not facilitate interactive analysis and integration with other data sources. In this paper, we present a thorough solution, delineating a systematic procedure for developing BDAKG: a knowledge graph that semantically and analytically integrates agriculture data in Bangladesh. BDAKG incorporates multidimensional semantics, is linked with external knowledge graphs, is compatible with OLAP, and adheres to the FAIR principles. Our experimental evaluation centers on evaluating the integration process and assessing the quality of the resultant knowledge graph in terms of completeness, timeliness, FAIRness, OLAP compatibility and data-driven analysis. Our federated data analysis recommend a strategic approach focused on decreasing CO$_2$ emissions, fostering economic growth, and promoting sustainable forestry.
Computers and Society,Databases
What problem does this paper attempt to address?
The problems that this paper attempts to solve are: How to improve the sustainability and resilience of agricultural data in Bangladesh through semantic integration and data - driven analysis in response to United Nations Sustainable Development Goal 1 (No Poverty) and 2 (Zero Hunger). Specifically, the paper aims to address the following challenges currently faced by agricultural datasets: 1. **Unsustainable, Static, Read - Only and Aggregated Formats**: Existing agricultural data are usually presented in formats such as PDF, making it difficult to conduct interactive analysis or integrate with other data sources. 2. **Non - compliance with the FAIR Principles**: These data do not conform to the principles of "Findability, Accessibility, Interoperability, and Reusability", so it is difficult for researchers and other stakeholders to discover, integrate, or reuse them in new situations. 3. **Semantic Heterogeneity Problem**: Different sources describe the same data in different ways, resulting in semantic inconsistencies. 4. **Lack of Global Interconnection**: There are no global links established between data, limiting the ability to gain insights by analyzing or comparing different datasets. To solve these problems, the paper proposes a systematic method for developing BDAKG (Bangladesh Agricultural Knowledge Graph), which has the following characteristics: - **Multidimensional Semantics**: Enhance data analysis capabilities through multidimensional modeling. - **External Knowledge Graph Linking**: Connect with external knowledge graphs to expand the information network. - **OLAP Compatibility**: Support Online Analytical Processing (OLAP) for easy complex querying and analysis. - **Compliance with the FAIR Principles**: Ensure the findability, accessibility, interoperability, and reusability of data. Through these improvements, BDAKG can not only provide a more comprehensive view of data but also support more in - depth data - driven analysis, thereby providing strategic suggestions for reducing carbon dioxide emissions, promoting economic growth, and promoting the development of sustainable forestry.