Abstract: Precise semantic representation is important for allowing machines to truly comprehend the meaning of natural language text, especially biomedical literature. Although existing approaches can accurately represent the semantic relations among words within a single sentence, they cannot yet accurately model relations between sentences, which leads to a loss of contextual information and makes interpretable semantic inference difficult. Additionally, it is challenging to merge semantic representations curated by different experts. Existing methods do not adequately address these challenges. In this paper, we present a framework for structured semantic representation (FSSR) to address them. FSSR uses a double-layer structure, the Construct, which combines a Paradigm and an Instance to represent the semantics of a word or a sentence. It uses six types of rules to represent the semantic relations between sentence Constructs and uses a Computational Model to represent an action. FSSR is a graph-based representation of semantics in which each node represents a Construct or a Paradigm and each edge connecting two nodes represents a rule. In addition, FSSR enables interpretable inference and active acquisition of new information, as illustrated in a case study that models the semantics of a cancer prognostic analysis article and reproduces its textual results and charts. We provide a website that visualizes the inference process (http://cragraph.synergylab.cn).
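The graph structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The class names `Paradigm`, `Construct`, and `SemanticGraph`, their fields, and the methods `add_rule` and `neighbors` are all hypothetical. The abstract does not name the six rule types, so rule labels are left as free-form strings.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Paradigm:
    """Upper layer: a general pattern (hypothetical field layout)."""
    name: str


@dataclass(frozen=True)
class Construct:
    """Double-layer unit: a Paradigm paired with a concrete Instance."""
    paradigm: Paradigm
    instance: str


@dataclass
class SemanticGraph:
    """Nodes are Constructs or Paradigms; each edge carries a rule label."""
    edges: list = field(default_factory=list)

    def add_rule(self, source, rule_type, target):
        # Store the edge as a (source, rule label, target) triple.
        self.edges.append((source, rule_type, target))

    def neighbors(self, node, rule_type=None):
        # Follow outgoing edges from `node`, optionally filtered by rule label.
        return [
            target
            for source, label, target in self.edges
            if source == node and (rule_type is None or label == rule_type)
        ]


# Placeholder content only; the rule label below is not taken from the paper.
sentence = Paradigm("Sentence")
first = Construct(sentence, "first sentence text")
second = Construct(sentence, "second sentence text")

graph = SemanticGraph()
graph.add_rule(first, "placeholder-rule", second)
linked = graph.neighbors(first, "placeholder-rule")
```

Making the node classes frozen dataclasses makes them hashable and comparable by value, so the same Construct can appear as an endpoint of several edges and still be matched when following rules.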

A Framework for Structured Semantic Representation Capable of Active Sensing and Interpretable Inference: A Cancer Prognostic Analysis Case Study
