Abstract:The large-scale and complex data generated in the teaching field of business administration poses challenges for decision-makers and managers of companies, and how to effectively extract and manage the useful information contained in these data has become a problem to be solved. Currently available methods of subject knowledge library clustering and visualization struggle to handle the complexity and multi-hierarchies of such subject data effectively or meet users’ requirements for advanced semantic understanding and retrieval. In view of these matters, this study aims to probe deeper into the problem of multi-level text clustering in the subject knowledge library and its visualization. Firstly, an innovative strategy-based subject semantic representation method for knowledge libraries was proposed to better interpret and represent the semantic information of subject data. Secondly, a subject clustering model of the knowledge library was constructed based on an improved hierarchical Dirichlet polynomial distribution, enabling efficient and accurate clustering of subject data. Lastly, visualization technology was employed to display the cluster results, allowing users to gain a clear understanding of the internal relationships and structure of the subject data. The research findings of this study could provide valuable new tools and methods for solving the problem of subject knowledge library management and utilization, analyzing the subject data, and supporting decision-making. As a result, they hold both theoretical and practical significance.

Extract List Data from Semi-Structured Document Using Clustering

List data extraction in semi-structured document

Document Clustering Using Locality Preserving Indexing

Optimized Hierarchy Clustering Based Extraction for Logical Document Structures

Logical Structure Based Semantic Relationship Extraction from Semi-Structured Documents

A Rule-Based Information Extraction System for Human-Readable Semi-Structured Scientific Documents

A Semantic approach for effective document clustering using WordNet

A Novel Text Clustering Algorithm Based on Inner Product Space Model of Semantic

Clustering Unstructured Data (Flat Files) - An Implementation in Text Mining Tool

Clustering-based Semantic Retrieval Algorithm

Untagged Table Extraction in Semi-structured Documents

A Semi-Structured Document Model for Text Mining.

Semantic Text Mining with Linked Data

A Model To Enhance Xml Document Clustering

Clustering articles based on semantic similarity

Multi-documents Automatic Abstracting Based on Text Clustering and Semantic Analysis

Document Clustering Based on Word Sense Cluster

Document Clustering Based on Semantic Smoothing Approach

Information Retrieval in long documents: Word clustering approach for improving Semantics

Concept chain based text clustering

Multi-Level Text Clustering in Subject Knowledge Library and its Visualization