Abstract: Finding hot topics in scholarly fields helps researchers keep up with the latest concepts, trends, and inventions in their field of interest. Because complete large-scale scholarly data has been rare, earlier studies approached this problem through manual topic extraction from a limited number of domains, focusing on a single feature such as coauthorship or citation relations. Given the compromised effectiveness of such predictions, in this paper we use a real scholarly dataset from Microsoft Academic Graph [1], which provides more than 12,000 topics in the field of Computer Science (CS), including 1,200 venues, 14.4 million authors, 30 million papers, and their citation relations from 1950 to the present. Aiming to find the topics that will trend in CS, we formalize a hot topic prediction problem in which, with joint consideration of both inter- and intra-topical influence, 17 different scientific features are extracted to describe topic status comprehensively. By leveraging all 17 features, we observe good accuracy in forecasting topic scale after 5 and 10 years, with $R^2$ values of 0.9893 and 0.9646, respectively. Interestingly, our prediction suggests that the maximum value matters in finding hot topics in scholarly fields, primarily in three respects: (1) the maximum value of each factor, such as authors' maximum h-index and largest citation number, provides three times as much information for prediction as the average value; (2) the mutual influence between the most correlated topics serves as the most telling factor in long-term topic trend prediction, suggesting that topics currently exhibiting the maximum growth rates will drive their correlated topics to become hot in the future; (3) we predict, for the next - years, the top 100 fastest growing (maximum growth rate) topics that will potentially receive the most attention in CS.
All our findings are further demonstrated through an online visualization system.
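
As a rough illustration of two quantities the abstract relies on, the sketch below computes the maximum and the average of one per-topic factor and the $R^2$ score of a forecast. It is a minimal sketch and not the paper's pipeline: the function names, the h-index values, and the topic-scale numbers are all hypothetical and chosen only to make the arithmetic easy to follow.

```python
from statistics import mean


def aggregate_feature(values):
    """Return the maximum and the average of one per-topic factor."""
    return {"max": max(values), "mean": mean(values)}


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    avg = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - avg) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical h-indices of the authors publishing in one topic.
h_indices = [3, 5, 8, 40]
features = aggregate_feature(h_indices)

# Hypothetical topic scales (e.g. paper counts) and a model's forecasts.
actual_scale = [100, 200, 300, 400]
predicted_scale = [110, 190, 310, 390]
score = r_squared(actual_scale, predicted_scale)

print(features, round(score, 4))
```

In this toy input a single highly cited author pulls the maximum far above the average, which is the kind of signal the abstract reports as more informative than the mean.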

Finding Maximal Ranges with Unique Topics in a Text Database.

Efficiently answering top-k frequent term queries in temporal-categorical range

Unique Topic Query Processing On Cloud

MaxiZone: Maximizing Influence Zone over Geo-Textual Data (Extended abstract)

On the Unsupervised Analysis of Domain-Specific Chinese Texts

Scalable Top-K Spatial Keyword Search

Topic Mining over Asynchronous Text Sequences

Efficient Algorithms for Top-k Keyword Queries on Spatial Databases

Bottom-up Discovery of Frequent Rooted Unordered Subtrees

"Draw My Topics": Find Desired Topics fast from large scale of Corpus

Topic Discovery in Massive Text Corpora Based on Min-Hashing

Processing Spatial Keyword Query As a Top-K Aggregation Query

Intensity of Relationship Between Words: Using Word Triangles in Topic Discovery for Short Texts

Diversified Spatial Keyword Query on Topic Coverage

WordTopic-MultiRank: A New Method for Automatic Keyphrase Extraction.

Short Texts' Hot Topics Detection: Based on Word Frequency Mean Fluctuation and Probabilistic Language Model

Online Subset Topic Modeling For Interactive Documents Exploration

A Topic Model for Hierarchical Documents

Topics Modeling Based on Selective Zipf Distribution

A text visualization method for cross-domain research topic mining

Maximum Value Matters: Finding Hot Topics in Scholarly Fields