Abstract: Automatically assessing academic papers has enormous potential to reduce the burden of peer review and the influence of individual bias. Existing studies strive to build sophisticated deep neural networks that identify academic value from comprehensive data, e.g., academic graphs and full papers. However, these data are not always easy to access, and a fair assessment should rest on the content of the paper rather than on features outside it. Furthermore, while BERT models can maintain general semantics by pre-training on large-scale corpora, they tend to over-smooth because their stacked self-attention layers operate on unfiltered input tokens. It is therefore nontrivial to identify the distinguishing value of an academic paper from its limited content. In this study, we propose a novel deep neural network, namely Dual-view Graph Convolutions Enhanced BERT (DGC-BERT), for academic paper acceptance estimation. We combine the title and abstract of the paper as input. A pre-trained BERT model is then employed to extract the paper’s general representations. Apart from the hidden representations of the final layer, we highlight the first and last few layers as the lexical and semantic views, respectively. In particular, we re-examine the filtered self-attention matrices of the two views by constructing a graph for each. Two multi-hop Graph Convolutional Networks (GCNs) are then separately employed to capture pivotal and distant dependencies between the tokens. Moreover, the dual-view representations enhance each other through biaffine attention modules, and a re-weighting gate is proposed to further refine the dual-view representations with the help of the original BERT representation. Finally, whether the submitted paper is acceptable is predicted from the original language model features combined with the dual-view dependencies. Extensive data analyses and studies of the full-paper-based MHCNN provide insights into the task and the structural functions. Comparison experiments on two benchmark datasets demonstrate that the proposed DGC-BERT significantly outperforms alternative approaches, especially state-of-the-art models such as MHCNN and BERT variants. Additional analyses reveal the significance and explainability of the proposed modules in DGC-BERT. Our code and settings have been released on GitHub (https://github.com/ECNU-Text-Computing/DGC-BERT).
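To make the graph-construction step more concrete, the following is a minimal sketch of how a self-attention matrix might be filtered into a token graph and passed through stacked graph convolutions. It is not the authors' implementation (see the released repository for that). Several choices here are assumptions made only for illustration: the "filtering" is taken to be a per-row top-k selection, the graph is symmetrized and normalized in the usual GCN manner, "multi-hop" is approximated by stacking layers so that each extra layer reaches one hop further, and all function names, the toy attention matrix, and the identity weights are hypothetical.

```python
import math

def top_k_filter(attn, k):
    """Keep the k largest weights in each row of an attention matrix; zero the rest.
    (Assumed filtering rule; the abstract does not specify one.)"""
    filtered = []
    for row in attn:
        keep = set(sorted(range(len(row)), key=lambda j: row[j], reverse=True)[:k])
        filtered.append([w if j in keep else 0.0 for j, w in enumerate(row)])
    return filtered

def normalized_adjacency(weights):
    """Symmetrize, add self-loops, and scale as D^-1/2 (A + I) D^-1/2."""
    n = len(weights)
    a = [[max(weights[i][j], weights[j][i]) + (1.0 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a]  # always > 0 because of the self-loops
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(x, y):
    """Plain-Python matrix product of two lists of lists."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*y)] for row in x]

def gcn_layer(adj, h, w):
    """One graph convolution: ReLU(adj @ h @ w)."""
    out = matmul(matmul(adj, h), w)
    return [[max(0.0, v) for v in row] for row in out]

def multi_hop_gcn(adj, h, layer_weights):
    """Stack one GCN layer per weight matrix; L layers reach L-hop neighbours."""
    for w in layer_weights:
        h = gcn_layer(adj, h, w)
    return h

# Toy attention over 4 tokens (rows sum to 1); values are made up.
ATTN = [
    [0.60, 0.30, 0.05, 0.05],
    [0.10, 0.50, 0.35, 0.05],
    [0.05, 0.10, 0.50, 0.35],
    [0.05, 0.05, 0.30, 0.60],
]
H = [[0.0], [0.0], [0.0], [1.0]]  # a 1-d feature that is "on" only for token 3
W = [[1.0]]                       # identity weight, to isolate the propagation

filtered = top_k_filter(ATTN, k=2)
adj = normalized_adjacency(filtered)
one_hop = multi_hop_gcn(adj, H, [W])     # signal reaches token 2, not token 1
two_hop = multi_hop_gcn(adj, H, [W, W])  # signal reaches token 1, not token 0
```

In this toy case the filtered graph is a chain 0-1-2-3, so the feature that starts on token 3 spreads one token further with each stacked layer. This illustrates how multi-hop propagation over a sparse attention graph can connect distant tokens; in a real model the attention matrices would come from the chosen BERT layers and the weights would be learned.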

Re-examining Lexical and Semantic Attention: Dual-view Graph Convolutions Enhanced BERT for Academic Paper Rating.
