Abstract:The development of modern ab initio methods has rapidly increased our understanding of physics, chemistry and materials science. Unfortunately, intensive ab initio calculations are intractable for large and complex systems. On the other hand, empirical force fields are less accurate with poor transferability even though they are efficient to handle large and complex systems. The recent development of machine-learning based neural-network (NN) for local atomic environment representation of density functional theory (DFT) has offered a promising solution to this long-standing challenge. Si is one of the most important elements in science and technology, however, an accurate and transferable interatomic potential for Si is still lacking. Here, we develop a generalized NN potential for Si, which correctly predicts the Si(111)-(7x7) ground-state surface reconstruction for the first time and accurately reproduces the DFT results in a wide range of complex Si structures. We envision similar developments will be made for a wide range of materials systems in the near future.
What problem does this paper attempt to address?
The key problem that this paper attempts to solve is to develop an accurate and transferable inter - silicon (Si) atomic potential to overcome the limitations of traditional first - principles calculations (such as density functional theory, DFT) and empirical force - field methods. Specifically:
1. **Limitations of First - Principles Methods**:
- Although first - principles methods such as DFT have achieved remarkable success in understanding and predicting material properties, their computational cost is very high, and it is difficult to handle large and complex systems.
- The algorithmic complexity of DFT is usually super - linear, and as the simulated system increases, the amount of computation will increase dramatically.
2. **Limitations of Empirical Force Fields**:
- Although empirical force fields have high computational efficiency, their accuracy is low and their transferability is poor, that is, they cannot well describe systems with a variety of different properties.
- Empirical force fields rely on a small number of parameters, which are usually obtained by fitting experimental data or first - principles results, so there are limitations in describing complex structures.
3. **Potential of Machine - Learning Potentials**:
- Machine - learning (ML) methods, especially those based on neural networks (NN), can provide a solution that takes into account both accuracy and efficiency by high - dimensional fitting of the potential energy surface (PES) of first - principles.
- The NN potential can automatically adjust the model parameters according to the local atomic environment without relying on a predefined physical model, which enables it to better describe complex systems.
4. **Specific Challenges of Silicon**:
- Silicon is one of the most important elements in science and technology, but so far there is a lack of a general and accurate inter - silicon atomic potential, especially in describing complex surface reconstructions, etc.
- In particular, for complex phenomena such as the silicon (111)-(7x7) surface reconstruction, the existing inter - atomic potentials cannot accurately describe them.
For this reason, the author has developed a generalized neural - network potential (NN potential), aiming to accurately reproduce DFT results and be able to describe a variety of complex silicon structures from bulk crystals to liquid, amorphous systems, point defects, and surface reconstructions. In particular, this NN potential successfully predicts and describes for the first time the dimer - adatom - stacking - fault (DAS) reconstruction of the silicon (111)-(7x7) surface in the ground state.
### Formula Summary
The formulas involved in the paper include:
1. **Neural - Network Energy Output Formula**:
\[
E_i=\sum_{k = 1}^{m}\sum_{j = 1}^{n}w_{jk}^{(2)}f_a^{(2)}\left(w_{0j}^{(2)}+\sum_{i = 1}^{m}w_{ij}^{(1)}f_a^{(1)}(w_{0i}^{(1)}+G_{ik})\right)
\]
where \(w_{jk}^{(2)}\) and \(w_{ij}^{(1)}\) are the weight parameters connecting the hidden layer and the output layer, and the input layer and the hidden layer respectively; \(f_a^{(1)}\) and \(f_a^{(2)}\) are activation functions; \(G_{ik}\) is a symmetric function.
2. **Cut - off Function**:
\[
f_c(R_{ij})=\begin{cases}0.5\times[\cos(\pi R_{ij}/R_c)+1]&\text{for }R_{ij}\leq R_c\\0&\text{for }R_{ij}>R_c\end{cases}
\]
3. **Radially Symmetric Function**:
\[
G_i^{(2)}=\sum_{j\neq i}e^{-\eta(R_{ij}-R_s)^2}f_c(R_{ij})
\]
4. **Angular Symmetric Function**:
\[
G_i^{(3)}=\sum_{j\neq i}\sum_{k\neq i,j}2(1+\lambda\cos\theta_{ijk})e^{-\eta(R_{ij}^2+R_{ik}^2+R_{