Predicting Algorithm of Attc Site Based on Combination Optimization Strategy

Zhendong Liu,Xi Chen,Dongyan Li,Xinrong Lv,Mengying Qin,Ke Bai,Zhiqiang He,Yurong Yang,Xiaofeng Li,Qionghai Dai
DOI: https://doi.org/10.1080/09540091.2022.2086217
2022-01-01
Connection Science
Abstract:Site-specific recombination systems are widely used as bioengineering tools. However, the traditional site-specific recombination system requires a consensus sequence for the specific site. Such sequence-level constraints limit effective recombination between sites. Therefore, in order to develop an efficient site-specific recombination system, we investigated the attC site of the bacterial integration subsystem and built a predictive model to infer the important features that contribute to recombination. Here, we design an attC site prediction algorithm based on a combination optimisation strategy. Based on the structural features of attC sites, the prediction algorithm realises the high-precision prediction of the recombination frequencies between sites and the screening of the top 20 important features that play a role in recombination, which are effective for improving the design method of attC sites. The algorithm has better portability and higher prediction accuracy compared with the existing advanced algorithms, among which the Pearson correlation coefficient is 0.87, explained variance score is 0.73, root mean square error is 0.006 and mean absolute error is 0.041. This can not only provide ideas for the research of efficient recombination systems but also provide a theoretical basis for developing genetic engineering further.
What problem does this paper attempt to address?