Multi-motif Identification Using Differential Information Content and Cluster Refine

Ying-guo WANG,Cheng ZHONG
DOI: https://doi.org/10.3969/j.issn.1000-1220.2017.09.010
2017-01-01
Abstract:The repulsive force function of sampling Markov chain is extended by differential information content to increase the value of repulsive force,the two sampling Markov chains close to each other are pushed to search different regions,the values of elements in probability matrix of motif positions are updated,and an improved multi-motif discovery algorithm is proposed. This algorithm can a-void to the local optimal solution and find more candidate motifs. Furthermore,the obtained motif clusters of the algorithm are refined by the information content to reduce the impact of false positive motifs on the accuracy of the results,and the precision and recall rate of identification results are improved. The experimental results on synthetic promoter sequences and ENCODE TF Chip-seq real data-sets show that,compared with existing multi-motif finding algorithms,the proposed algorithm can obtain high recall rate and precision, and recognize highly conservative motifs and match more real motifs.
What problem does this paper attempt to address?