Exact Distribution of the Occurrence Number for K -Tuples over an Alphabet of Non-Equal Probability Letters

Chan Zhou,Huimin Xie
DOI: https://doi.org/10.1007/s00026-004-0236-0
2005-01-01
Annals of Combinatorics
Abstract:. A nucleotide sequence can be considered as a realization of the non-equal-probability independently and identically distributed (niid) model. In this paper we derive the exact distribution of the occurrence number for each K -tuple with respect to the niid model by means of the Goulden-Jackson cluster method. An application of the probability function to get exact expectation curves [9] is presented, accompanied by comparison between the exact approach and the approximate solution.
What problem does this paper attempt to address?