Phasing and Missing Data Recovery in Family Trios

D Brinza,JW He,WD Mao,A Zelikovsky
DOI: https://doi.org/10.1504/ijbra.2005.007580
2005-01-01
International Journal of Bioinformatics Research and Applications
Abstract:Although there exist many phasing methods for unrelated adults or pedigrees, phasing and missing data recovery for data representing family trios is lagging behind. This paper is an attempt to fill this gap by considering the following problem. Given a set of genotypes partitioned into family trios, find for each trio a quartet of parent haplotypes which agree with all three genotypes and recover the SNP values missed in given genotype data. Our contributions include (i) formulating the pure-parsimony trio phasing and the trio missing data recovery problems, (ii) proposing two new greedy and integer linear programming based solution methods, and (iii)extensive experimental validation of proposed methods showing advantage over the previously known methods.
What problem does this paper attempt to address?