On triangle inequalities of correlation-based distances for gene expression profiles

Jiaxing Chen,Yen Kaow Ng,Lu Lin,Xianglilan Zhang,Shuaicheng Li
DOI: https://doi.org/10.1186/s12859-023-05161-y
IF: 3.307
2023-02-11
BMC Bioinformatics
Abstract:Distance functions are fundamental for evaluating the differences between gene expression profiles. Such a function would output a low value if the profiles are strongly correlated—either negatively or positively—and vice versa. One popular distance function is the absolute correlation distance, , where is similarity measure, such as Pearson or Spearman correlation. However, the absolute correlation distance fails to fulfill the triangle inequality, which would have guaranteed better performance at vector quantization, allowed fast data localization, as well as accelerated data clustering.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?