Decoding STAR code for tolerating simultaneous disk failure and silent errors

Jianqiang Luo,Cheng Huang,Lihao Xu
DOI: https://doi.org/10.1109/DSN.2010.5544948
2010-01-01
Abstract:As storage systems grow in size and complexity, various hardware and software component failures inevitably occur, resulting in disk malfunction in failures, as well as silent errors. Existing techniques and schemes overcome the failures and silent errors in a separate fashion. In this paper, we advocate using the STAR code as a unified and systematic mechanism to simultaneously tolerate failures on one disk and silent errors on another. By exploring the unique geometric structure of the STAR code, we propose a novel efficient decoding algorithm - EEL. Both theoretical and experimental performance evaluations show that EEL constantly outperforms a naive Try-and-Test approach by large factors in overall decoding throughput.
What problem does this paper attempt to address?