Synthetic Heparan Sulfate Standards and Machine Learning Facilitate the Development of Solid-State Nanopore Analysis

Ke Xia,James T. Hagan,Li Fu,Brian S. Sheetz,Somdatta Bhattacharya,Fuming Zhang,Jason R. Dwyer,Robert J. Linhardt
DOI: https://doi.org/10.1073/pnas.2022806118
2021-01-01
Abstract:The application of solid-state (SS) nanopore devices to single-molecule nucleic acid sequencing has been challenging. Thus, the early successes in applying SS nanopore devices to the more difficult class of biopolymer, glycosaminoglycans (GAGs), have been surprising, motivating us to examine the potential use of an SS nanopore to analyze synthetic heparan sulfate GAG chains of controlled composition and sequence prepared through a promising, recently developed chemoenzymatic route. A minimal representation of the nanopore data, using only signal magnitude and duration, revealed, by eye and image recognition algorithms, clear differences between the signals generated by four synthetic GAGs. By subsequent machine learning, it was possible to determine disaccharide and even monosaccharide composition of these four synthetic GAGs using as few as 500 events, corresponding to a zeptomole of sample. These data suggest that ultrasensitive GAG analysis may be possible using SS nanopore detection and well-characterized molecular training sets.
What problem does this paper attempt to address?