ActSort: An active-learning accelerated cell sorting algorithm for large-scale calcium imaging datasets

Yiqi Jiang,Hakki Orhun Akengin,Ji Zhou,Mehmet Anil Aslihak,Yang Li,Oscar Hernandez,Sadegh Ebrahimi,Yanping Zhang,Hakan Inan,Omar Jaidar,Christopher Miranda,Fatih Dinc,Marta Blanco-Pozo,Mark J. Schnitzer
DOI: https://doi.org/10.1101/2024.08.21.609011
2024-08-22
Abstract:Recent advances in calcium imaging enable simultaneous recordings of up to a million neurons in behaving animals, producing datasets of unprecedented scales. Although individual neurons and their activity traces can be extracted from these videos with automated algorithms, the results often require human curation to remove false positives, a laborious process called 'cell sorting'. To address this challenge, we introduce ActSort, an active-learning algorithm for sorting large-scale datasets that integrates features engineered by domain experts together with data formats with minimal memory requirements. By strategically bringing outlier cell candidates near the decision boundary up for annotation, ActSort reduces human labor to about 1-3% of cell candidates and improves curation accuracy by mitigating annotator bias. To facilitate the algorithm's widespread adoption among experimental neuroscientists, we created a user-friendly software and conducted a first-of-its-kind benchmarking study involving about 160,000 annotations. Our tests validated ActSort's performance across different experimental conditions and datasets from multiple animals. Overall, ActSort addresses a crucial bottleneck in processing large-scale calcium videos of neural activity and thereby facilitates systems neuroscience experiments at previously inaccessible scale
Neuroscience
What problem does this paper attempt to address?