Abstract:Population neuroscience is a discipline generated by the intersection of neuroscience (brain), genomics (gene), and epidemiology (environment), providing new perspectives and methods for studying a full picture of human brain structure and function. Resting state functional magnetic resonance imaging (rsfMRI) has become one of the most widely used tool for collecting brain phenotype information in population neuroscience. Its measurement of spontaneous neural activity in the human brain is safe, reliable, and easy to operate and implement in large-scale studies. RsfMRI can reveal the patterns of cognitive activity and psychiatric conditions. Its reliability is a fundamental requirement for measuring individual differences in the spontaneous brain activity. However, the importance of reliability in rsfMRI has not been fully recognized by the field. This article first provides a brief introduction to the definition and calculation of reliability of rsfMRI. By reviewing the literature, we describe the research progresses on the reliability of rsfMRI. Intraclass correlation (ICC) is the most ideal indicator for measuring the reliability of rsfMRI and used in most current research on rsfMRI to calculate the test-retest reliability of rsfMRI measurements. The overall proportion of reliability research on rsfMRI is small, and empirical research on reliability needs to be strengthened. The attention to the reliability of rsfMRI in the field is constantly increasing, and the reliability of rsfMRI is often involved and discussed. The overall measurement reliability level of rsfMRI is mostly below 0.6, mostly around 0.4; The reliability obtained from different studies and measurement indicators varies greatly, and there is also significant heterogeneity in the reliability between brain networks. Secondly, a study on the reliability of rsfMRI was proposed from three dimensions: demographic characteristics, sample size, and measurement arrangement. The impact of various factors in these three dimensions on the reliability of rsfMRI was summarized, including gender, age, sample size, scanner, magnetic field intensity, scanning parameters, physiological noise, head movement, body cognitive changes, open/closed eye status, retest interval, and scanning duration factors such as data preprocessing and functional indicators have varying degrees of impact on the reliability of rsfMRI. Finally, a guideline for the reliability study of population neuroscience with rsfMRI is proposed: (1) Referring to the brain chart of the entire human life cycle and using standardized modeling based on population neuroscience; (2) When conducting rsfMRI, report reliability as a necessary content and provide measurement indicator reliability information; (3) Drawing on a large amount of open access rsfMRI data resources for research, increasing the sample size, avoiding subjects from closing their eyes and sleeping during scanning, and the scanning interval should not be too long, the shorter the better. It is necessary to extend the scanning time each time; (4) Continuously optimizing scanning parameters, reducing scanning noise, and conducting standardized scans to improve the comfort level of participants during scanning, in order to reduce physical movement and cognitive changes, and monitor the scanning process in real time to pay attention to individual physical and mental changes; (5) Adopting standardized procedures to standardize the data rest state function and preprocessing process, selecting calculation indicators and methods to achieve the goal of improving research reliability. To better serve the in-depth development of population neuroscience and the widespread use of rsfMRI, it is imperative to comprehensively reveal the reliability and influencing factors of rsfMRI. Population neuroscience still needs to improve research on the impact of demographic characteristics, sample size, and testing arrangements on the reliability of rsfMRI measurements. Soon, population neuroscience should also refer to disciplines such as psychometric to establish a complete set of rsfMRI data collection standards and guidelines, truly achieving stable and reliable measurement of data.

The complexity of measuring reliability in learning tasks: An illustration using the Alternating Serial Reaction Time Task

Reaction-time task reliability is more accurately computed with permutation-based split-half correlations than with Cronbach's alpha

A measure of reliability convergence to select and optimize cognitive tasks for individual differences research

Resting-state Fmri and Population Neuroscience: Progresses and Guidelines for Reliability Research

The Burden of Reliability: How Measurement Noise Limits Brain-Behaviour Predictions

Measuring the Reliability of Reinforcement Learning Algorithms

On the reliability of behavioral measures of cognitive control: retest reliability of task-inhibition effect, task-preparation effect, Stroop-like interference, and conflict adaptation effect

Psychological Science Needs a Standard Practice of Reporting the Reliability of Cognitive-Behavioral Measurements

Test–retest reliability of reinforcement learning parameters

Reliably Measuring Learning-Dependent Distractor Suppression with Eye Tracking

Reliability of the serial reaction time task: If at first you don't succeed, try, try, try again

On the (un)reliability of common behavioral and electrophysiological measures from the stop signal task: Measures of inhibition lack stability over time

Do current statistical learning tasks capture stable individual differences in children? An investigation of task reliability across modality

Estimating Test-Retest Reliability in the Presence of Self-Selection Bias and Learning/Practice Effects

Consistency within change: Evaluating the psychometric properties of a widely used predictive-inference task

Methods to split cognitive task data for estimating split-half reliability: A comprehensive review and systematic assessment

When most fMRI connectivity cannot be detected: insights from time course reliability

Reproducibility of Frequency-Dependent Low Frequency Fluctuations in Reaction Time over Time and Across Tasks.

Impact of analytic decisions on test-retest reliability of individual and group estimates in functional magnetic resonance imaging: a multiverse analysis using the monetary incentive delay task

Hierarchical-Model Insights for Planning and Interpreting Individual-Difference Studies of Cognitive Abilities

A proof-of-concept study testing the factor structure of the Stop Signal Task: overlap with substance use and mental health symptoms