Abstract: Previous studies have shown that mathematics self-efficacy is related to mathematics performance. For students, parents, and front-line scholars, it is urgent and important to study how the relationship between mathematics achievement and self-efficacy is measured. This research aimed to examine how to measure mathematics self-efficacy and to identify which of three traits and which of three methods best reflect individuals' self-efficacy. The study used a multitrait-multimethod (MTMM) design to measure mathematics self-efficacy by constructing confirmatory factor analysis (CFA) models. "Number and Algebra," "Graphics and Geometry," and "Synthesis and Practice" were treated as the three traits, and General-Math-Task-referenced self-efficacy, Unconventional-Math-Problem-referenced self-efficacy, and Motivated Strategies for Learning Questionnaire (MSLQ) self-efficacy were treated as the three methods. Data were obtained through a questionnaire survey, and a total of 100 students completed all the questionnaires. Excel was used to collect math scores, and SPSS version 26.0 and AMOS version 26.0 were used to manage the data, test the hypothesis, and build the models using the MTMM design and CFA. CFA was used to verify convergent validity and discriminant validity. A total of eight models were constructed, including first-order and second-order CFA models, and Model D was finally selected as the best model among the second-order CFA models. The results showed that, among the three traits, "Synthesis and Practice" reflected self-efficacy most strongly, and that, among the three methods, MSLQ reflected self-efficacy most strongly. These findings can help improve students' self-efficacy from the perspective of mathematics as a subject.
In addition, the research confirmed that CFA can be used to model MTMM data and found that, in the second-order model, the correlation between Unconventional-Math-Problem-referenced self-efficacy and MSLQ is higher than that for General-Math-Task-referenced self-efficacy. The study has theoretical significance for improving students' mathematics self-efficacy levels.
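
To make the three-trait by three-method layout concrete, the following is a minimal Python sketch of how an MTMM correlation matrix can be summarized. It is not the authors' analysis: the study fitted CFA models in AMOS, whereas this sketch only compares average correlations in the style of a classical MTMM inspection. The data are simulated, and the loadings (0.7, 0.4, 0.5) and the function names `simulate_scores` and `summarize_mtmm` are assumptions made purely for illustration.

```python
import numpy as np

TRAITS = ["Number and Algebra", "Graphics and Geometry", "Synthesis and Practice"]
METHODS = ["General-Math-Task", "Unconventional-Math-Problem", "MSLQ"]


def simulate_scores(n_students=100, seed=0):
    """Simulate one score per (method, trait) pair. Illustrative data only."""
    rng = np.random.default_rng(seed)
    trait_factors = rng.normal(size=(n_students, len(TRAITS)))
    method_factors = rng.normal(size=(n_students, len(METHODS)))
    columns, labels = [], []
    for m in range(len(METHODS)):
        for t in range(len(TRAITS)):
            noise = rng.normal(size=n_students)
            # Assumed loadings: 0.7 on the trait, 0.4 on the method, 0.5 noise.
            score = 0.7 * trait_factors[:, t] + 0.4 * method_factors[:, m] + 0.5 * noise
            columns.append(score)
            labels.append((m, t))
    return np.column_stack(columns), labels


def summarize_mtmm(scores, labels):
    """Average the correlations within each block type of the MTMM matrix."""
    corr = np.corrcoef(scores, rowvar=False)
    groups = {
        "same_trait_diff_method": [],  # evidence for convergent validity
        "diff_trait_same_method": [],  # shared method variance
        "diff_trait_diff_method": [],  # baseline
    }
    for i in range(len(labels)):
        for j in range(i + 1, len(labels)):
            (mi, ti), (mj, tj) = labels[i], labels[j]
            if ti == tj:
                groups["same_trait_diff_method"].append(corr[i, j])
            elif mi == mj:
                groups["diff_trait_same_method"].append(corr[i, j])
            else:
                groups["diff_trait_diff_method"].append(corr[i, j])
    return {name: float(np.mean(values)) for name, values in groups.items()}


if __name__ == "__main__":
    scores, labels = simulate_scores()
    for name, value in summarize_mtmm(scores, labels).items():
        print(f"{name}: {value:.2f}")
```

In such a summary, convergent validity is suggested when the same-trait, different-method average is clearly the largest, and discriminant validity when the different-trait averages are comparatively small. A CFA model addresses the same questions by estimating trait and method factors explicitly.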

Investigating the reliability of CET-SET using Multi-Facet Rasch Model

Investigating the Reliability of CET6 Essay Scoring: An Application of Generalizability Theory and Many-facet Rasch Model

STUDY OF SOURCES OF SCORE VARIABILITY IN PERFORMANCE ASSESSMENT USING MFRM: A CASE OF SPEAKING TEST IN PETS BAND 3

Multi-faceted Rasch measurement and bias patterns in EFL writing performance assessment

Exploring Variability Sources in Student Evaluation of Teaching via Many-Facet Rasch Model

Applying Unidimensional and Multidimensional Item Response Theory Models in Testlet-Based Reading Assessment

Application of Bi-factor MIRT and Higher-order CDM Models to an In-house EFL Listening Test for Diagnostic Purposes

A Comparison of EFL Raters' Essay-Rating Processes across Two Types of Rating Scales.

A Corpus-Based Investigation into the Validity of the CET-SET Group Discussion

Rater severity differences in English language as a second language speaking assessment based on rating experience, training experience, and teaching experience through many-faceted Rasch measurement analysis

Test Fairness: Examining Differential Functioning of the Reading Comprehension Section of the GSEEE in China

A Robust Computerized Adaptive Testing Approach in Educational Question Retrieval

A Validating Study on Consecutive Interpreting Test Using Many-facet Rasch Model

Scale Separation Reliability: What Does It Mean in the Context of Comparative Judgment?

Adaptive testing with the GGUM-RANK multidimensional forced choice model: Comparison of pair, triplet, and tetrad scoring

Comparison of Automatic and Expert Teachers’ Rating of Computerized English Listening-Speaking Test

A Multi-Faceted Approach to Scrutinizing the Reliability of a Measure of STEM Teacher Strategic Knowledge

How can dense results be differentiated in comprehensive evaluations? A hybrid information filtering model

The Stabilizing Influences of Linking Set Size and Model-Data Fit in Sparse Rater-Mediated Assessment Networks

Measuring mathematics self-efficacy: Multitrait-multimethod comparison

Application of an Automated Essay Scoring engine to English writing assessment using Many-Facet Rasch Measurement