CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark
Ge Zhang,Xinrun Du,Bei Chen,Yiming Liang,Tongxu Luo,Tianyu Zheng,Kang Zhu,Yuyang Cheng,Chunpu Xu,Shuyue Guo,Haoran Zhang,Xingwei Qu,Junjie Wang,Ruibin Yuan,Yizhi Li,Zekun Wang,Yudong Liu,Yu-Hsuan Tsai,Fengji Zhang,Chenghua Lin,Wenhao Huang,Wenhu Chen,Jie Fu
2024-03-18
Abstract:As the capabilities of large multimodal models (LMMs) continue to advance, evaluating the performance of LMMs emerges as an increasing need. Additionally, there is an even larger gap in evaluating the advanced knowledge and reasoning abilities of LMMs in non-English contexts such as Chinese. We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to evaluate LMMs on tasks demanding college-level subject knowledge and deliberate reasoning in a Chinese context. CMMMU is inspired by and strictly follows the annotation and analysis pattern of MMMU.
Computation and Language,Artificial Intelligence,Computer Vision and Pattern Recognition