RWMF: A Real-World Multimodal Foodlog Database.

Pengfei Zhou,Cong Bai,Kaining Ying,Jie Xia,Lixin Huang
DOI: https://doi.org/10.1109/icpr48806.2021.9412433
2021-01-01
Abstract:With the increasing health concerns on diet, it's worthwhile to develop an intelligent assistant that can help users eat healthier. Such an assistant can automatically give personal advice for the user's diet and generate health report about eating on a regular basis. To boost the research on such diet assistant, we establish a real-world foodlog database using various methods such as filter, cluster and graph convolutional network. This database is built based on real-world lifelog and medical data, which is named as Real-World Multimodal Foodlog (RWMF). It contains 7500 multimodal pairs, and each pair consists of a food image paired with a line of personal biometrics data (such as Blood Glucose) and a textual food description of food composition paired with a line of food nutrition data. In this paper, we present the detailed procedures for setting up the database. We evaluate the performance of RWMF using different food image classification and cross-modal retrieval approaches. We also test the performance of multimodal fusion on RWMF through ablation experiments. The experimental results show that the RWMF database is quite challenging and can be widely used to evaluate the performance of food analysis methods based on multimodal data.
What problem does this paper attempt to address?