Abstract: Cuisine is a style of cooking usually associated with a specific geographic region. Recipes from different cuisines shared on the web are an indicator of culinary cultures in different countries. Therefore, analysis of these recipes can lead to a deep understanding of food from a cultural perspective. In this paper, we perform the first cross-region recipe analysis by jointly using recipe ingredients, food images, and attributes such as cuisine and course (e.g., main dish and dessert). To this end, we propose a culinary culture analysis framework that discovers topics of ingredient bases and visualizes them to enable various applications. We first propose a probabilistic topic model to discover cuisine-course specific topics. A manifold ranking method is then used to incorporate deep visual features and retrieve food images for topic visualization. Finally, we apply the topic modeling and visualization method to three applications: 1) multimodal cuisine summarization with both recipe ingredients and images, 2) cuisine-course pattern analysis, including topic-specific cuisine distribution and cuisine-specific course distribution of topics, and 3) cuisine recommendation for both cuisine-oriented and ingredient-oriented queries. Through these three applications, we can analyze culinary cultures at both macro and micro levels. We conduct experiments on Yummly-66K, a recipe database with 66,615 recipes from 10 cuisines on Yummly. Qualitative and quantitative evaluation results validate the effectiveness of the topic modeling and visualization, and demonstrate the advantage of the framework in using rich recipe information to analyze and interpret culinary cultures from different regions.
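
The abstract names manifold ranking as the step that retrieves food images for a topic but gives no formulation. As a rough illustration only, the sketch below implements the standard closed-form manifold ranking, f = (I - alpha * S)^-1 y, over a Gaussian affinity graph. The function name `manifold_ranking`, the parameters `alpha` and `sigma`, the Gaussian kernel, and the toy 2-D "features" are all assumptions made for this example; the paper's actual graph construction and visual features may differ.

```python
import numpy as np


def manifold_ranking(features, query_scores, alpha=0.9, sigma=1.0):
    """Closed-form manifold ranking: solve (I - alpha * S) f = y.

    features:     (n, d) array, one (hypothetical) visual feature per image.
    query_scores: (n,) array, initial relevance y (nonzero for seed images).
    """
    x = np.asarray(features, dtype=float)
    y = np.asarray(query_scores, dtype=float)
    # Pairwise squared Euclidean distances between feature vectors.
    sq_dists = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=-1)
    # Gaussian affinity matrix with zero self-similarity (an assumed kernel).
    w = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(w, 0.0)
    # Symmetric normalization S = D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(w.sum(axis=1))
    s = w * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # Solve the linear system rather than forming the inverse explicitly.
    return np.linalg.solve(np.eye(len(x)) - alpha * s, y)


# Toy data: two well-separated clusters standing in for image features.
features = np.array(
    [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0], [5.1, 5.0]]
)
# Image 0 is the only seed; relevance spreads to its neighbors on the graph.
query = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
scores = manifold_ranking(features, query)
ranking = np.argsort(-scores)  # image indices, most relevant first
```

In this toy setup the seed image and its two cluster neighbors rank above the images in the distant cluster, which is the behavior one would want when picking representative images for a topic.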

Cross-Modal Recipe Retrieval: How to Cook This Dish?

Cross-modal Recipe Retrieval with Stacked Attention Model

Cross-domain Cross-modal Food Transfer

Cross-modal Recipe Retrieval with Rich Food Attributes

Deep Understanding Of Cooking Procedure For Cross-Modal Recipe Retrieval

MCEN: Bridging Cross-Modal Gap Between Cooking Recipes and Dish Images with Latent Variable Model

Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes with Semantic Consistency and Attention Mechanism

Deep-based Ingredient Recognition for Cooking Recipe Retrieval

Cross-Modal Recipe Retrieval with Self-Attention Mechanism

Cross-domain Food Image-to-Recipe Retrieval by Weighted Adversarial Learning

Revamping Image-Recipe Cross-Modal Retrieval with Dual Cross Attention Encoders

CREAMY: Cross-Modal Recipe Retrieval By Avoiding Matching Imperfectly

Disambiguity and Alignment: An Effective Multi-Modal Alignment Method for Cross-Modal Recipe Retrieval

Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance

Exploring latent weight factors and global information for food-oriented cross-modal retrieval

Cross-Modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Representation Learning

Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval

Cross-lingual Adaptation for Recipe Retrieval with Mixup

You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis