MUWS’22: 1st International Workshop on Multimodal Understanding for the Web and Social Media
Sherzod Hakimov,Karan Sikka,Gullal S. Cheema,R. Shah,Marc A. Kastner
DOI: https://doi.org/10.1145/3487553.3524873
2022-04-25
Abstract:The 1st International Workshop on Multimodal Understanding for the Web and Social Media (MUWS 2022) is co-located with The Web Conference (WWW) and held on the 26th of April, 20221. Multimodal learning and analysis is an emerging research area that cuts through several disciplines like Computer Vision, Natural Language Processing (NLP), Speech Processing, and Multimedia. Recently, several multimodal learning techniques have shown the benefit of combining multiple modalities in video representation learning and downstream tasks on videos. At the core, these methods are focused on modelling the modalities and their complex interactions by using large amounts of data, different loss functions and deep neural network architectures. Although these research directions are exciting and challenging, interdisciplinary fields such as semiotics are rarely considered. Literature in semiotics provides a detailed theory and analysis on meaning creation through signs and symbols via multiple modalities. In general, it provides a compelling view of multimodality and perception that can further expand computational research and applications on the web and social media. The goal of the interdisciplinary MUWS Workshop is to bring together researchers and practitioners from the fields of Information Retrieval, Natural Language Processing, Computer Vision, Human Computation, and Semiotics to discuss and evaluate methods and solutions for effective and efficient analytics of multimodal information present in the Web or social media. We are interested in approaches, tasks, and metrics for effectively analysing multimedia information such as image-text pairs and videos to design methodologies that jointly consider information from multiple modalities. The interdisciplinary nature of processing such multimodal data involves combining ideas and methods from the fields mentioned above. We envision the workshop as a forum for researchers and practitioners from academia and industry for original contributions and practical application on multimodal information processing, mining, retrieval, search, and management. The workshop features advanced methods for combining visual and textual content for problems such as fake news detection, predicting reliability and popularity of news articles, generating image narrative with emotion, and injecting knowledge graph information to improve visual question answering performance. We would like to take this opportunity to sincerely thank the authors and presenters for their inspiring contributions to the workshop. Our sincere thanks are due to the program committee members for reviewing the submissions and ensuring the high quality of our workshop program. We also thank Ichiro Ide for his keynote talk, Chiao-I Tseng and Christian Otto for their invited talks in the workshop. We are also very grateful to the organisers of The Web Conference 2022, and particularly the Workshops Chairs, Nathalie Hernandez and Preslav Nakov, for their support with the workshop organisation.
Computer Science