Proceedings of the First Workshop on Multilingual Modeling

Jagadeesh Jagarlamudi, Sujith Ravi, Xiaojun Wan, Hal Daumé III
2012-01-01
Abstract: The burgeoning community of multilingual users poses a variety of new problems and also opens up new opportunities. The large number of multilingual corpora calls for effective and scalable ways of organizing them. This additional data in different languages provides a different perspective. Resource-poor languages can use the training data available in other languages to improve the accuracy of monolingual applications. Recently, we have seen an increasing number of researchers working on multilingual problems, ranging from mining comparable corpora from the web to multilingual part-of-speech tagging. It is encouraging to see how the abundant training data in resource-rich languages (such as English) is used along with very little training data in the target language to solve problems in resource-poor languages. In addition, resource-rich languages have been used successfully to bridge the language barrier between two resource-poor languages. This workshop aims to bring researchers working on different aspects of multilingualism onto common ground to share their experiences so that the entire community can benefit.