On Line Course Organization

Ming Zhang, Weichun Wang, Yi Zhou, Yu Yang, Yuhong Xiong, Xiaoming Li
DOI: https://doi.org/10.1007/978-3-540-78139-4_14
2008-01-01
Abstract: To help users find on-line learning materials that answer more specific questions, we build a learning portal named Fusion. First, we develop FusionCrawler, a focused crawler based on link classification, to download potential course pages. We then use a binary classifier to pick out the course pages. Once the course pages are identified, we use FusionExtractor, a DOM-tree-based regular expression wrapper, to extract metadata. The metadata include Course Name, Instructor Information, Course Outline, and other relevant information, and are stored in a database behind the portal. Experimental results show that our approach to organizing on-line courses, based on focused crawling and metadata extraction, is effective. FusionCrawler retrieved on average 40-50% more on-topic learning materials than a normal focused crawler, while the average F1 of FusionExtractor is 85%. With metadata for more than 1,400 MIT OCW, 3,000 UIUC, and 1,000 WISC courses; 300 courses from GreatLearning with 3,000 Chinese course videos; and nearly 1,000 videos from the Internet Archive, the Fusion portal provides several kinds of search functions, such as quick search, advanced search, and semantic navigation browsing.
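The abstract does not give FusionExtractor's rules, so the following is only a minimal sketch of what a DOM-path-plus-regular-expression wrapper could look like. The class `TextNodeCollector`, the `RULES` table, the function `extract_metadata`, and the sample page are all hypothetical names and data introduced for illustration; the sketch uses only Python's standard `html.parser` and `re` modules and is not the authors' implementation.

```python
import re
from html.parser import HTMLParser


class TextNodeCollector(HTMLParser):
    """Collect (tag path, text) pairs: a flat stand-in for walking a DOM tree."""

    def __init__(self):
        super().__init__()
        self.path = []   # currently open tags, outermost first
        self.nodes = []  # (path string, stripped text) for each text node

    def handle_starttag(self, tag, attrs):
        self.path.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed inner tags.
        if tag in self.path:
            while self.path and self.path.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.nodes.append(("/".join(self.path), text))


# Hypothetical rules: a regex on the DOM path plus a regex on the node text.
RULES = {
    "course_name": (re.compile(r"(^|/)(title|h1)$"),
                    re.compile(r"^(?P<value>.+)$")),
    "instructor": (re.compile(r".*"),
                   re.compile(r"Instructors?:\s*(?P<value>.+)", re.I)),
}


def extract_metadata(html):
    """Return the first match of each rule, scanning text nodes in document order."""
    collector = TextNodeCollector()
    collector.feed(html)
    metadata = {}
    for path, text in collector.nodes:
        for field, (path_re, text_re) in RULES.items():
            if field in metadata or not path_re.search(path):
                continue
            match = text_re.search(text)
            if match:
                metadata[field] = match.group("value").strip()
    return metadata


# Made-up course page, used only to exercise the sketch.
SAMPLE = (
    "<html><head><title>CS 000 Example Course</title></head>"
    "<body><h1>CS 000 Example Course</h1>"
    "<p>Instructor: Prof. Jane Doe</p></body></html>"
)

if __name__ == "__main__":
    print(extract_metadata(SAMPLE))
```

Pairing a path pattern with a text pattern keeps each rule local to one region of the page, which is one plausible reason to combine DOM structure with regular expressions rather than running expressions over the raw HTML.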