Soloist: Generating Mixed-Initiative Tutorials from Existing Guitar Instructional Videos Through Audio Processing

Bryan Wang,Mengyu Yang,Tovi Grossman
DOI: https://doi.org/10.1145/3411764.3445162
2021-01-22
Abstract:Learning musical instruments using online instructional videos has become increasingly prevalent. However, pre-recorded videos lack the instantaneous feedback and personal tailoring that human tutors provide. In addition, existing video navigations are not optimized for instrument learning, making the learning experience encumbered. Guided by our formative interviews with guitar players and prior literature, we designed Soloist, a mixed-initiative learning framework that automatically generates customizable curriculums from off-the-shelf guitar video lessons. Soloist takes raw videos as input and leverages deep-learning based audio processing to extract musical information. This back-end processing is used to provide an interactive visualization to support effective video navigation and real-time feedback on the user's performance, creating a guided learning experience. We demonstrate the capabilities and specific use-cases of Soloist within the domain of learning electric guitar solos using instructional YouTube videos. A remote user study, conducted to gather feedback from guitar players, shows encouraging results as the users unanimously preferred learning with Soloist over unconverted instructional videos.
Human-Computer Interaction
What problem does this paper attempt to address?
This paper aims to solve several key problems in learning the guitar through online teaching videos: 1. **Lack of Immediate Feedback**: Pre - recorded teaching videos cannot provide immediate feedback and personalized guidance like human teachers. By developing the Soloist system, the paper uses audio processing technology to extract musical information and compare it with the user's performance, thereby providing real - time feedback, including melody visualization, note correctness scoring, etc. 2. **Non - optimized Video Navigation**: Existing video navigation tools are not suitable for instrument learning, causing users to frequently shift their attention between the instrument and the video player during practice, which affects learning efficiency. The paper proposes a waveform - and - area - based navigation method, allowing users to locate and replay video segments of interest more efficiently. 3. **Limited Teaching Resources and Difficult Customization**: Existing music teaching systems usually need to hand - craft new tutorials from scratch, which is both time - consuming and costly. The Soloist system can automatically generate personalized tutorials from existing music teaching videos, reducing the amount of customization work. By solving the above problems, the paper aims to improve the user experience and effectiveness of learning the guitar using online teaching videos.