Collaboration of Digital Human Gestures and Teaching Materials for Enhanced Integration in MOOC Teaching Scenarios

Yaxin Liu,Xiaomei Nie,Zhiyong Wu
DOI: https://doi.org/10.1007/978-3-031-61953-3_19
2024-01-01
Abstract:Intelligent-driven digital humans, typically taking text or voice as input, achieve realistic human-like images or models with driven facial expressions, lip synchronization, and body movements, and have already been widely used in short video production and other fields. This technology also offers a novel solution to the issues of teachers' reluctance to appear on camera and the lengthy recording process in MOOCs. However, using digital humans for teaching requires educational adaptation in MOOC scenarios. This paper, starting from the teacher's indicative functions in teaching, enhances the interaction between digital humans and the virtual teaching environment based on existing intelligent-driven digital human technology. We record digital human poses based on the BVH skeletal structure, using text scripts and corresponding Power-Points (PPTs) as the initial input, and then complete the synthesis of collaborative gestures through three steps of Data Preparation, Data-driven Pose Generation and Collaborative Gesture Synthesis. The first step obtains the timing and position information of keywords, which will be used for the generation of inverse kinematics(IK)-controlled gesture animation in the third step. After stitching and rendering, the digital human will behaviorally emphasize key teaching information. Future work will focus on integrating varied collaborative gestures, incorporating spatial and temporal input data, and calculating relative distances and orientations in complex scenes. This will establish a robust mapping between objects and gestures, enhancing the collaboration between digital humans and educational materials in 3D space.
What problem does this paper attempt to address?