Combining Gesture and Voice Control for Mid-Air Manipulation of CAD Models in VR Environments

Markus Friedrich,Stefan Langer,Fabian Frey
DOI: https://doi.org/10.48550/arXiv.2011.09138
2020-11-18
Abstract:Modeling 3D objects in domains like Computer Aided Design (CAD) is time-consuming and comes with a steep learning curve needed to master the design process as well as tool complexities. In order to simplify the modeling process, we designed and implemented a prototypical system that leverages the strengths of Virtual Reality (VR) hand gesture recognition in combination with the expressiveness of a voice-based interface for the task of 3D modeling. Furthermore, we use the Constructive Solid Geometry (CSG) tree representation for 3D models within the VR environment to let the user manipulate objects from the ground up, giving an intuitive understanding of how the underlying basic shapes connect. The system uses standard mid-air 3D object manipulation techniques and adds a set of voice commands to help mitigate the deficiencies of current hand gesture recognition techniques. A user study was conducted to evaluate the proposed prototype. The combination of our hybrid input paradigm shows to be a promising step towards easier to use CAD modeling.
Human-Computer Interaction,Computational Geometry
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to simplify the 3D computer - aided design (CAD) modeling process and provide a more intuitive, easy - to - learn and efficient 3D model operation method by combining gesture recognition and voice control technologies in the virtual reality (VR) environment. Specifically, the paper aims to: 1. **Overcome the limitations of traditional CAD tools**: Traditional CAD modeling tools mainly rely on mouse control, which has the problems of complexity and a steep learning curve when operating 3D objects on 2D input devices, especially for beginners. 2. **Take advantage of VR and gesture recognition**: With the emergence of affordable VR devices and robust gesture recognition systems, researchers hope to use these immersive input methods to improve the intuitiveness of 3D modeling, reduce the learning difficulty, and improve efficiency. 3. **Solve the challenges of gesture recognition systems**: Although gesture recognition systems have advantages in some aspects, they still face many challenges in practical applications, such as lack of precision and robustness, and may cause fatigue after long - term use (for example, "gorilla arm syndrome"). Therefore, the paper proposes a hybrid method combining gesture recognition and voice control to make up for the deficiencies of a single technology. 4. **Achieve intuitive operation of CSG tree structures**: The paper adopts the Constructive Solid Geometry (CSG) tree structure to represent 3D models. This representation method not only saves memory but is also more intuitive for beginners, enabling users to build and understand 3D models from basic shapes. Through the above goals, the paper proposes a new interaction concept and verifies its effectiveness and potential advantages through a prototype system and user studies.