Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities
Bhavya Chopra,Ananya Singha,Anna Fariha,Sumit Gulwani,Chris Parnin,Ashish Tiwari,Austin Z. Henley
DOI: https://doi.org/10.48550/arXiv.2310.16164
IF: 6.4588
2023-10-24
Human-Computer Interaction
Abstract:Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to identify these challenges. Our findings highlight key issues faced by data scientists, including contextual data retrieval, formulating prompts for complex tasks, adapting generated code to local environments, and refining prompts iteratively. Based on these insights, we propose actionable design recommendations, such as data brushing to support context selection, and inquisitive feedback loops to improve communications with AI-based assistants in data-science tools.