Abstract:Technical question and answer Q&A platforms, such as Stack Overflow, provide a platform for users to ask and answer questions about a wide variety of programming topics. These platforms accumulate a large amount of knowledge, including hundreds of thousands lines of source code. Developers can benefit from the source code that is attached to the questions and answers on Q&A platforms by copying or learning from (parts of) it. By understanding how developers utilize source code from Q&A platforms, we can provide insights for researchers which can be used to improve next-generation Q&A platforms to help developers reuse source code fast and easily. In this paper, we first conduct an exploratory study on 289 files from 182 open-source projects, which contain source code that has an explicit reference to a Stack Overflow post. Our goal is to understand how developers utilize code from Q&A platforms and to reveal barriers that may make code reuse more difficult. In 31.5% of the studied files, developers needed to modify source code from Stack Overflow to make it work in their own projects. The degree of required modification varied from simply renaming variables to rewriting the whole algorithm. Developers sometimes chose to implement an algorithm from scratch based on the descriptions from Stack Overflow answers, even if there was an implementation readily available in the post. In 35.5% of the studied files, developers used Stack Overflow posts as an information source for later reference. To further understand the barriers of reusing code and to obtain suggestions for improving the code reuse process on Q&A platforms, we conducted a survey with 453 open-source developers who are also on Stack Overflow. We found that the top 3 barriers that make it difficult for developers to reuse code from Stack Overflow are: (1) too much code modification required to fit in their projects, (2) incomprehensive code, and (3) low code quality. We summarized and analyzed all survey responses and we identified that developers suggest improvements for future Q&A platforms along the following dimensions: code quality, information enhancement & management, data organization, license, and the human factor. For instance, developers suggest to improve the code quality by adding an integrated validator that can test source code online, and an outdated code detection mechanism. Our findings can be used as a roadmap for researchers and developers to improve code reuse.

Discovering, Explaining and Summarizing Controversial Discussions in Community Q&A Sites

Automatic Solution Summarization for Crash Bugs

Exploring Controversy Via Sentiment Divergences of Aspects in Reviews.

ControversialQA: Exploring Controversy in Question Answering

An empirical study of question discussions on Stack Overflow

What are developers talking about? An analysis of topics and trends in Stack Overflow

Explaining Controversy on Social Media via Stance Summarization

AnswerBot: an Answer Summary Generation Tool Based on Stack Overflow

Insights into Software Development Approaches: Mining Q&A Repositories

Reading Answers on Stack Overflow: Not Enough!

An Empirical Study on Developer Interactions in StackOverflow.

Exploring Research Interest in Stack Overflow -- A Systematic Mapping Study and Quality Evaluation

Is Stack Overflow Overflowing With Questions and Tags

How do developers utilize source code from stack overflow?

Is It Good To Be Like Wikipedia?: Exploring The Trade-Offs Of Introducing Collaborative Editing Model To Q& A Sites

Software Engineers' Questions and Answers on Stack Exchange

Towards Better Answers: Automated Stack Overflow Post Updating

Question Answering System Based on Community QA

Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack Overflow

How Do Java Developers Reuse StackOverflow Answers in Their GitHub Projects?

Searching crowd knowledge to recommend solutions for API usage tasks