Abstract: Nowadays, software engineers use a variety of online media to search and become informed of new and interesting technologies, and to learn from and help one another. We refer to these kinds of online media which help software engineers improve their performance in software development, maintenance and test processes as software information sites. It is common to see tags in software information sites and many sites allow users to tag various objects with their own words. Users increasingly use tags to describe the most important features of their posted contents or projects. In this paper, we propose TagCombine, an automatic tag recommendation method which analyzes objects in software information sites. TagCombine has 3 different components: 1. multi-label ranking component which considers tag recommendation as a multi-label learning problem; 2. similarity based ranking component which recommends tags from similar objects; 3. tag-term based ranking component which considers the relationship between different terms and tags, and recommends tags after analyzing the terms in the objects. We evaluate TagCombine on 2 software information sites, StackOverflow and Freecode, which contain 47,668 and 39,231 text documents, respectively, and 437 and 243 tags, respectively. Experiment results show that for StackOverflow, our TagCombine achieves [email protected] and [email protected] scores of 0.5964 and 0.7239, respectively; For Freecode, it achieves [email protected] and [email protected] scores of 0.6391 and 0.7773, respectively. Moreover, averaging over StackOverflow and Freecode results, we improve TagRec proposed by Al-Kofahi et al. by 22.65% and 14.95%, and the tag recommendation method proposed by Zangerle et al. by 18.5% and 7.35% for [email protected] and [email protected] scores.

Web Clustering Based on Tag Set Similarity.

A Probabilistic Method for Tag Ranking in Tagging System

TagClus: a Random Walk-Based Method for Tag Clustering

WTCluster: utilizing tags for web services clustering

Tags Are Related: Measurement of Semantic Relatedness Based on Folksonomy Network

An Image Clustering Algorithm in Collaborative Tagging System

A Neighborhood Search Method for Link-Based Tag Clustering

Exploiting User Tagging for Web Service Co-Clustering.

Tag recommendation in software information sites

Complex Network Analysis of Tag As a Social Network

Tag Clustering Algorithm Using Object-based Feature Vector

Clustering Web Services to Facilitate Service Discovery

Analysis Of Tag Within Online Social Networks

Wt-Lda: User Tagging Augmented Lda For Web Service Clustering

Annotation-aware web clustering based on topic model and random walks

A Web Service Clustering Method Based on Semantic Similarity and Multidimensional Scaling Analysis

Tag Clustering and Refinement on Semantic Unity Graph

Tag Clusters as Information Retrieval Interfaces

Web Image Clustering by Consistent Utilization of Visual Features and Surrounding Texts.

WSTRank: Ranking Tags to Facilitate Web Service Mining

A New Measurement of Similarity and Related Clustering Algorithm