Abstract: With increasing knowledge demands and limited availability of expertise and resources within organizations, professionals often rely on external sources when seeking knowledge. Online knowledge communities are Internet-based virtual communities that specialize in knowledge seeking and sharing. They provide a virtual media environment where individuals with common interests seek and share knowledge across time and space. A large online community may have millions of participants who have accrued a large knowledge repository with millions of text documents. However, due to the low information quality of user-generated content, it is very challenging to develop an effective knowledge management system for facilitating knowledge seeking and sharing in online communities. Knowledge management literature suggests that effective knowledge management should make accessible not only written knowledge but also experts who are a source of information and can perform a given organizational or social function. Existing expert finding systems evaluate one's expertise based on either the contents of authored documents or one's social status within his or her knowledge community. However, very few studies consider both indicators collectively. In addition, very few studies focus on virtual communities, where information quality is often poorer than in organizational knowledge repositories. In this study we propose a novel expert finding algorithm, ExpertRank, that evaluates expertise based on both document-based relevance and one's authority in his or her knowledge community. We modify the PageRank algorithm to evaluate one's authority so that it reduces the effect of certain biasing communication behavior in online communities. We explore three different expert ranking strategies that combine document-based relevance and authority: linear combination, cascade ranking, and multiplication scaling. We evaluate ExpertRank using a popular online knowledge community. Experiments show that the proposed algorithm achieves the best performance when both document-based relevance and authority are considered.
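The three combination strategies named in the abstract can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not give the formulas, the modified PageRank, or any parameter values, so everything below is an assumption made for illustration. In particular, `pagerank` here is the standard weighted PageRank (not the modified variant), the asker-to-answerer graph, the weight `alpha`, the cutoff `top_k`, and all function names are hypothetical.

```python
from typing import Dict, List

# Hypothetical graph encoding: asker -> {answerer: number of answers received}.
Graph = Dict[str, Dict[str, float]]


def pagerank(graph: Graph, damping: float = 0.85, iters: int = 50) -> Dict[str, float]:
    """Standard weighted PageRank (NOT the paper's modified version)."""
    nodes = set(graph) | {v for outs in graph.values() for v in outs}
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iters):
        new = {u: (1.0 - damping) / n for u in nodes}
        for u in nodes:
            outs = graph.get(u, {})
            total = sum(outs.values())
            if total == 0:
                # Dangling node: spread its rank evenly over all nodes.
                for v in nodes:
                    new[v] += damping * rank[u] / n
            else:
                for v, w in outs.items():
                    new[v] += damping * rank[u] * w / total
        rank = new
    return rank


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Scale scores so the largest value is 1 (assumed normalization)."""
    top = max(scores.values(), default=0.0)
    return {k: (v / top if top > 0 else 0.0) for k, v in scores.items()}


def linear_combination(rel: Dict[str, float], auth: Dict[str, float],
                       alpha: float = 0.5) -> Dict[str, float]:
    """Weighted sum of relevance and authority; alpha is an assumed weight."""
    return {u: alpha * rel[u] + (1.0 - alpha) * auth.get(u, 0.0) for u in rel}


def multiplication_scaling(rel: Dict[str, float],
                           auth: Dict[str, float]) -> Dict[str, float]:
    """Relevance scaled by authority."""
    return {u: rel[u] * auth.get(u, 0.0) for u in rel}


def cascade_ranking(rel: Dict[str, float], auth: Dict[str, float],
                    top_k: int) -> List[str]:
    """Shortlist the top_k users by relevance, then reorder them by authority."""
    shortlist = sorted(rel, key=rel.get, reverse=True)[:top_k]
    return sorted(shortlist, key=lambda u: auth.get(u, 0.0), reverse=True)


if __name__ == "__main__":
    # Toy data, invented for this example only.
    graph: Graph = {
        "alice": {"bob": 2.0, "carol": 1.0},
        "dave": {"bob": 1.0},
        "carol": {"bob": 1.0},
    }
    relevance = {"alice": 0.2, "bob": 0.6, "carol": 0.9, "dave": 0.1}
    authority = normalize(pagerank(graph))
    print(linear_combination(relevance, authority))
    print(multiplication_scaling(relevance, authority))
    print(cascade_ranking(relevance, authority, top_k=2))
```

In this toy graph an edge points from the person who asked to the person who answered, so authority flows toward frequent answerers. The cascade variant differs from the other two in that it returns an ordering rather than a combined score, which is why it takes a cutoff instead of a weight.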
