Study of ELM Algorithm Parallelization Based on Spark

Peng LIU, Xue-kui WANG, Yi-hua HUANG, Lei MENG, En-jie DING
DOI: https://doi.org/10.11896/j.issn.1002-137X.2017.12.006
2018-01-01
Abstract: The extreme learning machine (ELM) trains quickly, but because it relies on many matrix operations, its efficiency remains poor when applied to massive amounts of data. After a thorough study of parallel computation with Spark's resilient distributed datasets (RDDs), we propose and implement a parallelized ELM algorithm based on Spark. For performance comparison, a Hadoop MapReduce-based version was also implemented. Experimental results show that the Spark-based parallel ELM algorithm trains significantly more efficiently than the Hadoop MapReduce-based version, and that Spark's efficiency advantage becomes more pronounced as the amount of data processed grows.
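The abstract does not say how the matrix operations are split across the cluster. As a hedged illustration only, the block below writes out the standard ELM least-squares solution and one common row-partitioned decomposition of it. The symbols (H for the hidden-layer output matrix, T for the targets, β for the output weights, K partitions) are notation assumed here, not taken from the paper, and the formula assumes H has full column rank.

```latex
% Standard ELM output weights (assumes H^T H is invertible):
%   H: N x L hidden-layer output matrix, T: N x m target matrix
\beta = H^{\dagger} T = \left(H^{\top} H\right)^{-1} H^{\top} T

% If the N rows are split into K partitions (H_k, T_k), both products
% are sums of per-partition terms that can be computed independently:
H^{\top} H = \sum_{k=1}^{K} H_k^{\top} H_k ,
\qquad
H^{\top} T = \sum_{k=1}^{K} H_k^{\top} T_k
```

Under this decomposition each partition contributes only an L × L and an L × m matrix, so the summed results are small enough to be inverted on a single node when the number of hidden nodes L is much smaller than N. Whether the paper's implementation follows this scheme is not stated in the abstract.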