Analysis of Plant Breeding on Hadoop and Spark

Shuangxi Chen,Chunming Wu,Yongmao Yu
DOI: https://doi.org/10.1155/2016/7081491
2016-01-01
Advances in Agriculture
Abstract:Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS) and cluster based on memory iterative components. With this cluster, we achieve crop breeding large data analysis tasks in parallel through API provided by Spark. By experiments and tests of Indica and Japonica rice traits, plant breeding analysis platform can significantly improve the breeding of big data analysis speed, reducing the workload of concurrent programming.
What problem does this paper attempt to address?