Performance Comparison of Programming Languages to Analyze Big Data Sets

Davis Visman
2014-12-10
Abstract:The goal of this research was to determine, when analyzing big data text files, the advantages of different programming languages, including faster processing time and lower memory usage. Compiled and interpreted languages were used; compiled-- C#.NET, Visual Basic.NET, and C++; interpreted-- Python, Perl, Ruby, and Lua. An algorithm we implemented in each language searches a big data text file for target words, which, upon finding, stores the preceding, the target, and the succeeding words for analysis, then displays a count of the number of times each target, preceding, and succeeding words appeared. The algorithm also keeps track of its running time and memory footprint. The conclusions of this research give future developers a benchmark for which language might be best for their specific goals.
Computer Science
What problem does this paper attempt to address?