Yelp Dataset Analysis using Scalable Big Data

Mohsen Alam,Benjamin Cevallos,Oscar Flores,Randall Lunetto,Kotaro Yayoshi,Jongwook Woo
DOI: https://doi.org/10.48550/arXiv.2104.08396
2021-04-17
Abstract:Yelp has served and will continue to serve as a data-driven application. Yelp has published a dataset containing business information, reviews, user information, and check-in information. This paper will examine this dataset to provide descriptive analytics to understand business performance, geo-spatial distribution of businesses, reviewers' rating and other characteristics, and temporal distribution of check-ins in business premises. With these analysis we are able to establish that yelp reviews, tips, elite users and check ins have started to plummet over the years. Coincidentally, the paper also establishes that Canadians have a more stable star ratings as well as sentiment ratings when compared to Americans.
Distributed, Parallel, and Cluster Computing,Performance
What problem does this paper attempt to address?