Survival prediction and analysis of Titanic based on logistic regression and KNN

Jiale Li
DOI: https://doi.org/10.62051/6xfdek19
2024-08-12
Abstract:The main purpose of this paper is to use machine learning methods to identify factors related to the survival of Titanic passengers and analyze model parameters. The study first speculated on the survival factors of Titanic passengers and conducted data visualization tests on these speculations. Subsequently, this article cleaned up the relevant data, abandoned the factor of excessive missing data, made up for a small amount of missing data, and then constructed a relevant machine model. Finally, by comparing the accuracy of the two models, a more efficient and accurate model was selected. Through relevant work, it has been concluded that the logistic regression model is more effective than the K-nearest neighbor (KNN) model and that passenger age, economic status, and cabin class are closely related to passenger survival. This article provides valuable references for relevant experimental research by introducing machine learning methods for effective survival prediction. In the future, research will further delve into the methods and solutions of deep learning.
What problem does this paper attempt to address?