Ranking the locations and predicting future crime occurrence by retrieving news from different Bangla online newspapers

Jumman Hossain,Rajib Chandra Das,Md. Ruhul Amin,Md. Saiful Islam
2023-05-18
Abstract: Thousands of crimes happen every day, but statistics are kept for only a few of them, and crime rates are increasing day by day. The reason may be a lack of concern about previous crimes or a lack of statistics on them. Previous crime statistics matter to the general public when deciding where to go, to the police when catching criminals and taking steps to restrain crime, and to tourists when making travel decisions. The National Institute of Justice releases crime survey data for the country, but does not offer crime statistics down to the Union or Thana level. Considering all of these cases, we propose an approach that gives people an approximate picture of the safety of a specific location, with a crime ranking of different areas, crimes located on a map, and a mechanism for predicting future crime occurrence. Our approach relies on crawling crime data from different online Bangla newspapers, stemming and keyword extraction, a location-finding algorithm, cosine similarity, a naive Bayes classifier, and a custom crime prediction model.
Information Retrieval,Computers and Society,Machine Learning
What problem does this paper attempt to address?
The paper aims to rank crime locations across regions and to predict future crime occurrence using crime data extracted from different Bangla online newspapers. Specifically, the researchers aim to:

1. **Obtain real-time crime data**: Most existing crime statistics systems rely on historical databases and fail to provide the latest crime information. This research keeps the data timely by crawling daily news.
2. **Rank crime locations**: Analyze crime events in the news to determine which regions have a higher crime rate, providing a basis for decision-making by the public and the police.
3. **Predict future crimes**: Based on historical crime data, use machine-learning models to predict the probability of future crimes in specific areas, helping relevant departments take preventive measures in advance.
4. **Raise public safety awareness**: Visualize crime hotspots on maps to help ordinary people, tourists, and others make safer choices when traveling.

To achieve these goals, the authors use web crawlers, stemming, keyword extraction, a location-finding algorithm, cosine similarity, a Naive Bayes classifier, and a custom crime prediction model.

### Formula Representation

The paper's formula for predicting the probability of crime occurrence is as follows. Let:

- \(CZM\) = the number of crimes that occur in a specific area in a certain month
- \(CTM\) = the total number of crimes that occur in all areas in a certain month
- \(CZ\) = the total number of crimes that occur in a specific area in all months
- \(CT\) = the total number of crimes that occur in all areas in all months

Then the probability \(P_{ZM}\) of a crime occurring in a specific area in a future month can be expressed as:

\[P_{ZM}=\left(\frac{CZM}{CTM}\right)\times\left(\frac{CZ}{CT}\right)\]

This formula is valid when \(CZM, CTM, CZ, CT > 0\).
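As a rough illustration, the probability formula above can be sketched in Python. The input layout (a list of `(zone, month)` pairs, one per reported crime), the function names `crime_probability` and `rank_zones`, and the sample records are all hypothetical and not taken from the paper. Ranking zones by \(P_{ZM}\) is an assumption made here for demonstration, not necessarily the paper's ranking method.

```python
def crime_probability(records, zone, month):
    """Return P_ZM = (CZM / CTM) * (CZ / CT), or None if any count is zero.

    records: iterable of (zone, month) pairs, one per reported crime
             (a hypothetical layout chosen for this sketch).
    """
    records = list(records)
    czm = sum(1 for z, m in records if z == zone and m == month)  # zone, month
    ctm = sum(1 for _, m in records if m == month)                # all zones, month
    cz = sum(1 for z, _ in records if z == zone)                  # zone, all months
    ct = len(records)                                             # all zones, all months
    if min(czm, ctm, cz, ct) == 0:
        return None  # the formula is only stated for strictly positive counts
    return (czm / ctm) * (cz / ct)


def rank_zones(records, month):
    """Rank zones by P_ZM for one month, highest first (illustrative only)."""
    records = list(records)
    zones = {z for z, _ in records}
    scored = []
    for z in zones:
        p = crime_probability(records, z, month)
        if p is not None:
            scored.append((z, p))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up sample data: month 1 has three Sylhet reports and one Dhaka report.
sample = [
    ("Sylhet", 1), ("Sylhet", 1), ("Sylhet", 1),
    ("Dhaka", 1), ("Dhaka", 2), ("Sylhet", 2),
]
ranking = rank_zones(sample, month=1)
```

With this sample, Sylhet in month 1 has \(CZM = 3\), \(CTM = 4\), \(CZ = 4\), and \(CT = 6\), so \(P_{ZM} = 0.75 \times 4/6 = 0.5\), and it ranks ahead of Dhaka. Returning `None` for zero counts follows the stated validity condition rather than treating a missing observation as a probability of zero.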
### Summary

Through the above methods, the researchers hope to reduce the occurrence of crime events to a certain extent and provide valuable reference information for the government, the public, and the police.