"EBK" : Leveraging Crowd-Sourced Social Media Data to Quantify How Hyperlocal Gang Affiliations Shape Personal Networks and Violence in Chicago's Contemporary Southside

Riley Tucker, Nakwon Rim, Alfred Chao, Elizabeth Gaillard, Marc G. Berman
2024-08-19
Abstract: Recent ethnographic research reveals that gang dynamics in Chicago's Southside have evolved with decentralized micro-gang "set" factions and cross-gang interpersonal networks marking the contemporary landscape. However, standard police datasets lack the depth to analyze gang violence with such granularity. To address this, we employed a natural language processing strategy to analyze text from a Chicago gangs message board. By identifying proper nouns, probabilistically linking them to gang sets, and assuming social connections among names mentioned together, we created a social network dataset of 271 individuals across 11 gang sets. Using Louvain community detection, we found that these individuals often connect with gang-affiliated peers from various gang sets that are physically proximal. Hierarchical logistic regression revealed that individuals with ties to homicide victims and central positions in the overall gang network were at increased risk of victimization, regardless of gang affiliation. This research demonstrates that utilizing crowd-sourced information online can enable the study of otherwise inaccessible topics and populations.
Social and Information Networks
What problem does this paper attempt to address?
This paper asks how crowd-sourced social media data can be used to quantify the personal networks of contemporary gang members on Chicago's South Side and the relationship between those networks and violence. Specifically, the research constructs and validates a social-media-based gang network from post content, in order to reveal patterns of social connection among gang members and to evaluate whether these connections are related to violent events (such as shootings).

### Research Background and Problem Statement

1. **Limitations of Traditional Research**
   - Traditional gang research relies mainly on sources such as police arrest records and survey data. These sources have limitations, including data bias, an inability to capture all types of social relationships, and neglect of the small sets within gangs.
   - Gang structures have shifted from large, centralized "nations" to small, decentralized sets, which makes it difficult for traditional research methods to reflect current gang dynamics accurately.
2. **Advantages of Social Media Data**
   - Social media platforms give gang members a new place to interact, and members often post about gang activities and personal relationships there.
   - Natural language processing (NLP) can extract information about gang members and their social networks from this content, compensating for the gaps in traditional data.
3. **Research Objectives**
   - Construct a social-media-based gang network and identify social connections among gang members.
   - Analyze whether individual characteristics in this network (such as network position and ties to known victims) are related to the risk of experiencing violence.
   - Explore whether social media data can offer a new perspective for understanding gang violence.
### Research Methods

- **Data Source**: 57,962 posts were collected from an online forum about Chicago gangs.
- **Data Processing**: NLP was used to identify proper nouns in posts and associate them with gangs. Each individual's most likely gang affiliation was then determined probabilistically.
- **Network Construction**: A social network was built from individuals co-mentioned in posts, with edge weights given by the frequency of co-mention.
- **Community Detection**: The Louvain algorithm was used to identify sub-communities in the network.
- **Statistical Analysis**: A hierarchical logistic regression model was used to evaluate the relationship between network characteristics (such as degree centrality and the proportion of dead neighbors) and an individual's risk of death.

### Main Findings

- **Social Network Characteristics**: Gang members tend to form social connections with members of the same gang or of neighboring gangs.
- **Geographical Proximity**: Gang members who are geographically close are more likely to be co-mentioned in the network.
- **Death Risk Prediction**: An individual's position in the network (such as degree centrality) and connections with known victims (such as the share of their neighbors who have died) are significantly associated with their risk of death.

### Conclusion

This study shows that social media data can be used to construct social networks of gang members, and that characteristics of these networks help explain individuals' risk of experiencing violence. This provides new tools and perspectives for understanding gang violence and demonstrates the potential of social media data in social science research.
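
To make the network-construction step and the two network features named under Research Methods more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the toy `posts` and `deceased` data, the function names, and the choice to normalize degree centrality over connected nodes only are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical input: each post reduced to the set of names extracted from it.
posts = [
    {"A", "B"},
    {"A", "B", "C"},
    {"C", "D"},
    {"E"},  # a lone mention creates no edge
]
deceased = {"B"}  # hypothetical set of known victims


def build_comention_edges(posts):
    """Count how often each unordered pair of names appears in the same post."""
    weights = Counter()
    for names in posts:
        for u, v in combinations(sorted(names), 2):
            weights[(u, v)] += 1
    return weights


def neighbor_map(weights):
    """Turn the weighted edge list into an undirected adjacency map."""
    adj = defaultdict(set)
    for u, v in weights:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def degree_centrality(adj):
    """Share of the other connected nodes each node is tied to.

    Assumption: nodes without any edge are excluded from the network.
    """
    n = len(adj)
    if n < 2:
        return {}
    return {node: len(nbrs) / (n - 1) for node, nbrs in adj.items()}


def dead_neighbor_share(adj, deceased):
    """Proportion of each node's neighbors that appear in `deceased`."""
    return {
        node: sum(nb in deceased for nb in nbrs) / len(nbrs)
        for node, nbrs in adj.items()
    }


edges = build_comention_edges(posts)
adj = neighbor_map(edges)
centrality = degree_centrality(adj)
dead_share = dead_neighbor_share(adj, deceased)
```

In this toy data, `edges[("A", "B")]` is 2 because A and B share two posts, and `dead_share["A"]` is 0.5 because one of A's two neighbors is in `deceased`. The weighted edges could then be passed to a graph library for Louvain community detection, and the per-person features used as predictors in a regression model.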