Controlling Homophily in Social Network Regression Analysis by Machine Learning

Xuanqi Liu,Ke-Wei Huang
DOI: https://doi.org/10.1287/ijoc.2022.0287
IF: 3.288
2024-07-13
INFORMS Journal on Computing
Abstract:Across social science disciplines, empirical studies related to social networks have become the most popular research subjects in recent years. A frequently examined topic within these studies is the estimation of peer influence while controlling for homophily effects. However, although researchers may have access to all observable homophily variables, there is scarce literature addressing latent homophily effects stemming from unobservable features. Recent endeavors have demonstrated the efficacy of node embeddings derived from network structure in controlling latent homophily. Inspired by the network embedding research, this study introduces two methods that integrate node embeddings to better control latent homophily, particularly the nonlinear latent homophily effect. The first method uses double machine learning in the partially linear regression literature to alleviate estimation bias. The second method estimates peer influence effects directly by a novel neural network model. Our experimentation results show that our approaches outperform existing estimators in reducing the omitted variable bias due to homophily effects in network regression models. Theoretical analysis of two new estimation methods is also provided in this paper. History: Accepted by Ram Ramesh, Area Editor for Data Science and Machine Learning. Funding: This research is supported by the National Research Foundation, Singapore under its Industry Alignment Fund - Pre-positioning (IAF-PP) Funding Initiative [Grant A-0003504-02-00]. Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information ( https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2022.0287 ) as well as from the IJOC GitHub software repository ( https://github.com/INFORMSJoC/2022.0287 ). The complete IJOC Software and Data Repository is available at https://informsjoc.github.io/ .
computer science, interdisciplinary applications,operations research & management science
What problem does this paper attempt to address?