Marco Caliendo, Katrin Huber, Ingo E. Isphording, Jakob Wegmann
Abstract: Surveys are an indispensable source of data for applied economic research; however, their reliance on self-reported information can introduce bias, especially if core variables such as personal income are misreported. To assess the extent and impact of this misreporting bias, we compare self-reported wages from the German Socio-Economic Panel (SOEP) with administrative wages from social security records (IEB) for the same individuals. Using a novel and unique data linkage (SOEP-ADIAB), we identify a modest but economically significant reporting bias, with SOEP respondents underreporting their administrative wages by about 7.3%. This misreporting varies systematically with individual, household, and especially job and firm characteristics. In replicating common empirical analyses in which wages serve as either dependent or independent variables, we find that misreporting is consequential for some, but not all estimated relationships. It turns out to be inconsequential for examining the returns to education, but relevant for analyzing the gender wage gap. In addition, we find that misreporting bias can significantly affect the results when wage is used as the independent variable. Specifically, estimates of the wage-satisfaction relationship are substantially overestimated when based on survey data, although this bias is mitigated when focusing on interpersonal changes. Our findings underscore that survey-based measures of individual wages can significantly bias commonly estimated empirical relationships. They also demonstrate the enormous research potential of linked administrative-survey data.
What problem does this paper attempt to address?
This paper addresses three main questions:
1. **Degree of wage-reporting bias in survey data**: The authors use the linkage between the German Socio-Economic Panel (SOEP) and the Integrated Employment Biographies (IEB) to quantify wage-reporting bias in survey data. They find that SOEP respondents underreport their administrative wages by about 7.3% on average, or roughly €186 per person.
2. **Which observable characteristics predict reporting bias**: The authors analyze which factors at the individual, household, job, and firm levels predict wage-reporting bias. Important predictors include personal characteristics such as gender and extroversion, household characteristics such as the interaction between partner income and gender, and job and firm characteristics such as union membership, firm size, average wage level, and employee composition.
3. **The impact of misreported wages on applied research**: The authors examine how using survey wages instead of administrative wages changes the estimates of commonly studied economic relationships. Specifically:
- **Returns to education**: The choice of data source has little effect on the estimated returns to education; estimates based on SOEP and IEB wages are similar.
- **Gender wage gap**: Because reporting bias is significantly related to gender, the gender wage gap is slightly but significantly overestimated in survey data. With IEB wages, the conditional gap shrinks from 10.7% to 7.7%.
- **The impact of wages on satisfaction**: Using self-reported wages leads to a significant overestimation of the effect of wages on subjective indicators such as satisfaction. For example, with SOEP wages, an additional €1,000 in wages raises personal income satisfaction by about 25.4% of a standard deviation; with IEB wages, the effect shrinks to 20.2% of a standard deviation.
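The comparisons described above can be illustrated with a minimal sketch. It assumes a handful of made-up linked records (survey wage, administrative wage, gender); the values, the field layout, and the function names `reporting_gap` and `raw_log_gender_gap` are hypothetical and are not taken from SOEP-ADIAB or from the authors' code. The sketch computes the mean absolute and relative reporting gap, and a raw (unconditional) log gender wage gap under each wage measure, whereas the paper reports a conditional gap from regressions with controls.

```python
from math import log
from statistics import mean

# Hypothetical linked records: (survey_wage, admin_wage, is_female).
# Toy values for illustration only, not SOEP-ADIAB data.
records = [
    (2300.0, 2500.0, False),
    (3100.0, 3200.0, False),
    (1800.0, 2000.0, True),
    (2600.0, 2900.0, True),
]


def reporting_gap(rows):
    """Return (mean absolute gap, mean relative gap) of survey vs. admin wages.

    Negative values mean respondents report less than the administrative wage.
    """
    abs_gap = mean(s - a for s, a, _ in rows)
    rel_gap = mean((s - a) / a for s, a, _ in rows)
    return abs_gap, rel_gap


def raw_log_gender_gap(rows, use_admin):
    """Mean log wage of men minus mean log wage of women (no controls)."""
    idx = 1 if use_admin else 0
    men = mean(log(r[idx]) for r in rows if not r[2])
    women = mean(log(r[idx]) for r in rows if r[2])
    return men - women


abs_gap, rel_gap = reporting_gap(records)
gap_survey = raw_log_gender_gap(records, use_admin=False)
gap_admin = raw_log_gender_gap(records, use_admin=True)

print(f"mean absolute gap: {abs_gap:.1f}")
print(f"mean relative gap: {rel_gap:.3f}")
print(f"raw log gender gap (survey wages): {gap_survey:.3f}")
print(f"raw log gender gap (admin wages):  {gap_admin:.3f}")
```

In this toy data the women underreport by more than the men, so the gender gap measured with survey wages exceeds the gap measured with administrative wages, which is the direction of bias the summary describes.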
In summary, the paper documents wage-reporting bias in survey data and shows how it can distort commonly estimated relationships in economic research. It calls for caution when using self-reported wage data and for validation against more reliable sources, such as linked administrative records, whenever possible.