Identifying Market Maker Trades as 'Retail' from TAQ: No Shortage of False Negatives and False Positives

Robert H. Battalio, Robert H. Jennings, Mehmet Saglam, Jun Wu
DOI: https://doi.org/10.2139/ssrn.4579159
2023-01-01
SSRN Electronic Journal
Abstract: Boehmer et al. (2021) propose a methodology to infer retail trades from publicly available NYSE Trade and Quote (TAQ) data. Their methodology relies on assumptions about what types of orders do and do not trade on non-quote-midpoint sub-penny increments via the Trade Reporting Facility (TRF). We obtain proprietary data from one or more wholesalers known to receive marketable orders from retail brokers. We use these data to demonstrate that the Boehmer et al. (2021) methodology identifies less than one-third of trades generally assumed to be from retail investors and analyze cross-sectional determinants of the technique's identification rate. In addition, we obtain proprietary data on institutional trades from multiple sources and demonstrate that a large number of such trades print on the TRF at non-quote-midpoint sub-penny prices in violation of the assumption that institutional orders trade only on penny or half-penny increments. Thus, there are both Type I and Type II errors that affect the ability to identify retail and only retail trades from TAQ using the Boehmer et al. (2021) methodology. Finally, we demonstrate that these errors can produce different inferences regarding the association between lagged retail order imbalance measures and stock returns.
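
The sub-penny rule that the abstract refers to can be illustrated with a minimal sketch. This is not the authors' code, and the details below are assumptions not stated in the abstract: the cutoffs follow the rule as commonly described for Boehmer et al. (2021), in which a TRF print whose fraction of a cent lies strictly between 0 and 0.4 is labeled a retail sell, one strictly between 0.6 and 1 is labeled a retail buy, and round-penny or near-half-penny prints are left unclassified. TRF prints are assumed to carry TAQ exchange code "D". The function name `classify_trf_trade` and the label strings are hypothetical.

```python
from decimal import Decimal

# Assumed cutoffs (in fractions of a cent) for the sub-penny rule.
SELL_UPPER = Decimal("0.4")
BUY_LOWER = Decimal("0.6")

def classify_trf_trade(price: str, exchange: str) -> str:
    """Label one trade print under the assumed sub-penny rule.

    price is passed as a string so that Decimal keeps it exact;
    exchange is the TAQ exchange code (assumed "D" for TRF prints).
    """
    if exchange != "D":
        return "unclassified"  # the rule only looks at off-exchange prints
    # Fraction of a cent in [0, 1), e.g. 10.0025 -> 0.25
    frac = (Decimal(price) * 100) % 1
    if Decimal("0") < frac < SELL_UPPER:
        return "retail_sell"   # small price improvement above a round penny
    if BUY_LOWER < frac < Decimal("1"):
        return "retail_buy"    # small price improvement below a round penny
    return "unclassified"      # round penny or near the half-penny

if __name__ == "__main__":
    for p in ["10.0025", "10.0075", "10.0050", "10.0100"]:
        print(p, classify_trf_trade(p, "D"))
```

Under this sketch, the false negatives described in the abstract would correspond to retail trades that land in the "unclassified" branch, and the false positives to institutional trades that print on the TRF at sub-penny prices and receive a retail label.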