Publications
- Measuring the Effects of Bid-rigging on Prices with Binary Misclassification (with Changsik Kim and Hyunchul Kim), Review of Industrial Organization (2022)
Abstract
The binary indicator of collusion is the key ingredient in estimating overcharges from bid-rigging with a regression-based approach. We develop a method for examining the effects of misclassification error in the indicator of bid-rigging status on estimates of damages from collusion. We derive partial identification of the regression model of winning bids in public procurement auctions and provide informative bounds on the price effects of bid-rigging. We find that the bounds are tight when placing a plausible restriction on the extent of measurement errors. Our findings show that relaxing the nondifferential assumption about misclassification errors leads to wider bounds.
Working Papers
- Heterogeneous Treatment Effects with Endogeneity and High-Dimensional Covariates
Best Job Market Paper Award, Korea-America Economic Association, 2025
Abstract
This paper develops instrumental variable estimation methods for heterogeneous treatment effects in settings with an endogenous treatment and high-dimensional covariates. The goal is to provide valid inference on treatment effect heterogeneity with respect to observed covariates, when covariates are high-dimensional and heterogeneity is discovered using machine learning. I address treatment endogeneity through instrumental variables. To model heterogeneous effects, I interact the treatment with flexible transformations of all covariates and select relevant interactions with a variable selection procedure. Applying the proposed method, I study the heterogeneous treatment effects of Head Start, a public early childhood education program, on children's cognitive skills. The results reveal complementarities between center characteristics and children's background. I find that frequent home visits increase the effect of the program for children who need additional support, and transportation services deliver larger program gains for children who may face access barriers. These findings suggest that children have greater benefits from the program when center features align with their needs, highlighting the importance of considering rich heterogeneity for policy implications.
- Distribution Regression with Censored Selection (with Ivan Fernandez-Val), May 2025
Abstract
We develop a distribution regression model with a censored selection rule, offering a semi-parametric generalization of the Heckman selection model. Our approach applies to the entire distribution, extends beyond the mean or median, accommodates non-Gaussian error structures, and allows heterogeneous effects of covariates on both selection and outcome distributions. By employing a censored selection rule, our model can uncover richer selection patterns according to both outcome and selection variables, compared to the binary selection case. We analyze identification, estimation, and inference of model functionals such as sorting parameters and distributions purged of sample selection. An application to labor supply using data from the UK reveals different selection patterns into full-time and overtime work across gender, marital status, and time. Additionally, decompositions of wage distributions by gender show that selection effects contribute to a decrease in the observed gender wage gap at low quantiles and an increase in the gap at high quantiles for full-time workers. The observed gender wage gap among overtime workers is smaller, which may be driven by different selection behaviors into overtime work across genders.
- Censored Quantile Regression with Many Controls , March 2023
Abstract
This paper develops estimation and inference methods for censored quantile regression models with high-dimensional controls. The methods are based on the application of double/debiased machine learning (DML) framework to the censored quantile regression estimator of Buchinsky and Hahn (1998). I provide valid inference for low-dimensional parameters of interest in the presence of high-dimensional nuisance parameters when implementing machine learning estimators. The proposed estimator is shown to be consistent and asymptotically normal. The performance of the estimator with high-dimensional controls is illustrated with numerical simulation and an empirical application that examines the effect of 401(k) eligibility on savings.
Work in Progress
- Static Games with Machine Learning: Heterogeneous Peer Effects in Analyst Recommendations (with Hyungjin Kim)