### Top 3 arXiv Papers Today in Econometrics

##### #1. Consistency and matching without replacement
###### Fredrik Sävje
The paper demonstrates that the matching estimator is not generally consistent for the average treatment effect of the treated when matching is done without replacement using propensity scores. To achieve consistency, practitioners must either assume that no unit exists with a propensity score greater than one-half or assume that there is no confounding among such units. Illustrations suggest that the result also applies to matching using other metrics, as long as it is done without replacement.
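
To make the setting concrete, here is a minimal sketch of 1:1 nearest-neighbour matching on the propensity score without replacement. It is an illustration only and not the paper's estimator or proof: the function name `match_without_replacement`, the greedy matching order, and the toy data are all assumptions introduced here.

```python
def match_without_replacement(treated, controls):
    """Greedy 1:1 nearest-neighbour matching on the propensity score.

    treated, controls: lists of (propensity_score, outcome) tuples.
    Each control is used at most once. Returns the matched
    (treated_index, control_index) pairs and the mean matched
    outcome difference, a simple estimate of the effect on the treated.
    """
    if len(controls) < len(treated):
        raise ValueError("need at least as many controls as treated units")
    available = list(range(len(controls)))  # controls not yet matched
    pairs = []
    for t_idx, (t_score, _) in enumerate(treated):
        # Closest remaining control in propensity-score distance.
        best = min(available, key=lambda c: abs(controls[c][0] - t_score))
        available.remove(best)  # without replacement: never reuse a control
        pairs.append((t_idx, best))
    att = sum(treated[t][1] - controls[c][1] for t, c in pairs) / len(pairs)
    return pairs, att


# Hypothetical toy data: (propensity score, outcome).
treated = [(0.8, 5.0), (0.7, 4.0)]
controls = [(0.75, 3.0), (0.2, 1.0), (0.3, 1.5)]
pairs, att = match_without_replacement(treated, controls)
print(pairs, att)
```

In this toy data the only control with a high propensity score is consumed by the first treated unit, so the second treated unit (score 0.7) is matched to a control with score 0.3. This gives some intuition for why regions with propensity scores above one-half, where treated units tend to outnumber comparable controls, can be problematic when controls cannot be reused.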
##### #2. Testing for Sample Selection
This paper provides a unified approach for detecting sample selection in nonparametric conditional mean and quantile functions. In fact, as sample selection leads to a loss of point identification in the nonparametric quantile case, our tests are of particular relevance when interest lies in the conditional distribution. Our testing strategy consists of a two-step procedure: the first test is an omitted predictor test, where the omitted variable is the propensity score. This test has power against generic $\sqrt{n}$-alternatives, and failure to reject the null implies no selection. By contrast, as with any omnibus test, we cannot distinguish between a rejection due to genuine selection and one due to generic misspecification, when the omitted variable is correlated with the propensity score. Under the maintained assumption of no selection, our second test is therefore designed to detect misspecification. This is achieved by a localized version of the first test, using only individuals with propensity score close to one. Although the second...
##### #3. Testing for Unobserved Heterogeneity via k-means Clustering
###### Andrew J. Patton, Brian M. Weller
Clustering methods such as k-means have found widespread use in a variety of applications. This paper proposes a formal testing procedure to determine whether a null hypothesis of a single cluster, indicating homogeneity of the data, can be rejected in favor of multiple clusters. The test is simple to implement, valid under relatively mild conditions (including non-normality, and heterogeneity of the data in aspects beyond those in the clustering analysis), and applicable in a range of contexts (including clustering when the time series dimension is small, or clustering on parameters other than the mean). We verify that the test has good size control in finite samples, and we illustrate the test in applications to clustering vehicle manufacturers and U.S. mutual funds.
