We study deep neural networks and their use in semiparametric inference. We
provide new rates of convergence for deep feedforward neural nets and, because
our rates are sufficiently fast (in some cases minimax optimal), prove that
semiparametric inference is valid using deep nets for first-step estimation.
Our estimation rates and semiparametric inference results are the first in the
literature to handle the current standard architecture: fully connected
feedforward neural networks (multi-layer perceptrons), with the now-default
rectified linear unit (ReLU) activation function and a depth explicitly
diverging with the sample size. We discuss other architectures as well,
including fixed-width, very deep networks. We establish nonasymptotic bounds
for these deep ReLU nets, for both least squares and logistic losses in
nonparametric regression. We then apply our theory to develop semiparametric
inference, focusing on treatment effects and expected profits for concreteness,
and demonstrate their effectiveness with an empirical...

"Deep Neural Networks for Estimation and Inference: Application to Causal Effects and Other Semiparametric Estimand…
Authors: 3

Total Words: 16097

Unique Words: 3842

This paper establishes some asymptotic results such as central limit theorems
and consistency of variance estimation in factor models. We consider a setting
common to modern macroeconomic and financial models where many
counties/regions/macro-variables/assets are observed for many time periods, and
when estimation of a global parameter includes aggregation of a cross-section
of heterogeneous micro-parameters estimated separately for each entity. We
establish a central limit theorem for quantities involving both cross-sectional
and time series aggregation, as well as for quadratic forms in time-aggregated
errors. We also study sufficient conditions under which one can consistently estimate
the asymptotic variance. These results are useful for making inferences in
two-step estimation procedures related to factor models. We avoid structural
modeling of cross-sectional dependence but impose time-series independence.

Authors: 2

Total Words: 13471

Unique Words: 2192

In this paper, we study estimation of nonlinear models with cross sectional
data using two-step generalized estimating equations (GEE) in the quasi-maximum
likelihood estimation (QMLE) framework. In the interest of improving
efficiency, we propose a grouping estimator to account for the potential
spatial correlation in the underlying innovations. We use a Poisson model and a
Negative Binomial II model for count data and a Probit model for binary
response data to demonstrate the GEE procedure. Under mild weak dependency
assumptions, results on estimation consistency and asymptotic normality are
provided. Monte Carlo simulations show efficiency gains from our approach in
comparisons of different estimation methods for count data and binary response
data. Finally, we apply the GEE approach to study the determinants of foreign
direct investment (FDI) inflows to China.

Authors: 3

Total Words: 19695

Unique Words: 3910

When an individual purchases a home, they simultaneously purchase its
structural features, its accessibility to work, and the neighborhood amenities.
Some amenities, such as air quality, are measurable whilst others, such as the
prestige or the visual impression of a neighborhood, are difficult to quantify.
Despite the well-known impacts intangible housing features have on house
prices, limited attention has been given to systematically quantifying these
difficult-to-measure amenities. Two issues have led to this neglect. Not only
do few quantitative methods exist that can measure the urban environment, but
the collection of such data is also both costly and subjective.
We show that street image and satellite image data can capture these urban
qualities and improve the estimation of house prices. We propose a pipeline
that uses a deep neural network model to automatically extract visual features
from images to estimate house prices in London, UK. We make use of traditional
housing features such as age, size and accessibility as...

Authors: 3

Total Words: 6870

Unique Words: 2174

The accumulation of knowledge required to produce economic value is a process
that often relates to nations' economic growth. Such a relationship, however, is
misleading when the proxy of such accumulation is the average years of
education. In this paper, we show that the predictive power of this proxy
started to dwindle in 1990 when nations' schooling began to homogenize. We
propose a metric of human capital that is less sensitive than average years of
education and remains a significant predictor of economic growth when tested
with both cross-section data and panel data. We argue that future research on
economic growth will discard educational variables based on quantity as
predictors, given the thresholds that these variables are reaching.

Authors: 3

Total Words: 6977

Unique Words: 2080

Nonlinear panel data models with fixed individual effects provide an
important set of tools for describing microeconometric data. In a large class
of such models (including probit, proportional hazard and quantile regression
to name just a few) it is impossible to difference out individual effects, and
inference is usually justified in a 'large n, large T' asymptotic framework.
However, there is a considerable gap in the type of assumptions that are
currently imposed in models with smooth score functions (such as probit, and
proportional hazard) and quantile regression. In the present paper we show that
this gap can be bridged and establish asymptotic unbiased normality for
quantile regression panels under conditions on n,T that are very close to what
is typically assumed in standard nonlinear panels. Our results considerably
improve upon existing theory and show that quantile regression is applicable to
the same type of panel data (in terms of n,T) as other commonly used nonlinear
panel data models. Thorough numerical experiments...

Authors: 3

Total Words: 19065

Unique Words: 3644

A large "happiness", or life satisfaction, literature in economics makes use
of Likert-like scales in assessing survey respondents' cognitive evaluations of
their lives. These measures are being used to estimate economic benefits in
every empirical field of economics. Typically, analyses of these data have
shown remarkably low direct returns of education for improving subjective
well-being. In addition, arguably, the inferred impact of material wealth and
income using this method is also unexpectedly low as compared with other,
social factors, and as compared with economists' prior expectations which
underlie, in some sense, support for using GDP as a proxy for more general
quality of life goals. Discrete response scales used ubiquitously for the
reporting of life satisfaction pose cognitive challenges to survey respondents,
so differing cognitive abilities result in different uses of the scale, and
thus potential bias in statistical inference. This problem has so far gone
unnoticed. An overlooked feature of the distribution of...

Authors: 1

Total Words: 12922

Unique Words: 3458

This paper proposes a method for estimating multiple change points in panel
data models with unobserved individual effects via ordinary least-squares
(OLS). Typically, in this setting, the OLS slope estimators are inconsistent
due to the unobserved individual effects bias. As a consequence, existing
methods remove the individual effects before change point estimation through
data transformations such as first-differencing. We prove that under reasonable
assumptions, the unobserved individual effects bias has no impact on the
consistent estimation of change points. Our simulations show that since our
method does not remove any variation in the dataset before change point
estimation, it performs better in small samples compared to first-differencing
methods. We focus on short panels because they are commonly used in practice,
and allow for the unobserved individual effects to vary over time. Our method
is illustrated via two applications: the environmental Kuznets curve and the
U.S. house price expectations after the financial crisis.

Authors: 3

Total Words: 15691

Unique Words: 3245

Statements for public health purposes such as "1 in 2 will get cancer by age
85" have appeared in public spaces. The meaning drawn from such statements
affects economic welfare, not just public health. Both markets and governments
use information on all kinds of risks. Useful information can, in turn,
improve economic welfare; inaccuracy, however, can lower it. We adapt the
contingency table approach so that a quoted risk is cross-classified with the
states of nature. We show that bureaucratic objective functions regarding the
accuracy of a reported cancer risk can then be stated.

Authors: 4

Total Words: 3149

Unique Words: 1134

This paper proposes a point estimator of the break location for a one-time
structural break in linear regression models. If the break magnitude is small,
the least-squares estimator of the break date has two modes at the ends of the
finite sample period, regardless of the true break location. I suggest a
modification of the least-squares objective function to solve this problem. The
modified objective function incorporates estimation uncertainty that varies
across potential break dates. The new break point estimator is consistent and
has a unimodal finite sample distribution under a small break magnitude. A
limit distribution is provided under an in-fill asymptotic framework which
verifies that the new estimator outperforms the least-squares estimator.
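
For orientation, the baseline that this abstract modifies can be sketched as follows. This is a minimal illustration of the standard least-squares break date estimator only, not the paper's modified objective function. It assumes the simplest case of a one-time shift in the mean (a regression on a constant); the function names, the `trim` parameter, and the simulated series are hypothetical and do not come from the paper.

```python
import random


def ssr(values):
    """Sum of squared deviations of a segment from its own mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def ls_break_date(y, trim=1):
    """Least-squares break date for a one-time shift in the mean.

    Returns the index k (the first observation of the second regime)
    that minimizes the combined sum of squared residuals of the two
    segments y[:k] and y[k:]. `trim` is a hypothetical parameter giving
    the minimum number of observations required in each segment.
    """
    candidates = range(trim, len(y) - trim + 1)
    return min(candidates, key=lambda k: ssr(y[:k]) + ssr(y[k:]))


# Simulated series (illustrative only): mean 0 before t = 60, mean 5 from t = 60.
rng = random.Random(0)
true_break = 60
y = [rng.gauss(0.0, 0.1) + (5.0 if t >= true_break else 0.0) for t in range(100)]

k_hat = ls_break_date(y, trim=5)
print("estimated break date:", k_hat)
```

With a break this large relative to the noise, the estimate coincides with the true break date. The abstract concerns the opposite case, a small break magnitude, where this baseline tends to pick dates near the ends of the sample.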

Authors: 1

Total Words: 23177

Unique Words: 3645

