##### #1. Discussion contribution "Functional models for time-varying random objects" by Dubey and Müller (to appear in JRSS-B)
###### Wicher Bergsma
In an inspiring paper, Dubey and Müller (DM) extend PCA to the case where observations are metric-valued functions. As an alternative, we develop a kernel PCA approach, which we show is closely related to the DM approach. While kernel principal components (kPCs) are simply defined, DM require added complexity in the form of "object FPCs" and "Fréchet scores".
##### #2. Infinitesimal generators for two-dimensional Lévy process-driven hypothesis testing
###### Michael Roberts, Indranil SenGupta
In this paper, we present the testing of four hypotheses on two streams of observations that are driven by Lévy processes. This is applicable to sequential decision making on the state of two-sensor systems. In one case, each sensor receives or does not receive a signal obstructed by noise. In another, each sensor receives data driven by Lévy processes with large or small jumps. In either case, these give rise to four possibilities. Infinitesimal generators are presented and analyzed. Bounds for infinitesimal generators in terms of *super-solutions* and *sub-solutions* are computed. An application of this procedure for the stochastic model is also presented in relation to the financial market.
##### #3. Improved clustering algorithms for the Bipartite Stochastic Block Model
###### Mohamed Ndaoud, Suzanne Sigalla, Alexandre B. Tsybakov
We consider a Bipartite Stochastic Block Model (BSBM) on vertex sets $V_1$ and $V_2$, and investigate asymptotic sufficient conditions of exact and almost full recovery for polynomial-time algorithms of clustering over $V_1$, in the regime where the cardinalities satisfy $|V_1|\ll|V_2|$. We improve upon the known conditions of almost full recovery for spectral clustering algorithms in BSBM. Furthermore, we propose a new computationally simple procedure achieving exact recovery under milder conditions than the state of the art. This procedure is a variant of Lloyd's iterations initialized with a well-chosen spectral algorithm, leading to what we expect to be optimal conditions for exact recovery in this model. The key elements of the proof techniques are different from classical community detection tools on random graphs. In particular, we develop a heavy-tailed variant of the matrix Bernstein inequality. Finally, using the connection between planted satisfiability problems and the BSBM, we improve upon the sufficient number of clauses...
##### #4. Minimax rates of $\ell_p$-losses for high-dimensional linear regression models with additive measurement errors over $\ell_q$-balls
###### Xin Li, Dongya Wu
We study minimax rates for high-dimensional linear regression with additive errors under the $\ell_p\ (1\leq p<\infty)$-losses, where the regression parameter is of weak sparsity. Our lower and upper bounds agree up to constant factors, implying that the proposed estimator is minimax optimal.
##### #5. Sparse recovery via nonconvex regularized $M$-estimators over $\ell_q$-balls
###### Xin Li, Dongya Wu, Chong Li, Jinhua Wang, Jen-Chih Yao
In this paper, we analyse the recovery properties of nonconvex regularized $M$-estimators, under the assumption that the true parameter is of soft sparsity. In the statistical aspect, we establish the recovery bound for any stationary point of the nonconvex regularized $M$-estimator, under restricted strong convexity and some regularity conditions on the loss function and the regularizer, respectively. In the algorithmic aspect, we slightly decompose the objective function and then solve the nonconvex optimization problem via the proximal gradient method, which is proved to achieve a linear convergence rate. In particular, we note that for commonly-used regularizers such as SCAD and MCP, a simpler decomposition is applicable thanks to our assumption on the regularizer, which helps to construct the estimator with better recovery performance. Finally, we demonstrate our theoretical consequences and the advantage of the assumption by several numerical experiments on the corrupted errors-in-variables linear regression model....
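As a rough illustration of the algorithmic idea above, the sketch below uses the standard decomposition of the MCP regularizer into $\lambda|t|$ plus a smooth concave part, moves the concave part into the smooth loss, and runs proximal gradient with soft-thresholding. This is an assumption about the general technique, not the authors' exact decomposition or experimental setup; the function names, the least-squares loss, and the values of `lam` and `b` are all hypothetical.

```python
import random

def soft_threshold(v, t):
    # Proximal operator of t * |.|
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0

def mcp_concave_grad(t, lam, b):
    # Derivative of q(t) = MCP(t) - lam * |t| (smooth and concave)
    if abs(t) <= b * lam:
        return -t / b
    return -lam if t > 0 else lam

def prox_grad_mcp(X, y, lam=0.1, b=3.0, iters=500):
    n, d = len(X), len(X[0])
    # Crude Lipschitz bound for the smooth part: trace(X'X / n) + 1 / b
    lipschitz = sum(X[i][j] ** 2 for i in range(n) for j in range(d)) / n + 1.0 / b
    step = 1.0 / lipschitz
    beta = [0.0] * d
    for _ in range(iters):
        resid = [sum(X[i][j] * beta[j] for j in range(d)) - y[i] for i in range(n)]
        # Gradient of least-squares loss plus the concave part of the penalty
        grad = [
            sum(X[i][j] * resid[i] for i in range(n)) / n
            + mcp_concave_grad(beta[j], lam, b)
            for j in range(d)
        ]
        beta = [soft_threshold(beta[j] - step * grad[j], step * lam) for j in range(d)]
    return beta

# Toy data (hypothetical): sparse truth, Gaussian design, small noise
random.seed(0)
n, d = 100, 5
true_beta = [2.0, 0.0, 0.0, -1.5, 0.0]
X = [[random.gauss(0, 1) for _ in range(d)] for _ in range(n)]
y = [sum(X[i][j] * true_beta[j] for j in range(d)) + random.gauss(0, 0.1) for i in range(n)]
beta_hat = prox_grad_mcp(X, y)
```

Keeping the $\ell_1$ part as the only nonsmooth term means each iteration needs only a soft-thresholding step, which is the usual reason for splitting the regularizer this way.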
##### #6. Graph Topological Aspects of Granger Causal Network Learning
###### R. J. Kinnear, R. R. Mazumdar
We study Granger causality in the context of wide-sense stationary time series, where our focus is on the topological aspects of the underlying causality graph. We establish sufficient conditions (in particular, we develop the notion of a "strongly causal" graph topology) under which the true causality graph can be recovered via pairwise causality testing alone, and provide examples from the gene regulatory network literature suggesting that our concept of a strongly causal graph may be applicable to this field. We implement and detail finite-sample heuristics derived from our theory, and establish through simulation the efficiency gains (both statistical and computational) which can be obtained (in comparison to LASSO-type algorithms) when structural assumptions are met.
##### #7. Maximum Approximate Likelihood Estimation in Accelerated Failure Time Model for Interval-Censored Data
###### Zhong Guan
The approximate Bernstein polynomial model, a mixture of beta distributions, is applied to obtain maximum likelihood estimates of the regression coefficients, and the baseline density and survival functions in an accelerated failure time model based on interval-censored data, including current status data. The rates of convergence of the proposed estimates are given under some conditions for uncensored and interval-censored data. Simulation shows that the proposed method is better than its competitors. The proposed method is illustrated by fitting the Breast Cosmetic Data using the accelerated failure time model.
##### #8. Goodness-of-fit Testing in Linear Regression Models
###### Rok Blagus, Jakob Peterlin, Janez Stare
Model checking plays an important role in linear regression, as model misspecification seriously affects the validity and efficiency of regression analysis. In practice, model checking is often performed by subjectively evaluating the plot of the model's residuals. This approach is objectified by constructing a random process from the model's residuals; however, due to a very complex covariance function, obtaining the exact distribution of the test statistic is intractable. Several solutions to overcome this have been proposed, but the simulation-based and bootstrap-based approaches are only asymptotically valid and can, with a limited sample size, yield tests which have inappropriate size. We therefore propose to estimate the null distribution by using permutations. We show, under some mild assumptions, that with homoscedastic random errors this yields consistent tests under the null and the alternative hypotheses. Small sample properties of the proposed tests are studied in an extensive Monte Carlo simulation study, where it is...
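The permutation idea can be sketched as follows, under loud assumptions: a simple one-covariate linear model, a cumulative-sum process of residuals ordered by the covariate, and a sup-type statistic. The abstract does not specify the process or the permutation scheme, so the functions `fit_line`, `cusum_statistic`, and `permutation_p_value` are hypothetical and are not the authors' procedure.

```python
import random

def fit_line(x, y):
    # Ordinary least squares for y = intercept + slope * x
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return my - slope * mx, slope

def cusum_statistic(x, y):
    # Sup of the absolute cumulative sum of residuals, ordered by x
    a, b = fit_line(x, y)
    order = sorted(range(len(x)), key=lambda i: x[i])
    running, largest = 0.0, 0.0
    for i in order:
        running += y[i] - (a + b * x[i])
        largest = max(largest, abs(running))
    return largest / len(x) ** 0.5

def permutation_p_value(x, y, n_perm=199, seed=0):
    rng = random.Random(seed)
    a, b = fit_line(x, y)
    fitted = [a + b * xi for xi in x]
    resid = [yi - fi for yi, fi in zip(y, fitted)]
    observed = cusum_statistic(x, y)
    exceed = 0
    for _ in range(n_perm):
        shuffled = resid[:]
        rng.shuffle(shuffled)
        # Rebuild responses under the null model and refit
        y_star = [fi + ri for fi, ri in zip(fitted, shuffled)]
        if cusum_statistic(x, y_star) >= observed:
            exceed += 1
    return (1 + exceed) / (1 + n_perm)

# Toy data (hypothetical): a quadratic trend that a straight line misses
noise = random.Random(1)
n = 200
x = [-1 + 2 * i / (n - 1) for i in range(n)]
y = [1 + 2 * xi + 3 * xi ** 2 + noise.gauss(0, 0.1) for xi in x]
p_value = permutation_p_value(x, y)
```

Permuting residuals relies on the errors being exchangeable, which is why a homoscedasticity assumption is natural for this kind of scheme.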
##### #9. Detecting structural breaks in eigensystems of functional time series
###### Holger Dette, Tim Kutta
Detecting structural changes in functional data is a prominent topic in the statistical literature. However, not all trends in the data are important in applications, only those of sufficiently large influence. In this paper we address the problem of identifying relevant changes in the eigenfunctions and eigenvalues of covariance kernels of $L^2[0,1]$-valued time series. Using self-normalization techniques, we derive pivotal, asymptotically consistent tests for relevant changes in these characteristics of the second-order structure and investigate their finite sample properties in a simulation study. The applicability of our approach is demonstrated by analyzing German annual temperature data.
##### #10. Oracle inequalities for image denoising with total variation regularization
###### Francesco Ortelli, Sara van de Geer
We derive oracle results for discrete image denoising with a total variation penalty. We consider the least squares estimator with a penalty on the $\ell^1$-norm of the total discrete derivative of the image. This estimator falls into the class of analysis estimators. A bound on the effective sparsity by means of an interpolating matrix allows us to obtain oracle inequalities with fast rates. The bound is an extension of the bound by Ortelli and van de Geer [2019c] to the two-dimensional case. We also present an oracle inequality with slow rates, which matches, up to a log-term, the rate obtained for the same estimator by Mammen and van de Geer [1997]. The key ingredients for our results are the projection arguments to bound the empirical process due to Dalalyan et al. [2017].
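For orientation, a penalized least squares estimator of the kind described above can be written as follows. This is a sketch under assumptions: the normalization, the factor $2\lambda$, and the symbol $\Delta$ for the discrete derivative operator are notational choices not given in the abstract, and $Y$ denotes the observed $n_1 \times n_2$ image.

$$
\hat f \in \arg\min_{f \in \mathbb{R}^{n_1 \times n_2}} \left\{ \frac{1}{n_1 n_2} \lVert Y - f \rVert_2^2 + 2\lambda \lVert \Delta f \rVert_1 \right\}, \qquad \lambda > 0.
$$

The penalty acts on a linear transformation $\Delta f$ of the image rather than on $f$ itself, which is what places the estimator in the class of analysis estimators.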
