Data mining is routinely used to organize ensembles of short temporal
observations so as to reconstruct useful, low-dimensional realizations of the
underlying dynamical systems. By analogy, we use data mining to organize
ensembles of a different type of short observations to reconstruct useful
realizations of bifurcation diagrams. Here the observations arise not through
temporal variation, but rather through the variation of input parameters to the
system: typical examples include short one-parameter steady state continuation
runs, recording components of the steady state along the continuation path
segment. We demonstrate how partial and disorganized "bifurcation observations"
can be integrated into coherent bifurcation surfaces whose dimensionality and
topology/parametrization can be systematically recovered in a data-driven
fashion. The approach can be justified through the Whitney and Takens embedding
theorems, allowing reconstruction of manifolds/attractors through observations.
We discuss extensions to different types of...

Bayesian inference is a widely used and powerful analytical technique in
fields such as astronomy and particle physics but has historically been
underutilized in some other disciplines including semiconductor devices. In
this work, we introduce Bayesim, a Python package that utilizes adaptive grid
sampling to efficiently generate a probability distribution over multiple input
parameters to a forward model using a collection of experimental measurements.
We discuss the implementation choices made in the code, showcase two examples
in photovoltaics, and discuss general prerequisites for the approach to apply
to other systems.
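
As an illustration of the adaptive grid idea described above, here is a minimal sketch in plain NumPy. It is not Bayesim's API: the straight-line `forward_model`, the synthetic data, the uniform prior, the Gaussian error model, and the `rel_threshold` rule for choosing which cells to refine are all assumptions made for this example.

```python
import numpy as np

def forward_model(a, b, x):
    # Hypothetical forward model (a straight line) standing in for a simulator.
    return a * x + b

def cell_probabilities(centers, widths, x_obs, y_obs, sigma):
    """Posterior mass per grid cell, assuming a uniform prior and
    independent Gaussian measurement errors of width sigma."""
    pred = forward_model(centers[:, 0:1], centers[:, 1:2], x_obs[None, :])
    log_like = -0.5 * np.sum(((pred - y_obs[None, :]) / sigma) ** 2, axis=1)
    # Mass is approximated as likelihood at the cell center times cell volume.
    log_mass = log_like + np.sum(np.log(widths), axis=1)
    mass = np.exp(log_mass - log_mass.max())
    return mass / mass.sum()

def subdivide(centers, widths, prob, rel_threshold=1e-6):
    """Split every cell above the threshold into 2 x 2 children.
    Cells below the threshold are discarded (an assumption of this sketch)."""
    keep = prob >= rel_threshold * prob.max()
    offsets = 0.25 * np.array([[-1, -1], [-1, 1], [1, -1], [1, 1]])
    children = (centers[keep][:, None, :]
                + offsets[None, :, :] * widths[keep][:, None, :])
    child_widths = np.repeat(widths[keep][:, None, :] / 2.0, 4, axis=1)
    return children.reshape(-1, 2), child_widths.reshape(-1, 2)

# Synthetic "measurements" from known parameters a = 2, b = -1.
rng = np.random.default_rng(0)
sigma = 0.5
x_obs = np.linspace(0.0, 1.0, 20)
y_obs = forward_model(2.0, -1.0, x_obs) + sigma * rng.normal(size=x_obs.size)

# Coarse 10 x 10 starting grid over a in [0, 4] and b in [-3, 1].
a_grid, b_grid = np.meshgrid(np.linspace(0.2, 3.8, 10),
                             np.linspace(-2.8, 0.8, 10), indexing="ij")
centers = np.column_stack([a_grid.ravel(), b_grid.ravel()])
widths = np.full_like(centers, 0.4)

# Refine only where the posterior mass is concentrated.
for _ in range(3):
    prob = cell_probabilities(centers, widths, x_obs, y_obs, sigma)
    centers, widths = subdivide(centers, widths, prob)

prob = cell_probabilities(centers, widths, x_obs, y_obs, sigma)
posterior_mean = prob @ centers
print("posterior mean (a, b):", posterior_mean)
```

Because the likelihood is evaluated only at cell centers, a starting grid that is too coarse relative to the posterior width can discard cells that still hold probability mass. A threshold this permissive is one simple guard against that.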

We present a multi-sensor Bayesian passive microwave retrieval algorithm for
flood inundation mapping at high spatial and temporal resolutions. The
algorithm takes advantage of observations from multiple sensors in optical,
short-infrared, and microwave bands, thereby allowing for detection and mapping
of the sub-pixel fraction of inundated areas under almost all-sky conditions.
The method relies on a nearest-neighbor search and a modern sparsity-promoting
inversion method that make use of an a priori dataset in the form of two joint
dictionaries. These dictionaries contain almost overlapping observations by the
Special Sensor Microwave Imager and Sounder (SSMIS) on board the Defense
Meteorological Satellite Program (DMSP) F17 satellite and the Moderate
Resolution Imaging Spectroradiometer (MODIS) on board the Aqua and Terra
satellites. Evaluation of the retrieval algorithm over the Mekong Delta shows
that it captures, to a good degree, the diurnal variability of inundation
due to localized convective precipitation. At...

X-ray diffraction (XRD) for crystal structure characterization is among the
most time-consuming and complex steps in the development cycle of novel
materials. We propose a machine-learning-enabled approach to predict
crystallographic dimensionality and space group from a limited number of
experimental thin-film XRD patterns. We overcome the sparse-data problem
intrinsic to novel materials development by coupling a supervised
machine-learning approach with a physics-based data augmentation strategy.
Using this approach, XRD spectrum acquisition and analysis take under 5.5
minutes, with accuracy comparable to human expert labeling. We simulate
experimental powder diffraction patterns from crystallographic information
contained in the Inorganic Crystal Structure Database (ICSD). We train a
classification algorithm using a combination of labeled simulated and
experimental augmented datasets, which account for thin-film characteristics
and measurement noise. As a test case, 88 metal-halide thin films spanning 3
dimensionalities and...

Motivated by the presence of deep connections among dynamical equations,
experimental data, physical systems, and statistical modeling, we report on a
series of findings uncovered by the authors and collaborators during the last
decade within the framework of the so-called Information Geometric Approach to
Chaos (IGAC). The IGAC is a theoretical modeling scheme that combines methods
of information geometry with inductive inference techniques to furnish
probabilistic descriptions of complex systems in the presence of limited
information. In addition to relying on curvature and Jacobi field computations,
a suitable indicator of complexity within the IGAC framework is given by the
so-called Information Geometric Entropy (IGE). The IGE is an information
geometric measure of complexity of geodesic paths on curved statistical
manifolds underlying the entropic dynamics of systems specified in terms of
probability distributions. In this manuscript, we discuss several illustrative
examples wherein our modeling scheme is employed to infer...

The reconstruction of broad resonances is important for understanding the
dynamics of heavy ion collisions. However, the large combinatorial background
makes this objective very challenging. In this work, we present an innovative
iterative method that identifies signal and background contributions without
input models for normalization constants. This technique is successfully validated
on a simulated thermal cocktail of resonances. This demonstrates that the
iterative procedure is a powerful tool to reconstruct multi-differentially
inclusive resonant signals in high multiplicity events as produced in heavy ion
collisions.

The accurate measurement of microscopic force fields is crucial in many
branches of science and technology, from biophotonics and mechanobiology to
microscopy and optomechanics. These forces are often probed by analysing their
influence on the motion of Brownian particles. Here, we introduce a powerful
algorithm for microscopic Force Reconstruction via Maximum-likelihood-estimator
(MLE) Analysis (FORMA) to retrieve the force field acting on a Brownian
particle from the analysis of its displacements. FORMA yields accurate
simultaneous estimations of both the conservative and non-conservative
components of the force field with important advantages over established
techniques, being parameter-free, requiring ten-fold less data and executing
orders-of-magnitude faster. We first demonstrate FORMA performance using
optical tweezers. We then show how, outperforming any other available
technique, FORMA can identify and characterise stable and unstable equilibrium
points in generic extended force fields. Thanks to its high performance,...
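
To illustrate the kind of estimate described above, the following is a minimal one-dimensional sketch, not the FORMA implementation. It assumes an overdamped particle in a harmonic trap with force -k x, so that the drift is linear in position and the maximum-likelihood estimate under Gaussian noise reduces to a least-squares fit. The function names, the dimensionless units, and the simulated trajectory are all hypothetical.

```python
import numpy as np

def simulate_trap(k_over_gamma, diffusion, dt, n_steps, rng):
    """Euler integration of the overdamped Langevin equation
    dx = -(k/gamma) x dt + sqrt(2 D dt) * w, with w standard normal."""
    x = np.empty(n_steps + 1)
    x[0] = 0.0
    noise = np.sqrt(2.0 * diffusion * dt) * rng.normal(size=n_steps)
    for i in range(n_steps):
        x[i + 1] = x[i] - k_over_gamma * x[i] * dt + noise[i]
    return x

def estimate_trap(x, dt):
    """Estimate k/gamma and D from displacements.
    For the linear model (dx/dt) = -(k/gamma) x + noise with constant
    Gaussian noise variance, least squares coincides with the MLE."""
    pos = x[:-1]
    drift = np.diff(x) / dt
    k_over_gamma = -np.sum(pos * drift) / np.sum(pos ** 2)
    residual = drift + k_over_gamma * pos
    # The residual variance equals 2 D / dt for this discretization.
    diffusion = 0.5 * dt * np.mean(residual ** 2)
    return k_over_gamma, diffusion

rng = np.random.default_rng(1)
dt = 1e-3
trajectory = simulate_trap(k_over_gamma=50.0, diffusion=1.0, dt=dt,
                           n_steps=200_000, rng=rng)
k_est, d_est = estimate_trap(trajectory, dt)
print("k/gamma estimate:", k_est, " D estimate:", d_est)
```

The estimate needs no tunable parameters such as bin widths or fitting windows. In two or more dimensions the same regression would return a matrix whose symmetric and antisymmetric parts correspond to conservative and non-conservative force components; this sketch does not cover that case.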

By analyzing sensitivity projections as a statistical estimation problem, we
evaluated different ways of treating radioassay measurement results (values and
upper limits) when projecting sensitivity for low-background experiments. We
developed a figure of merit that incorporates a notion of conservativeness to
quantitatively explore the consequences of attempts to bias sensitivity
projections, and proposed a method to report sensitivity.

Efficiently estimating the integral of functions in high dimensional spaces
is a non-trivial task when an analytical solution cannot be calculated. An
oft-encountered example is the calculation of the marginal likelihood, in a
context where a sampling algorithm such as Markov chain Monte Carlo provides
samples of the function. We present the Adaptive Harmonic Mean Integration
(AHMI) algorithm. Given samples drawn according to a probability distribution
proportional to the function, the algorithm will estimate the integral of the
function and the uncertainty of the estimate by applying a harmonic mean
estimator to adaptively chosen subvolumes of the parameter space. We describe
the algorithm and its mathematical properties, and report results from applying
it to multiple test cases of up to 20 dimensions.
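
The core identity behind a harmonic mean estimator restricted to a subvolume can be sketched as follows. This is a minimal illustration, not the AHMI algorithm itself: it uses a single fixed hyper-rectangle instead of adaptively chosen subvolumes, gives no uncertainty estimate, and draws independent samples directly instead of using MCMC. The test function `log_f` and the box bounds are assumptions made for the example. For samples x_i drawn from f/I, the expectation of 1_Δ(x)/f(x) equals V_Δ/I, where V_Δ is the volume of the subvolume Δ, so I can be estimated as V_Δ divided by the sample mean of that quantity.

```python
import numpy as np

def log_f(x):
    # Hypothetical test function: unnormalized standard Gaussian,
    # whose integral over R^d is (2*pi)**(d/2).
    return -0.5 * np.sum(x ** 2, axis=1)

def harmonic_mean_integral(samples, log_f_vals, lower, upper):
    """Estimate the integral of f from samples distributed as f / I,
    using only the samples that fall inside the box [lower, upper]."""
    inside = np.all((samples >= lower) & (samples <= upper), axis=1)
    volume = np.prod(upper - lower)
    # Sample mean of 1_box(x) / f(x) over ALL samples estimates V_box / I.
    inv_f_mean = np.sum(np.exp(-log_f_vals[inside])) / samples.shape[0]
    return volume / inv_f_mean

dim = 3
rng = np.random.default_rng(2)
# Independent draws from the normalized density f / I.
samples = rng.normal(size=(200_000, dim))
lower = -np.ones(dim)
upper = np.ones(dim)

estimate = harmonic_mean_integral(samples, log_f(samples), lower, upper)
exact = (2.0 * np.pi) ** (dim / 2)
print("estimate:", estimate, " exact:", exact)
```

Restricting to a box around the mode keeps 1/f bounded, which avoids the heavy-tailed behavior of a harmonic mean taken over the full space.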

A fundamental problem in geophysical modeling is related to the
identification and approximation of causal structures among physical processes.
However, resolving the bidirectional mappings between physical parameters and
model state variables (i.e., solving the forward and inverse problems) is
challenging, especially when parameter dimensionality is high. Deep learning
has opened a new door toward knowledge representation and complex pattern
identification. In particular, the recently introduced generative adversarial
networks (GANs) hold strong promise for learning cross-domain mappings for
image translation. This study presents a state-parameter identification GAN
(SPID-GAN) for simultaneously learning bidirectional mappings between a
high-dimensional parameter space and the corresponding model state space.
SPID-GAN is demonstrated using a series of representative problems from
subsurface flow modeling. Results show that SPID-GAN achieves satisfactory
performance in identifying the bidirectional state-parameter...

