This paper applies risk analysis to medical problems, through the properties
of nonlinear responses (convex or concave). It shows 1) necessary relations
between the nonlinearity of dose-response and the statistical properties of the
outcomes, particularly the effect of the variance (i.e., the expected frequency
of the various results and other properties such as their average and
variations); 2) the description of "antifragility" as a mathematical property
of locally convex response, with its generalization, and the designation of
"fragility" as its opposite, locally concave; 3) necessary relations between
dosage, severity of conditions, and iatrogenics. Iatrogenics, seen as the tail
risk from a given intervention, can be analyzed in a probabilistic
decision-theoretic way, linking probability to nonlinearity of response. There
is a necessary two-way mathematical relation between nonlinear response and the
tail risk of a given intervention. In short, we propose a framework to integrate
the necessary consequences of nonlinearities in...
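
The link between convexity and variance described above can be illustrated with a small numeric sketch of Jensen's inequality. The two response functions and the dose values below are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math
import statistics

def convex_response(dose: float) -> float:
    """Hypothetical convex dose-response (illustrative only)."""
    return dose ** 2

def concave_response(dose: float) -> float:
    """Hypothetical concave dose-response (illustrative only)."""
    return math.sqrt(dose)

def mean_response(response, doses) -> float:
    """Average outcome over a schedule of doses."""
    return statistics.fmean(response(d) for d in doses)

# Two schedules with the same average dose (10) but different variability.
steady_doses = [10.0, 10.0]
variable_doses = [5.0, 15.0]

# Gap = average outcome under variable dosing minus outcome under steady dosing.
convex_gap = (mean_response(convex_response, variable_doses)
              - mean_response(convex_response, steady_doses))
concave_gap = (mean_response(concave_response, variable_doses)
               - mean_response(concave_response, steady_doses))

print(f"convex gap:  {convex_gap:+.3f}")   # positive: variability raises the average outcome
print(f"concave gap: {concave_gap:+.3f}")  # negative: variability lowers the average outcome
```

With the average dose held fixed, variability raises the mean outcome under the convex response and lowers it under the concave one, which is the sense in which the sign of the nonlinearity determines the effect of variance.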

Owing to recent advances in high-throughput sequencing technologies, it has
become possible to directly analyze microbial communities in the human body
and in the environment. Knowledge of how microbes interact with each other and
form functional communities can provide a solid foundation to understand
microbiome-related diseases; this can serve as a key step towards precision
medicine. In order to understand how microbes form communities, we propose a
two-step approach: first, we infer the microbial co-occurrence network by
integrating a graph inference algorithm with phylogenetic information obtained
directly from metagenomic data. Next, we utilize a network-based community
detection algorithm to cluster microbes into functional groups where microbes
in each group are highly correlated. We also curate a "gold standard" network
based on the microbe-metabolic relationships which are extracted directly from
the metagenomic data. Utilizing community detection on the resulting microbial
metabolic pathway bipartite graph, the community...

Cloud computing offers on-demand, scalable computing and storage, and has
become an essential resource for the analyses of big biomedical data. The usual
approach to cloud computing requires users to reserve and provision virtual
servers. An emerging alternative is to have the provider allocate machine
resources dynamically. This type of serverless computing has tremendous
potential for biomedical research in terms of ease of use, instantaneous
scalability and cost effectiveness. In our proof-of-concept example, we
demonstrate how serverless computing provides low-cost access to hundreds of
CPUs, on demand, with little or no setup. In particular, we illustrate that the
all-against-all pairwise comparison among all unique human proteins can be
accomplished in approximately 2 minutes, at a cost of less than $1, using
Amazon Web Services Lambda. This is a 250x speedup compared to running the same
task on a typical laptop computer.
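
An all-against-all comparison is easy to parallelize because each pair can be scored independently. The sketch below shows one way the pair workload might be split into batches, one batch per function invocation. The protein identifiers, the batch size, and the helper names are hypothetical, and no cloud API is called. The last lines work out the laptop runtime implied by the figures quoted above.

```python
from itertools import combinations

def pair_count(n_items: int) -> int:
    """Number of unordered pairs among n_items distinct items."""
    return n_items * (n_items - 1) // 2

def pair_batches(items, batch_size: int):
    """Yield lists of at most batch_size unordered pairs (hypothetical helper)."""
    batch = []
    for pair in combinations(items, 2):
        batch.append(pair)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly shorter, batch
        yield batch

# Toy input: five placeholder identifiers, not real protein accessions.
proteins = ["P1", "P2", "P3", "P4", "P5"]
batches = list(pair_batches(proteins, batch_size=4))
print(pair_count(len(proteins)), [len(b) for b in batches])

# Arithmetic implied by the abstract: about 2 minutes with a 250x speedup.
serverless_minutes = 2
speedup = 250
implied_laptop_minutes = serverless_minutes * speedup
print(f"implied laptop runtime: about {implied_laptop_minutes / 60:.1f} hours")
```

Each batch could then be handed to a separate worker; how batches map to actual serverless invocations is left open here because the abstract does not specify it.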

Parameter estimation is a major challenge in computational modeling of
biological processes. This is especially the case in image-based modeling where
the inherently quantitative output of the model is measured against image data,
which is typically noisy and non-quantitative. In addition, these models can
have a high computational cost, limiting the number of feasible simulations,
and therefore rendering most traditional parameter estimation methods
unsuitable. In this paper, we present a pipeline that uses Gaussian process
learning to estimate biological parameters from noisy, non-quantitative image
data when the model has a high computational cost. This approach is first
successfully tested on a parametric function with the goal of retrieving the
original parameters. We then apply it to estimating parameters in a biological
setting by fitting artificial in-situ hybridization (ISH) data of the
developing murine limb bud. We expect that this method will be of use in a
variety of modeling scenarios where quantitative data is...

New technologies have enabled the investigation of biology and human health
at an unprecedented scale and in multiple dimensions. These dimensions include
myriad properties describing genome, epigenome, transcriptome, microbiome,
phenotype, and lifestyle. No single data type, however, can capture the
complexity of all the factors relevant to understanding a phenomenon such as a
disease. Integrative methods that combine data from multiple technologies have
thus emerged as critical statistical and computational approaches. The key
challenge in developing such approaches is the identification of effective
models to provide a comprehensive and relevant systems view. An ideal method
can answer a biological or medical question, identifying important features and
predicting outcomes, by harnessing heterogeneous data across several dimensions
of biological variation. In this Review, we describe the principles of data
integration and discuss current methods and available implementations. We
provide examples of successful data integration...

A new algorithm has been developed for delineation of significant points of
various electrocardiographic (ECG) signal waves, taking into account
information from all available leads and providing similar or higher accuracy
in comparison with other modern technologies. The test results for the QT
database show a sensitivity above 97% when detecting ECG wave peaks and 96% for
their onsets and offsets, as well as better positive predictive value compared
to the previously known algorithms. In contrast to the previously published
algorithms, the proposed approach also allows one to determine the morphology
of waves. The segmentation mean errors of all significant points are below the
tolerances defined by the Committee of General Standards for
Electrocardiography (CSE).
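
Sensitivity and positive predictive value for delineation are commonly computed by matching detected points to annotated points within a time tolerance. The sketch below shows one simple way to do this; the greedy matching rule, the tolerance, and the sample times are assumptions made for illustration and are not taken from the paper or from the CSE tolerances.

```python
def match_points(reference_ms, detected_ms, tolerance_ms):
    """Greedy one-to-one matching of detections to annotations.

    Returns (true positives, false negatives, false positives).
    """
    reference = sorted(reference_ms)
    detected = sorted(detected_ms)
    tp = 0
    i = j = 0
    while i < len(reference) and j < len(detected):
        diff = detected[j] - reference[i]
        if abs(diff) <= tolerance_ms:
            tp += 1      # detection lies within tolerance of the annotation
            i += 1
            j += 1
        elif diff < 0:
            j += 1       # detection too early to match: counts as a false positive
        else:
            i += 1       # annotation has no detection nearby: counts as a miss
    fn = len(reference) - tp
    fp = len(detected) - tp
    return tp, fn, fp

def sensitivity(tp, fn):
    return tp / (tp + fn) if (tp + fn) else 0.0

def positive_predictive_value(tp, fp):
    return tp / (tp + fp) if (tp + fp) else 0.0

# Hypothetical peak times in milliseconds and a hypothetical 20 ms tolerance.
annotated = [100, 900, 1700, 2500]
detected = [104, 890, 1300, 2510]
tp, fn, fp = match_points(annotated, detected, tolerance_ms=20)
print(tp, fn, fp, sensitivity(tp, fn), positive_predictive_value(tp, fp))
```

In this toy case three of the four annotated peaks are matched, one is missed, and one detection is spurious, so both measures equal 0.75.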

This tutorial introduces participants to the design and implementation of an
agent-based model using NetLogo through one of two different projects:
modelling T cell movement within a lymph node or modelling the progress of a
viral infection in an in vitro cell culture monolayer. Each project is broken
into a series of incremental steps of increasing complexity. Each step is
described in detail and the code to type in is initially provided. However,
each project has room to grow in complexity and biological realism so
participants are encouraged to expand their project beyond the scope of the
tutorial or to develop a project of their own.

Statistical and mathematical modeling are crucial to describe, interpret,
compare and predict the behavior of complex biological systems including the
organization of hematopoietic stem and progenitor cells in the bone marrow
environment. The current prominence of high-resolution and live-cell imaging
data provides an unprecedented opportunity to study the spatiotemporal dynamics
of these cells within their stem cell niche and learn more about aberrant, but
also unperturbed, normal hematopoiesis. However, this requires careful
quantitative statistical analysis of the spatial and temporal behavior of cells
and the interaction with their microenvironment. Moreover, such quantification
is a prerequisite for the construction of hypothesis-driven mathematical models
that can provide mechanistic explanations by generating spatiotemporal dynamics
that can be directly compared to experimental observations. Here, we provide a
brief overview of statistical methods for analyzing the spatial distribution of
cells, cell motility, cell shapes and...

Currently proposed solutions for the high dimensionality of the magnetic
resonance fingerprinting (MRF) reconstruction problem rely on a linear
compression step to reduce the matching computations and boost the efficiency
of fast but non-scalable searching schemes such as KD-trees. However, such
methodologies often introduce an
unfavourable compromise in the estimation accuracy when applied to nonlinear
data structures such as the manifold of Bloch responses with possible increased
dynamic complexity and growth in data population. To address this shortcoming,
we propose an inexact iterative reconstruction method, dubbed the Cover
BLoch response Iterative Projection (CoverBLIP). Iterative methods improve the
accuracy of their non-iterative counterparts and are additionally robust
against certain accelerated approximate updates, without compromising their
final accuracy. Leveraging these results, we accelerate matched filtering
using an approximate nearest neighbour search (ANNS) algorithm based on Cover
trees with a robustness feature against
the curse of dimensionality.

A deep-autoencoder-based content retrieval algorithm is proposed for
prediction and differentiation of cancer types based on the presence of
epigenetic patterns of DNA methylation identified in genetic regions known as
CpG islands. The developed deep learning system uses a CpG island state
classification sub-system to complete sets of missing/incomplete island data in
given human cell lines, and is then pipelined with an intricate set of
statistical and signal processing methods to accurately predict the presence of
cancer and further differentiate the type and cell of origin in the event of a
positive result. The proposed system was trained with previously reported data
derived from four case groups of cancer cell lines, achieving an overall
sensitivity of 88.24%, specificity of 83.33%, accuracy of 84.75% and Matthews
correlation coefficient of 0.687. The ability to predict and differentiate
cancer types using epigenetic events as the identifying patterns was
demonstrated in previously reported data sets from breast, lung,...
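
For reference, the four reported measures have standard definitions in terms of confusion-matrix counts for a two-class problem. The sketch below computes them; the counts are hypothetical and are not the paper's data, and the function name is an illustrative choice.

```python
import math

def binary_metrics(tp: int, fn: int, tn: int, fp: int) -> dict:
    """Standard two-class metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                 # true positive rate
    specificity = tn / (tn + fp)                 # true negative rate
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }

# Hypothetical counts chosen only to exercise the formulas.
metrics = binary_metrics(tp=40, fn=10, tn=45, fp=5)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Unlike accuracy, the Matthews correlation coefficient uses all four cells of the confusion matrix symmetrically, which makes it less sensitive to class imbalance.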

