Top 10 Arxiv Papers Today in Quantitative Methods


0.0 Mikeys
#1. (Anti)Fragility and Convex Responses in Medicine
Nassim Nicholas Taleb
This paper applies risk analysis to medical problems, through the properties of nonlinear responses (convex or concave). It shows 1) necessary relations between the nonlinearity of dose-response and the statistical properties of the outcomes, particularly the effect of the variance (i.e., the expected frequency of the various results and other properties such as their average and variations); 2) The description of "antifragility" as a mathematical property for local convex response and its generalization and the designation "fragility" as its opposite, locally concave; 3) necessary relations between dosage, severity of conditions, and iatrogenics. Iatrogenics seen as the tail risk from a given intervention can be analyzed in a probabilistic decision-theoretic way, linking probability to nonlinearity of response. There is a necessary two-way mathematical relation between nonlinear response and the tail risk of a given intervention. In short we propose a framework to integrate the necessary consequences of nonlinearities in...
more | pdf | html
Figures
Tweets
BioPapers: (Anti)Fragility and Convex Responses in Medicine. https://t.co/DzbqVtRqG2
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 8330
Unqiue Words: 3096

0.0 Mikeys
#2. PGLasso: Microbial Community Detection through Phylogenetic Graphical Lasso
Chieh Lo, Radu Marculescu
Due to the recent advances in high-throughput sequencing technologies, it becomes possible to directly analyze microbial communities in the human body and in the environment. Knowledge of how microbes interact with each other and form functional communities can provide a solid foundation to understand microbiome related diseases; this can serve as a key step towards precision medicine. In order to understand how microbes form communities, we propose a two step approach: First, we infer the microbial co-occurrence network by integrating a graph inference algorithm with phylogenetic information obtained directly from metagenomic data. Next, we utilize a network-based community detection algorithm to cluster microbes into functional groups where microbes in each group are highly correlated. We also curate a "gold standard" network based on the microbe-metabolic relationships which are extracted directly from the metagenomic data. Utilizing community detection on the resulting microbial metabolic pathway bipartite graph, the community...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 3531
Unqiue Words: 1336

0.0 Mikeys
#3. Serverless computing provides on-demand high performance computing for biomedical research
Dimitar Kumanov, Ling-Hong Hung, Wes Lloyd, Ka Yee Yeung
Cloud computing offers on-demand, scalable computing and storage, and has become an essential resource for the analyses of big biomedical data. The usual approach to cloud computing requires users to reserve and provision virtual servers. An emerging alternative is to have the provider allocate machine resources dynamically. This type of serverless computing has tremendous potential for biomedical research in terms of ease-of-use, instantaneous scalability and cost effectiveness. In our proof of concept example, we demonstrate how serverless computing provides low cost access to hundreds of CPUs, on demand, with little or no setup. In particular, we illustrate that the all-against-all pairwise comparison among all unique human proteins can be accomplished in approximately 2 minutes, at a cost of less than $1, using Amazon Web Services Lambda. This is a 250x speedup compared to running the same task on a typical laptop computer.
more | pdf | html
Figures
Tweets
Github

Computational tasks performance in parallel using AWS Lambda and example

Repository: TaskPerform_AWSLambda
User: BioDepot
Language: Python
Stargazers: 0
Subscribers: 9
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 2375
Unqiue Words: 1081

0.0 Mikeys
#4. Global optimization using Gaussian Processes to estimate biological parameters from image data
Diana Barac, Michael D. Multerer, Dagmar Iber
Parameter estimation is a major challenge in computational modeling of biological processes. This is especially the case in image-based modeling where the inherently quantitative output of the model is measured against image data, which is typically noisy and non-quantitative. In addition, these models can have a high computational cost, limiting the number of feasible simulations, and therefore rendering most traditional parameter estimation methods unsuitable. In this paper, we present a pipeline that uses Gaussian process learning to estimate biological parameters from noisy, non-quantitative image data when the model has a high computational cost. This approach is first successfully tested on a parametric function with the goal of retrieving the original parameters. We then apply it to estimating parameters in a biological setting by fitting artificial in-situ hybridization (ISH) data of the developing murine limb bud. We expect that this method will be of use in a variety of modeling scenarios where quantitative data is...
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 10134
Unqiue Words: 2607

0.0 Mikeys
#5. Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities
Marinka Zitnik, Francis Nguyen, Bo Wang, Jure Leskovec, Anna Goldenberg, Michael M. Hoffman
New technologies have enabled the investigation of biology and human health at an unprecedented scale and in multiple dimensions. These dimensions include myriad properties describing genome, epigenome, transcriptome, microbiome, phenotype, and lifestyle. No single data type, however, can capture the complexity of all the factors relevant to understanding a phenomenon such as a disease. Integrative methods that combine data from multiple technologies have thus emerged as critical statistical and computational approaches. The key challenge in developing such approaches is the identification of effective models to provide a comprehensive and relevant systems view. An ideal method can answer a biological or medical question, identifying important features and predicting outcomes, by harnessing heterogeneous data across several dimensions of biological variation. In this Review, we describe the principles of data integration and discuss current methods and available implementations. We provide examples of successful data integration...
more | pdf | html
Figures
Tweets
michaelhoffman: @ShawnMcGuirk @cdavidnaylor Ha! There's an identical preprint available on @arxiv if you need something after the 50 d are up: https://t.co/Y5BH159gJy
michaelhoffman: BTW if you're interested in this sort of integration problem, check out our review "Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities" #JSM2018 https://t.co/Y5BH15qRB6
michaelhoffman: 1/Now available on arXiv: a review of machine learning techniques for integrating multiple data sources in biology and medicine. Led by @marinkazitnik, with contributions from @ACatGatAtATac, Bo Zhang, @jure, @nyulik, and me. https://t.co/Y5BH159gJy
tonets: 28頁、引用300件超の大作がアップされてた [1807.00123] Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities https://t.co/r17ieoqcmT
rnomics: Top #tweeted story in #bioscience: [1807.00123] Machine Learning for Integratin… https://t.co/mqba61cDtT, see more https://t.co/kME3G1o122
BioSquat: [1807.00123] Machine Learning for Integrating Data in Biology and Medicine: P… https://t.co/VikzidDJFe, see more https://t.co/UaSzAxtGUD
mymarkup: #preprintwatch “Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities” https://t.co/6Gwtp36ztK
Jeew333T: Machine learning to answer a biological or medical question, by harnessing heterogeneous data across biol dimensions https://t.co/FbNcBzxjCd
hemonserrat: #MachineLearning [1807.00123] Machine Learning for Integrating Data in Biology… https://t.co/okNNgwcu29, see more https://t.co/5aEBMnesmM
MuinJKhoury: Check out this thorough and timely review on machine learning for integrating data in biology and medicine. https://t.co/H3xtyY8ZrY https://t.co/6Uo5Xake35
bhaibeka: Nice review from @michaelhoffman Lab Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities https://t.co/FaJNEEEYu6
AlexIrrthum: Nice review on ML approaches for data integration in biology and medicine https://t.co/WO27g41cce @marinkazitnik @michaelhoffman https://t.co/scZ56rv540
sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
victor_ruiz: Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities https://t.co/pplfAUxUvn
mikaelhuss: https://t.co/nEQRaS1Xjm Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities
MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
quaidmorris: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
mahonylab: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
DrNiazChowdhury: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
girirajan16: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
gerlach_d: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
SamanthaLWilson: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
StructBioinfo: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
MethylNation: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
thanhleviet: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
maubarsom: RT @mikaelhuss: https://t.co/nEQRaS1Xjm Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunit…
PrecursorCell: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
nibuasamoht: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
DaneshMoradi: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
galib_ewu36: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
Zhaleh_julie: RT @MKarimzade: Machine learning for integrating data in biology and medicine: https://t.co/RCHo3FoEaY
SJKBio: RT @AlexIrrthum: Nice review on ML approaches for data integration in biology and medicine https://t.co/WO27g41cce @marinkazitnik @michaelh…
01717257469: RT @sroyyors: Stumbled upon this very interesting review on ml algorithms in biology https://t.co/W93E5szVv1
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 26886
Unqiue Words: 7170

0.0 Mikeys
#6. Finding morphology points of electrocardiographic signal waves using wavelet analysis
Alena I. Kalyakulina, Igor I. Yusipov, Victor A. Moskalenko, Alexander V. Nikolskiy, Artem A. Kozlov, Nikolay Yu. Zolotykh, Mikhail V. Ivanchenko
A new algorithm has been developed for delineation of significant points of various electrocardiographic signal (ECG) waves, taking into account information from all available leads and providing similar or higher accuracy in comparison with other modern technologies. The test results for the QT database show a sensitivity above 97% when detecting ECG wave peaks and 96% for their onsets and offsets, as well as better positive predictive value compared to the previously known algorithms. In contrast to the previously published algorithms, the proposed approach also allows one to determine the morphology of waves. The segmentation mean errors of all significant points are below the tolerances defined by the Committee of General Standards for Electrocardiography (CSE).
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 8147
Unqiue Words: 2018

0.0 Mikeys
#7. Tutorial on agent-based models in NetLogo applied to immunology and virology
Catherine A. A. Beauchemin, Laura E. Liao, Kenneth Blahut
This tutorial introduces participants to the design and implementation of an agent-based model using NetLogo through one of two different projects: modelling T cell movement within a lymph node or modelling the progress of a viral infection in an in vitro cell culture monolayer. Each project is broken into a series of incremental steps of increasing complexity. Each step is described in detail and the code to type in is initially provided. However, each project has room to grow in complexity and biological realism so participants are encouraged to expand their project beyond the scope of the tutorial or to develop a project of their own.
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6851
Unqiue Words: 1661

0.0 Mikeys
#8. Statistical and mathematical modeling of spatiotemporal dynamics of stem cells
Walter de Back, Thomas Zerjatke, Ingo Roeder
Statistical and mathematical modeling are crucial to describe, interpret, compare and predict the behavior of complex biological systems including the organization of hematopoietic stem and progenitor cells in the bone marrow environment. The current prominence of high-resolution and live-cell imaging data provides an unprecedented opportunity to study the spatiotemporal dynamics of these cells within their stem cell niche and learn more about aberrant, but also unperturbed, normal hematopoiesis. However, this requires careful quantitative statistical analysis of the spatial and temporal behavior of cells and the interaction with their microenvironment. Moreover, such quantification is a prerequisite for the construction of hypothesis-driven mathematical models that can provide mechanistic explanations by generating spatiotemporal dynamics that can be directly compared to experimental observations. Here, we provide a brief overview of statistical methods in analyzing spatial distribution of cells, cell motility, cell shapes and...
more | pdf | html
Figures
Tweets
wdeback: Our new preprint on quantitative analysis in phenotypic profiling. A practical introduction in statistical modeling of cell motility, cell shape, spatial distributions and cellular lineage trees. We hope it's clear and useful. https://t.co/8PZEVZTRBC https://t.co/c3bJWU5cwo
BioPapers: Statistical and mathematical modeling of spatiotemporal dynamics of stem cells. https://t.co/QHt1QMsjGI
BioPapers: Statistical and mathematical modeling of spatiotemporal dynamics of stem cells. https://t.co/QHt1QMaIi8
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 9845
Unqiue Words: 3108

0.0 Mikeys
#9. CoverBLIP: scalable iterative matched filtering for MR Fingerprint recovery
Mohammad Golbabaee, Zhouye Chen, Yves Wiaux, Mike E. Davies
Current proposed solutions for the high dimensionality of the MRF reconstruction problem rely on a linear compression step to reduce the matching computations and boost the efficiency of fast but non-scalable searching schemes such as the KD-trees. However such methodologies often introduce an unfavourable compromise in the estimation accuracy when applied to nonlinear data structures such as the manifold of Bloch responses with possible increased dynamic complexity and growth in data population. To address this shortcoming we propose an inexact iterative reconstruction method, dubbed as the Cover BLoch response Iterative Projection (CoverBLIP). Iterative methods improve the accuracy of their non-iterative counterparts and are additionally robust against certain accelerated approximate updates, without compromising their final accuracy. Leveraging on these results, we accelerate matched-filtering using an ANNS algorithm based on Cover trees with a robustness feature against the curse of dimensionality.
more | pdf | html
Figures
Tweets
arxivml: "CoverBLIP: scalable iterative matched filtering for MR Fingerprint recovery", Mohammad Golbabaee, Zhouye Chen, Yve… https://t.co/OmArhNHBs7
BioPapers: CoverBLIP: scalable iterative matched filtering for MR Fingerprint recovery. https://t.co/ySl5400gG7
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 1730
Unqiue Words: 913

0.0 Mikeys
#10. A Deep Autoencoder System for Differentiation of Cancer Types Based on DNA Methylation State
Mohammed Khwaja, Melpomeni Kalofonou, Chris Toumazou
A Deep Autoencoder based content retrieval algorithm is proposed for prediction and differentiation of cancer types based on the presence of epigenetic patterns of DNA methylation identified in genetic regions known as CpG islands. The developed deep learning system uses a CpG island state classification sub-system to complete sets of missing/incomplete island data in given human cell lines, and is then pipelined with an intricate set of statistical and signal processing methods to accurately predict the presence of cancer and further differentiate the type and cell of origin in the event of a positive result. The proposed system was trained with previously reported data derived from four case groups of cancer cell lines, achieving overall Sensitivity of 88.24%, Specificity of 83.33%, Accuracy of 84.75% and Matthews Correlation Coefficient of 0.687. The ability to predict and differentiate cancer types using epigenetic events as the identifying patterns was demonstrated in previously reported data sets from breast, lung,...
more | pdf | html
Figures
Tweets
nmfeeds: [O] https://t.co/sz8RnRFDn3 A Deep Autoencoder System for Differentiation of Cancer Types Based on DNA Methylation State. ...
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 7015
Unqiue Words: 2203

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.