Top 10 Arxiv Papers Today in Applications


0.0 Mikeys
#1. Gaussian Process Landmarking for Three-Dimensional Geometric Morphometrics
Tingran Gao, Shahar Z. Kovalsky, Doug M. Boyer, Ingrid Daubechies
We demonstrate applications of the Gaussian process-based landmarking algorithm proposed in [T. Gao, S.Z. Kovalsky, I. Daubechies 2018] to geometric morphometrics, a branch of evolutionary biology centered at the analysis and comparisons of anatomical shapes, and compares the automatically sampled landmarks with the "ground truth" landmarks manually placed by evolutionary anthropologists; the results suggest that Gaussian process landmarks perform equally well or better, in terms of both spatial coverage and downstream statistical analysis. We provide a detailed exposition of numerical procedures and feature filtering algorithms for computing high-quality and semantically meaningful diffeomorphisms between disk-type anatomical surfaces.
more | pdf | html
Figures
Tweets
dizzy_my_future: RT @StatsPapers: Gaussian Process Landmarking for Three-Dimensional Geometric Morphometrics. https://t.co/bm1H1Sqt0l
Github

A Matlab implementation accompanying the paper "Gaussian Process Landmarking on Manifolds".

Repository: GPLmkBDMatch
User: shaharkov
Language: Matlab
Stargazers: 2
Subscribers: 3
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 12412
Unqiue Words: 3701

0.0 Mikeys
#2. Background information for meta-analysis evaluation, Info 00-Info 06
S. Stanley Young, Warren Kindzierski
Massive numbers of meta-analysis studies are being published. A Google Scholar search of -systematic review and meta-analysis- returns about 1.8 million hits, July 2018. There is a need to have some way to judge the reliability of a positive claim made in a meta-analysis that uses observational studies. Our idea is to examine the quality of the observational studies used in the meta-analysis and to examine the heterogeneity of those studies. We provide background information and examples: a listing of negative studies, a simulation of p-value plots, and three examples of p-value plots.
more | pdf | html
Figures
Tweets
StatsPapers: Background information for meta-analysis evaluation, Info 00-Info 06. https://t.co/ai5xSYNKTq
EvidenceRobot: RT @StatsPapers: Background information for meta-analysis evaluation, Info 00-Info 06. https://t.co/ai5xSYNKTq
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 3715
Unqiue Words: 1466

0.0 Mikeys
#3. Can Who-Edits-What Predict Edit Survival?
Ali Batuhan Yardım, Victor Kristof, Lucas Maystre, Matthias Grossglauser
As the number of contributors to online peer-production systems grows, it becomes increasingly important to predict whether the edits that users make will eventually be beneficial to the project. Existing solutions either rely on a user reputation system or consist of a highly specialized predictor that is tailored to a specific peer-production system. In this work, we explore a different point in the solution space that goes beyond user reputation but does not involve any content-based feature of the edits. We view each edit as a game between the editor and the component of the project. We posit that the probability that an edit is accepted is a function of the editor's skill, of the difficulty of editing the component and of a user-component interaction term. Our model is broadly applicable, as it only requires observing data about who makes an edit, what the edit affects and whether the edit survives or not. We apply our model on Wikipedia and the Linux kernel, two examples of large-scale peer-production systems, and we seek to...
more | pdf | html
Figures
Tweets
SHSHANK_GAURV: Can Who-Edits-What Predict Edit Survival? #Research https://t.co/h0H474YdfD
Github

The code for the paper "Can Who-Edits-What Predict Edit Survival?"

Repository: interank
User: lca4
Language: Jupyter Notebook
Stargazers: 1
Subscribers: 3
Forks: 1
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 8638
Unqiue Words: 2746

0.0 Mikeys
#4. Identifying Real Estate Opportunities using Machine Learning
Alejandro Baldominos, Antonio José Moreno, Rubén Iturrarte, Óscar Bernárdez, Carlos Afonso
The real estate market is exposed to many fluctuations in prices, because of existing correlations with many variables, some of which cannot be controlled or might even be unknown. Housing prices can increase rapidly (or in some cases, also drop very fast), yet the numerous listings available online where houses are sold or rented are not likely to be updated that often. In some cases, individuals interested in selling a house (or apartment) might include it in some online listing, and forget about updating the price. In other cases, some individuals might be interested in deliberately setting a price below the market price in order to sell the home faster, for various reasons. In this paper we aim at developing a machine learning application that identifies opportunities in the real estate market in real time, i.e., houses that are listed with a price substantially below the market price. This program can be useful for investors interested in the housing market. The application is formally implemented as a regression problem,...
more | pdf | html
Figures
Tweets
arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
arxivml: "Identifying Real Estate Opportunities using Machine Learning", Alejandro Baldominos, Antonio José Moreno, Rubén It… https://t.co/42ysmEJDlC
nmfeeds: [O] https://t.co/Hi63ob5QvQ Identifying Real Estate Opportunities using Machine Learning. The real estate market is expose...
StatsPapers: Identifying Real Estate Opportunities using Machine Learning. https://t.co/R6J5XrUSba
Gabriel_Oguna: RT @arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
puneethmishra: RT @arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
esigma6: RT @arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
StBlMuc: RT @arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
jie_song: RT @arxiv_org: Identifying Real Estate Opportunities using Machine Learning. https://t.co/RCKcdYLKOK https://t.co/yLicH0iWzZ
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 9464
Unqiue Words: 2668

0.0 Mikeys
#5. Optimal Design in Hierarchical Models with application in Multi-center Trials
Maryna Prus, Norbert Benda, Rainer Schwabe
Hierarchical random effect models are used for different purposes in clinical research and other areas. In general, the main focus is on population parameters related to the expected treatment effects or group differences among all units of an upper level (e.g. subjects in many settings). Optimal design for estimation of population parameters are well established for many models. However, optimal designs for the prediction for the individual units may be different. Several settings are identiffed in which individual prediction may be of interest. In this paper we determine optimal designs for the individual predictions, e.g. in multi-center trials, and compare them to a conventional balanced design with respect to treatment allocation. Our investigations show, that balanced designs are far from optimal if the treatment effects vary strongly as compared to the residual error and more subjects should be recruited to the active (new) treatment in multi-center trials. Nevertheless, effciency loss may be limited resulting in a moderate...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 3448
Unqiue Words: 901

0.0 Mikeys
#6. Automatic Detection and Diagnosis of Biased Online Experiments
Nanyu Chen, Min Liu, Ya Xu
We have seen a massive growth of online experiments at LinkedIn, and in industry at large. It is now more important than ever to create an intelligent A/B platform that can truly democratize A/B testing by allowing everyone to make quality decisions, regardless of their skillset. With the tremendous knowledge base created around experimentation, we are able to mine through historical data, and discover the most common causes for biased experiments. In this paper, we share four of such common causes, and how we build into our A/B testing platform the automatic detection and diagnosis of such root causes. These root causes range from design-imposed bias, self-selection bias, novelty effect and trigger-day effect. We will discuss in detail what each bias is and the scalable algorithm we developed to detect the bias. Surfacing up the existence and root cause of bias automatically for every experiment is an important milestone towards intelligent A/B testing.
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 7384
Unqiue Words: 2130

0.0 Mikeys
#7. Particle filters for applications in geosciences
Peter Jan van Leeuwen, Hans R. Künsch, Lars Nerger, Roland Potthast, Sebastian Reich
Particle filters contain the promise of fully nonlinear data assimilation. They have been applied in numerous science areas, but their application to the geosciences has been limited due to their inefficiency in high-dimensional systems in standard settings. However, huge progress has been made, and this limitation is disappearing fast due to recent developments in proposal densities, the use of ideas from (optimal) transportation, the use of localisation and intelligent adaptive resampling strategies. Furthermore, powerful hybrids between particle filters and ensemble Kalman filters and variational methods have been developed. We present a state of the art discussion of present efforts of developing particle filters for highly nonlinear geoscience state-estimation problems with an emphasis on atmospheric and oceanic applications, including many new ideas, derivations, and unifications, highlighting hidden connections, and generating a valuable tool and guide for the community. Initial experiments show that particle filters can be...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 22052
Unqiue Words: 3958

0.0 Mikeys
#8. Investigating the time dynamics of high frequency wind speed in complex terrains by using the Fisher-Shannon method: application to Switzerland
Fabian Guignard, Michele Lovallo, Mohamed Laib, Jean Golay, Mikhail Kanevski, Nora Helbig, Luciano Telesca
In this paper, the time dynamics of the daily means of wind speed measured in complex mountainous regions are investigated. For 293 measuring stations distributed over all Switzerland, the Fisher information measure and the Shannon entropy power are calculated. The results reveal a clear relationship between the computed measures and both the elevation of the wind stations and the slope of the measuring sites. In particular, the Shannon entropy power and the Fisher information measure have their highest (respectively lowest) values in the Alps mountains, where the time dynamics of wind speed follows a more disordered pattern. The spatial mapping of the calculated quantities allows the identification of two regions within Switzerland characterized by more or less organization/order in the time dynamics of wind speed, which is in agreement with the topography of the Swiss territory. The present study could contribute to a better characterization of the temporal dynamics of wind speed in complex mountainous terrains.
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 5182
Unqiue Words: 1774

0.0 Mikeys
#9. New formulation of the Logistic-Normal process to analyze trajectory tracking data
Gianluca Mastrantonio, Clara Grazian, Sara Mancinelli, Enrico Bibbona
Improved communication systems, shrinking battery sizes and the price drop of tracking devices have led to an increasing availability of trajectory tracking data. These data are often analyzed to understand animals behavior using mixture-type model. Due to their straightforward implementation and efficiency, hidden Markov mod- els are generally used but they are based on assumptions that are rarely verified on real data. In this work we propose a new model based on the Logistic-Normal process. Due to a new formalization and the way we specify the coregionalization matrix of the associated multivariate Gaussian process, we show that our model, differently from other proposals, is invariant with respect to the choice of the reference element and the ordering of the probability vectors components. We estimate the model under a Bayesian framework, using an approximation of the Gaussian process needed to avoid impractical computational time. After a simulation study, where we show the ability of the model to retrieve the parameters...
more | pdf | html
Figures
None.
Tweets
StatsPapers: New formulation of the Logistic-Normal process to analyze trajectory tracking data. https://t.co/PF0ZyDDYGR
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 11360
Unqiue Words: 3356

0.0 Mikeys
#10. A New SVDD-Based Multivariate Non-parametric Process Capability Index
Deovrat Kakde, Arin Chaudhuri, Diana Shaw
Process capability index (PCI) is a commonly used statistic to measure ability of a process to operate within the given specifications or to produce products which meet the required quality specifications. PCI can be univariate or multivariate depending upon the number of process specifications or quality characteristics of interest. Most PCIs make distributional assumptions which are often unrealistic in practice. This paper proposes a new multivariate non-parametric process capability index. This index can be used when distribution of the process or quality parameters is either unknown or does not follow commonly used distributions such as multivariate normal.
more | pdf | html
Figures
Tweets
arxiv_org: A New SVDD-Based Multivariate Non-parametric Process Capability Index. https://t.co/ckbpdR0H2a https://t.co/MwA7yYxdRT
arxivml: "A New SVDD-Based Multivariate Non-parametric Process Capability Index", Deovrat Kakde, Arin Chaudhuri, Diana Shaw https://t.co/R5sRvxiYZi
StatsPapers: A New SVDD-Based Multivariate Non-parametric Process Capability Index. https://t.co/SRSH8CwG24
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 4135
Unqiue Words: 1266

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.