Top 10 Arxiv Papers Today in Other Statistics


0.0 Mikeys
#1. Can everyday AI be ethical. Fairness of Machine Learning Algorithms
Philippe Besse, Celine Castets-Renard, Aurelien Garivier, Jean-Michel Loubes
Combining big data and machine learning algorithms, the power of automatic decision tools induces as much hope as fear. Many recently enacted European legislation (GDPR) and French laws attempt to regulate the use of these tools. Leaving aside the well-identified problems of data confidentiality and impediments to competition, we focus on the risks of discrimination, the problems of transparency and the quality of algorithmic decisions. The detailed perspective of the legal texts, faced with the complexity and opacity of the learning algorithms, reveals the need for important technological disruptions for the detection or reduction of the discrimination risk, and for addressing the right to obtain an explanation of the auto- matic decision. Since trust of the developers and above all of the users (citizens, litigants, customers) is essential, algorithms exploiting personal data must be deployed in a strict ethical framework. In conclusion, to answer this need, we list some ways of controls to be developed: institutional control,...
more | pdf | html
Figures
Tweets
ZelrosAI: Can everyday AI be ethical? Fairness of Machine Learning Algorithms #XAI https://t.co/vjdLbpaOaC https://t.co/Gc4zHigiC9
arxivml: "Can everyday AI be ethical. Fairness of Machine Learning Algorithms", Philippe Besse, Celine Castets-Renard, Aurel… https://t.co/4ZDyiQQa8k
SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
unelmaplatforms: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/HjJ6nLXBoy
Pvalsfr: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
ThiboNeveu: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
sussenglish: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
officialAAAC: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 12229
Unqiue Words: 3833

0.0 Mikeys
#2. Benchmarking in cluster analysis: A white paper
Anne-Laure Boulesteix, Rainer Dangl, Nema Dean, Isabelle Guyon, Christian Hennig, Friedrich Leisch, Douglas Steinley, Iven Van Mechelen
To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance. This means that proposals of new methods of data pre-processing, new data-analytic techniques, and new methods of output post-processing, should be extensively and carefully compared with existing alternatives, and that existing methods should be subjected to neutral comparison studies. To date, benchmarking and recommendations for benchmarking have been frequently seen in the context of supervised learning. Unfortunately, there has been a dearth of guidelines for benchmarking in an unsupervised setting, with the area of clustering as an important subdomain. To address this problem, discussion is given to the theoretical conceptual underpinnings of benchmarking in the field of cluster analysis by means of simulated as well as empirical data. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made.
more | pdf | html
Figures
None.
Tweets
StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
kieranrcampbell: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
cartalop: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
oozingslack: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 8
Total Words: 10895
Unqiue Words: 2753

0.0 Mikeys
#3. I can see clearly now: reinterpreting statistical significance
Jonathan Dushoff, Morgan P. Kain, Benjamin M. Bolker
Null hypothesis significance testing remains popular despite decades of concern about misuse and misinterpretation. We believe that much of the problem is due to language: significance testing has little to do with other meanings of the word "significance". Despite the limitations of null-hypothesis tests, we argue here that they remain useful in many contexts as a guide to whether a certain effect can be seen clearly in that context (e.g. whether we can clearly see that a correlation or between-group difference is positive or negative). We therefore suggest that researchers describe the conclusions of null-hypothesis tests in terms of statistical "clarity" rather than statistical "significance". This simple semantic change could substantially enhance clarity in statistical communication.
more | pdf | html
Figures
None.
Tweets
alexvespi: I can see clearly now: reinterpreting statistical significance suggestion: “researchers describe the conclusions of null-hypothesis tests in terms of statistical "clarity" rather than statistical "significance"” https://t.co/SLfygFRXmN https://t.co/seV9paaXVS
jebyrnes: Really intrigued/on board with this proposal to talk about statistical clarity instead of significance - e.g. "There was no clear difference between groups" - https://t.co/9wbhQrs8dt - from @bolkerb and others.
noamross: Interesting paper! @jd_mathbio, @MPKain + @bolkerb argue for using "statistically clear" over "statistically significant" to describe the results of null hypothesis testing. "I can see clearly now: reinterpreting statistical significance", on @arxiv_org https://t.co/QJD9HacB7H https://t.co/udjqlXWbwP
ingorohlfing: suggest researchers describe #NHST in terms of statistical “clarity” rather than “significance” https://t.co/zm6Uxck5MU Don't think it would change a thing https://t.co/5FKo9uEgm2
zerdeve: i was tempted to snark but i’ll resist. however semantics is the least of our worries here (if it is at all). besides, “marginally clear” is even easier to say than “marginally significant”. https://t.co/bHnuXJ1Lvf https://t.co/ciRRH111uL
ruth_baker: Very interesting talk by @jd_mathbio - I can see clearly now: reinterpreting statistical significance: https://t.co/ASL4SHxEHv #StatisticalClarity #birsmath
jillagal: 3rd day at #birsmath and Jonathan Dushoff talks about how the language around p values and statistical significance is misleading #statisticalClarity https://t.co/BvXtpj9vLK
pf_mg: Jonathan Dushoff arguing we should use #StatisticalClarity instead of statistical significance. https://t.co/zEmvKuL5ye Reminds me of the famous Camus quote ! https://t.co/XrUhxiq2wu
CodyJDey: "The language of “statistical clarity” could help researchers escape various logical traps while interpreting the results of null hypothesis significance testing" https://t.co/dMaOeXy9z6 @bolkerb @jd_mathbio https://t.co/XcvA4SNlmy
MichaelPlankNZ: I can see clearly now: reinterpreting statistical significance by Jonathan Dushoff et al https://t.co/CqlU6Wcijo #birsmath #statisticalClarity
anhsmith: Should we say "statistically unclear" instead of "statistically insignificant" for a p-value > 0.05? It could help avoid a common misinterpretation: evidence that the effect is negligible. An interesting suggestion by @jd_mathbio @bolkerb https://t.co/HtYt0x3ywL
StatsPapers: I can see clearly now: reinterpreting statistical significance. https://t.co/pMmdwS7HLt
Sri_Rad15: @nntaleb your ideas have begun to fructify to some extent https://t.co/dgKNDcrrNf
jwalkrunski: A good conversation starter. Can we say “the effect is trending toward statistically clear (p=0.069)?” 😀[1810.06387] I can see clearly now: reinterpreting statistical significance https://t.co/1q0uzGlFwi
MisVoces: RT @StatsPapers: I can see clearly now: reinterpreting statistical significance. https://t.co/pMmdwS7HLt
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 2529
Unqiue Words: 1132

0.0 Mikeys
#4. Complementary Lipschitz continuity results for the distribution of intersections or unions of independent random sets in finite spaces
John Klein
We prove that intersections and unions of independent random sets in finite spaces achieve a form of Lipschitz continuity. More precisely, given the distribution of a random set $\Xi$, the function mapping any random set distribution to the distribution of its intersection (under independence assumption) with $\Xi$ is Lipschitz continuous with unit Lipschitz constant if the space of random set distributions is endowed with a metric defined as the $L_k$ norm distance between inclusion functionals also known as commonalities. Moreover, the function mapping any random set distribution to the distribution of its union (under independence assumption) with $\Xi$ is Lipschitz continuous with unit Lipschitz constant if the space of random set distributions is endowed with a metric defined as the $L_k$ norm distance between hitting functionals also known as plausibilities. Using the epistemic random set interpretation of belief functions, we also discuss the ability of these distances to yield conflict measures. All the proofs in this...
more | pdf | html
Figures
None.
Tweets
StatsPapers: Complementary Lipschitz continuity results for the distribution of intersections or unions of independent random sets in finite spaces. https://t.co/oKANb1d4hs
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 11062
Unqiue Words: 2306

0.0 Mikeys
#5. On the mathematics of the free-choice paradigm
Peter Selinger, Kristopher Tapp
Chen and Risen pointed out a logical flaw affecting the conclusions of a number of past experiments that used the free-choice paradigm to measure choice-induced attitude change. They went on to design and implement a free-choice experiment that used a novel type of control group in order to avoid this logical pitfall. In this paper, we describe a method by which a free-choice experiment can be correctly conducted even without a control group.
more | pdf | html
Figures
None.
Tweets
StatsPapers: On the mathematics of the free-choice paradigm. https://t.co/kTXILShfmF
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 7463
Unqiue Words: 2027

0.0 Mikeys
#6. Allocations of Standby Redundancies to Coherent Systems with Dependent Components
Yiying Zhang
In the context of industrial engineering, standby allocation strategy is usually adopted by engineers to improve the lifetimes of coherent systems. This paper investigates the optimal allocation strategies of standby redundancies for coherent systems comprised of dependent components having left tail weakly stochastic arrangement increasing or right tail weakly stochastic arrangement increasing lifetimes. For the case of independent matched heterogeneous standby redundancies, it is proved that the better redundancy should be put in the node with weaker[better] component in a series[parallel] system. For the case of independent homogeneous standby redundancies, it is shown that more redundancies should be put in standby with weaker[better] component to improve the lifetime of a series[parallel] system. The results developed here generalize and extend those related ones in the literature to the case of dependent components. Numerical examples are presented to provide guidances for practical use of our theoretical findings....
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 9699
Unqiue Words: 1826

0.0 Mikeys
#7. Data scraping, ingestation, and modeling: bringing data from cars.com into the intro stats class
Sarah McDonald, Nicholas Jon Horton
New tools have made it much easier for students to develop skills to work with interesting data sets as they begin to extract meaning from data. To fully appreciate the statistical analysis cycle, students benefit from repeated experiences collecting, ingesting, wrangling, analyzing data and communicating results. How can we bring such opportunities into the classroom? We describe a classroom activity, originally developed by Danny Kaplan (Macalester College), in which students can expand upon statistical problem solving by hand-scraping data from cars.com, ingesting these data into R, then carrying out analyses of the relationships between price, mileage, and model year for a selected type of car.
more | pdf | html
Figures
Tweets
Github

Cars.com scraping and multivariate analysis CAUSE activity webinar

Repository: Cars-Scraping-Webinar
User: Amherst-Statistics
Language: TeX
Stargazers: 0
Subscribers: 3
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 2514
Unqiue Words: 1142

0.0 Mikeys
#8. A Conversation with Jon Wellner
Moulinath Banerjee, Richard J. Samworth
Jon August Wellner was born in Portland, Oregon, in August 1945. He received his Bachelor's degree from the University of Idaho in 1968 and his PhD degree from the University of Washington in 1975. From 1975 until 1983 he was an Assistant Professor and Associate Professor at the University of Rochester. In 1983 he returned to the University of Washington, and has remained at the UW as a faculty member since that time. Over the course of a long and distinguished career, Jon has made seminal contributions to a variety of areas including empirical processes, semiparametric theory, and shape-constrained inference, and has co-authored a number of extremely influential books. He has been honored as the Le Cam lecturer by both the IMS (2015) and the French Statistical Society (2017). He is a Fellow of the IMS, the ASA, and the AAAS, and an elected member of the International Statistical Institute. He has served as co-Editor of Annals of Statistics (2001--2003) and Editor of Statistical Science (2010--2013), and President of IMS...
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 10575
Unqiue Words: 3009

0.0 Mikeys
#9. Game time: statistical contests in the classroom
Sam Doerken, Martin Schumacher, Franz Baumdicker
We describe a contest in variable selection which was part of a statistics course for graduate students. In particular, the possibility to create a contest themselves offered an additional challenge for more advanced students. Since working with data is becoming more important in teaching statistics, we greatly encourage other instructors to try the same.
more | pdf | html
Figures
None.
Tweets
StatsPapers: Game time: statistical contests in the classroom. https://t.co/moZF1cnMwS
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 3264
Unqiue Words: 1302

0.0 Mikeys
#10. Perspective from the Literature on the Role of Expert Judgment in Scientific and Statistical Research and Practice
Naomi C Brownstein
This article, produced as a result of the Symposium on Statistical Inference, is an introduction to the literature on the function of expertise, judgment, and choice in the practice of statistics and scientific research. In particular, expert judgment plays a critical role in conducting Frequentist hypothesis tests and Bayesian models, especially in selection of appropriate prior distributions for model parameters. The subtlety of interpreting results is also discussed. Finally, external recommendations are collected for how to more effectively encourage proper use of judgment in statistics. The paper synthesizes the literature for the purpose of creating a single reference and inciting more productive discussions on how to improve the future of statistics and science.
more | pdf | html
Figures
None.
Tweets
StatsPapers: Perspective from the Literature on the Role of Expert Judgment in Scientific and Statistical Research and Practice. https://t.co/XusFQIkr4H
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 12020
Unqiue Words: 3845

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.