### Top 10 Arxiv Papers Today in Other Statistics

##### #1. Can everyday AI be ethical. Fairness of Machine Learning Algorithms
###### Philippe Besse, Celine Castets-Renard, Aurelien Garivier, Jean-Michel Loubes
Combining big data and machine learning algorithms, the power of automatic decision tools induces as much hope as fear. Many recently enacted European legislation (GDPR) and French laws attempt to regulate the use of these tools. Leaving aside the well-identified problems of data confidentiality and impediments to competition, we focus on the risks of discrimination, the problems of transparency and the quality of algorithmic decisions. The detailed perspective of the legal texts, faced with the complexity and opacity of the learning algorithms, reveals the need for important technological disruptions for the detection or reduction of the discrimination risk, and for addressing the right to obtain an explanation of the auto- matic decision. Since trust of the developers and above all of the users (citizens, litigants, customers) is essential, algorithms exploiting personal data must be deployed in a strict ethical framework. In conclusion, to answer this need, we list some ways of controls to be developed: institutional control,...
more | pdf | html
###### Tweets
ZelrosAI: Can everyday AI be ethical? Fairness of Machine Learning Algorithms #XAI https://t.co/vjdLbpaOaC https://t.co/Gc4zHigiC9
arxivml: "Can everyday AI be ethical． Fairness of Machine Learning Algorithms", Philippe Besse, Celine Castets-Renard, Aurel… https://t.co/4ZDyiQQa8k
SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
unelmaplatforms: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/HjJ6nLXBoy
Pvalsfr: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
ThiboNeveu: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
sussenglish: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
officialAAAC: RT @SciFi: Can everyday AI be ethical. Fairness of Machine Learning Algorithms. https://t.co/Mc7FsTvidX
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 12229
Unqiue Words: 3833

##### #2. Benchmarking in cluster analysis: A white paper
###### Anne-Laure Boulesteix, Rainer Dangl, Nema Dean, Isabelle Guyon, Christian Hennig, Friedrich Leisch, Douglas Steinley, Iven Van Mechelen
To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance. This means that proposals of new methods of data pre-processing, new data-analytic techniques, and new methods of output post-processing, should be extensively and carefully compared with existing alternatives, and that existing methods should be subjected to neutral comparison studies. To date, benchmarking and recommendations for benchmarking have been frequently seen in the context of supervised learning. Unfortunately, there has been a dearth of guidelines for benchmarking in an unsupervised setting, with the area of clustering as an important subdomain. To address this problem, discussion is given to the theoretical conceptual underpinnings of benchmarking in the field of cluster analysis by means of simulated as well as empirical data. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made.
more | pdf | html
None.
###### Tweets
StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
kieranrcampbell: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
cartalop: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
oozingslack: RT @StatsPapers: Benchmarking in cluster analysis: A white paper. https://t.co/YhMV32jdIB
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 8
Total Words: 10895
Unqiue Words: 2753

##### #3. I can see clearly now: reinterpreting statistical significance
###### Jonathan Dushoff, Morgan P. Kain, Benjamin M. Bolker
Null hypothesis significance testing remains popular despite decades of concern about misuse and misinterpretation. We believe that much of the problem is due to language: significance testing has little to do with other meanings of the word "significance". Despite the limitations of null-hypothesis tests, we argue here that they remain useful in many contexts as a guide to whether a certain effect can be seen clearly in that context (e.g. whether we can clearly see that a correlation or between-group difference is positive or negative). We therefore suggest that researchers describe the conclusions of null-hypothesis tests in terms of statistical "clarity" rather than statistical "significance". This simple semantic change could substantially enhance clarity in statistical communication.
more | pdf | html
None.
###### Tweets
alexvespi: I can see clearly now: reinterpreting statistical significance suggestion: “researchers describe the conclusions of null-hypothesis tests in terms of statistical "clarity" rather than statistical "significance"” https://t.co/SLfygFRXmN https://t.co/seV9paaXVS
jebyrnes: Really intrigued/on board with this proposal to talk about statistical clarity instead of significance - e.g. "There was no clear difference between groups" - https://t.co/9wbhQrs8dt - from @bolkerb and others.
noamross: Interesting paper! @jd_mathbio, @MPKain + @bolkerb argue for using "statistically clear" over "statistically significant" to describe the results of null hypothesis testing. "I can see clearly now: reinterpreting statistical significance", on @arxiv_org https://t.co/QJD9HacB7H https://t.co/udjqlXWbwP
ingorohlfing: suggest researchers describe #NHST in terms of statistical “clarity” rather than “significance” https://t.co/zm6Uxck5MU Don't think it would change a thing https://t.co/5FKo9uEgm2
zerdeve: i was tempted to snark but i’ll resist. however semantics is the least of our worries here (if it is at all). besides, “marginally clear” is even easier to say than “marginally significant”. https://t.co/bHnuXJ1Lvf https://t.co/ciRRH111uL
ruth_baker: Very interesting talk by @jd_mathbio - I can see clearly now: reinterpreting statistical significance: https://t.co/ASL4SHxEHv #StatisticalClarity #birsmath
jillagal: 3rd day at #birsmath and Jonathan Dushoff talks about how the language around p values and statistical significance is misleading #statisticalClarity https://t.co/BvXtpj9vLK
pf_mg: Jonathan Dushoff arguing we should use #StatisticalClarity instead of statistical significance. https://t.co/zEmvKuL5ye Reminds me of the famous Camus quote ! https://t.co/XrUhxiq2wu
CodyJDey: "The language of “statistical clarity” could help researchers escape various logical traps while interpreting the results of null hypothesis significance testing" https://t.co/dMaOeXy9z6 @bolkerb @jd_mathbio https://t.co/XcvA4SNlmy
MichaelPlankNZ: I can see clearly now: reinterpreting statistical significance by Jonathan Dushoff et al https://t.co/CqlU6Wcijo #birsmath #statisticalClarity
anhsmith: Should we say "statistically unclear" instead of "statistically insignificant" for a p-value &gt; 0.05? It could help avoid a common misinterpretation: evidence that the effect is negligible. An interesting suggestion by @jd_mathbio @bolkerb https://t.co/HtYt0x3ywL
StatsPapers: I can see clearly now: reinterpreting statistical significance. https://t.co/pMmdwS7HLt
Sri_Rad15: @nntaleb your ideas have begun to fructify to some extent https://t.co/dgKNDcrrNf
jwalkrunski: A good conversation starter. Can we say “the effect is trending toward statistically clear (p=0.069)?” 😀[1810.06387] I can see clearly now: reinterpreting statistical significance https://t.co/1q0uzGlFwi
MisVoces: RT @StatsPapers: I can see clearly now: reinterpreting statistical significance. https://t.co/pMmdwS7HLt
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 2529
Unqiue Words: 1132

##### #4. Complementary Lipschitz continuity results for the distribution of intersections or unions of independent random sets in finite spaces
###### John Klein
We prove that intersections and unions of independent random sets in finite spaces achieve a form of Lipschitz continuity. More precisely, given the distribution of a random set $\Xi$, the function mapping any random set distribution to the distribution of its intersection (under independence assumption) with $\Xi$ is Lipschitz continuous with unit Lipschitz constant if the space of random set distributions is endowed with a metric defined as the $L_k$ norm distance between inclusion functionals also known as commonalities. Moreover, the function mapping any random set distribution to the distribution of its union (under independence assumption) with $\Xi$ is Lipschitz continuous with unit Lipschitz constant if the space of random set distributions is endowed with a metric defined as the $L_k$ norm distance between hitting functionals also known as plausibilities. Using the epistemic random set interpretation of belief functions, we also discuss the ability of these distances to yield conflict measures. All the proofs in this...
more | pdf | html
None.
###### Tweets
StatsPapers: Complementary Lipschitz continuity results for the distribution of intersections or unions of independent random sets in finite spaces. https://t.co/oKANb1d4hs
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 11062
Unqiue Words: 2306

##### #5. On the mathematics of the free-choice paradigm
###### Peter Selinger, Kristopher Tapp
Chen and Risen pointed out a logical flaw affecting the conclusions of a number of past experiments that used the free-choice paradigm to measure choice-induced attitude change. They went on to design and implement a free-choice experiment that used a novel type of control group in order to avoid this logical pitfall. In this paper, we describe a method by which a free-choice experiment can be correctly conducted even without a control group.
more | pdf | html
None.
###### Tweets
StatsPapers: On the mathematics of the free-choice paradigm. https://t.co/kTXILShfmF
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 7463
Unqiue Words: 2027

##### #6. Allocations of Standby Redundancies to Coherent Systems with Dependent Components
###### Yiying Zhang
In the context of industrial engineering, standby allocation strategy is usually adopted by engineers to improve the lifetimes of coherent systems. This paper investigates the optimal allocation strategies of standby redundancies for coherent systems comprised of dependent components having left tail weakly stochastic arrangement increasing or right tail weakly stochastic arrangement increasing lifetimes. For the case of independent matched heterogeneous standby redundancies, it is proved that the better redundancy should be put in the node with weaker[better] component in a series[parallel] system. For the case of independent homogeneous standby redundancies, it is shown that more redundancies should be put in standby with weaker[better] component to improve the lifetime of a series[parallel] system. The results developed here generalize and extend those related ones in the literature to the case of dependent components. Numerical examples are presented to provide guidances for practical use of our theoretical findings....
more | pdf | html
None.
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 9699
Unqiue Words: 1826

##### #7. Data scraping, ingestation, and modeling: bringing data from cars.com into the intro stats class
###### Sarah McDonald, Nicholas Jon Horton
New tools have made it much easier for students to develop skills to work with interesting data sets as they begin to extract meaning from data. To fully appreciate the statistical analysis cycle, students benefit from repeated experiences collecting, ingesting, wrangling, analyzing data and communicating results. How can we bring such opportunities into the classroom? We describe a classroom activity, originally developed by Danny Kaplan (Macalester College), in which students can expand upon statistical problem solving by hand-scraping data from cars.com, ingesting these data into R, then carrying out analyses of the relationships between price, mileage, and model year for a selected type of car.
more | pdf | html
###### Github

Cars.com scraping and multivariate analysis CAUSE activity webinar

Repository: Cars-Scraping-Webinar
User: Amherst-Statistics
Language: TeX
Stargazers: 0
Subscribers: 3
Forks: 0
Open Issues: 0
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 2514
Unqiue Words: 1142

##### #8. A Conversation with Jon Wellner
###### Moulinath Banerjee, Richard J. Samworth
Jon August Wellner was born in Portland, Oregon, in August 1945. He received his Bachelor's degree from the University of Idaho in 1968 and his PhD degree from the University of Washington in 1975. From 1975 until 1983 he was an Assistant Professor and Associate Professor at the University of Rochester. In 1983 he returned to the University of Washington, and has remained at the UW as a faculty member since that time. Over the course of a long and distinguished career, Jon has made seminal contributions to a variety of areas including empirical processes, semiparametric theory, and shape-constrained inference, and has co-authored a number of extremely influential books. He has been honored as the Le Cam lecturer by both the IMS (2015) and the French Statistical Society (2017). He is a Fellow of the IMS, the ASA, and the AAAS, and an elected member of the International Statistical Institute. He has served as co-Editor of Annals of Statistics (2001--2003) and Editor of Statistical Science (2010--2013), and President of IMS...
more | pdf | html
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 10575
Unqiue Words: 3009

##### #9. Game time: statistical contests in the classroom
###### Sam Doerken, Martin Schumacher, Franz Baumdicker
We describe a contest in variable selection which was part of a statistics course for graduate students. In particular, the possibility to create a contest themselves offered an additional challenge for more advanced students. Since working with data is becoming more important in teaching statistics, we greatly encourage other instructors to try the same.
more | pdf | html
None.
###### Tweets
StatsPapers: Game time: statistical contests in the classroom. https://t.co/moZF1cnMwS
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 3264
Unqiue Words: 1302

##### #10. Perspective from the Literature on the Role of Expert Judgment in Scientific and Statistical Research and Practice
###### Naomi C Brownstein
This article, produced as a result of the Symposium on Statistical Inference, is an introduction to the literature on the function of expertise, judgment, and choice in the practice of statistics and scientific research. In particular, expert judgment plays a critical role in conducting Frequentist hypothesis tests and Bayesian models, especially in selection of appropriate prior distributions for model parameters. The subtlety of interpreting results is also discussed. Finally, external recommendations are collected for how to more effectively encourage proper use of judgment in statistics. The paper synthesizes the literature for the purpose of creating a single reference and inciting more productive discussions on how to improve the future of statistics and science.
more | pdf | html
None.
###### Tweets
StatsPapers: Perspective from the Literature on the Role of Expert Judgment in Scientific and Statistical Research and Practice. https://t.co/XusFQIkr4H
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 12020
Unqiue Words: 3845

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 72,995 papers.