Top 10 Arxiv Papers Today in Machine Learning


2.247 Mikeys
#1. Neural Tangents: Fast and Easy Infinite Neural Networks in Python
Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon Lee, Alexander A. Alemi, Jascha Sohl-Dickstein, Samuel S. Schoenholz
Neural Tangents is a library designed to enable research into infinite-width neural networks. It provides a high-level API for specifying complex and hierarchical neural network architectures. These networks can then be trained and evaluated either at finite-width as usual or in their infinite-width limit. Infinite-width networks can be trained analytically using exact Bayesian inference or using gradient descent via the Neural Tangent Kernel. Additionally, Neural Tangents provides tools to study gradient descent training dynamics of wide but finite networks in either function space or weight space. The entire library runs out-of-the-box on CPU, GPU, or TPU. All computations can be automatically distributed over multiple accelerators with near-linear scaling in the number of devices. Neural Tangents is available at www.github.com/google/neural-tangents. We also provide an accompanying interactive Colab notebook.
more | pdf | html
Figures
None.
Tweets
hardmaru: Neural Tangents is a Python library designed to enable research into “infinite-width” neural networks. They provide an API for specifying complex neural network architectures that can then be trained and evaluated in their infinite-width limit. 🙉🤯 https://t.co/Wr2SqlMOwA https://t.co/vAXC02pAs8
Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #NeuralNetworks #Python https://t.co/JuzLIh5kxy
jaschasd: Paper: https://t.co/617vP1bttE Github: https://t.co/fZxNUwBRer Colab Notebook: https://t.co/UwXvlLRpwZ
sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt Github: https://t.co/iutzkEhEOM Colab Notebook: https://t.co/JcxUkWwJ0h
arxivml: "Neural Tangents: Fast and Easy Infinite Neural Networks in Python", Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon L… https://t.co/VgglRsifeV
arxiv_cs_LG: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon Lee, Alexander A. Alemi, Jascha Sohl-Dickstein, and Samuel S. Schoenholz https://t.co/owrRWgj38l
ceobillionaire: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ceobillionaire: RT @Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #N…
ericjang11: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
brandondamos: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
SingularMattrix: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ballforest: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ballforest: RT @StatsPapers: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. https://t.co/yi2WBveWY2
eigenhector: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
LiamFedus: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ayirpelle: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
TheGregYang: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
EricSchles: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
superbradyon: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
all2one: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
tak_yamm: RT @StatsPapers: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. https://t.co/yi2WBveWY2
deepgradient: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
muktabh: RT @StatsPapers: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. https://t.co/yi2WBveWY2
geoffroeder: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
jhhhuggins: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
__tmats__: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
KouroshMeshgi: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
puneethmishra: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
amlankar95: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
tmasada: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
jrugelesuribe: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
dolhani: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
__nggih: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
harujoh: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
desipoika: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
BoakyeTweets: RT @Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #N…
Daniel_J_Im: RT @jaschasd: Paper: https://t.co/617vP1bttE Github: https://t.co/fZxNUwBRer Colab Notebook: https://t.co/UwXvlLRpwZ
BoakyeTweets: RT @Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #N…
MarcoZorzi: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
_powei: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
hadisalmanX: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
HydryHydra: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
kadarakos: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
westis96: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
iugoaoj: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
MarkTan57229491: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
manuel_lmartin: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
tequehead: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
caseychu9: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ShirotaShin: RT @StatsPapers: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. https://t.co/yi2WBveWY2
jainnitk: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
DataGeekArun: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
karnadi_1: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
jastner109: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
phi_nate: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
moKhabb: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
amarotaylorw: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
gshartnett: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
brunoboutteau: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
crjaensch: RT @Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #N…
namhoonlee09: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
tristanasharp: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
TinDan_: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
shimoke4869: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
alfo_512: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
ndrmnl: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
Bill_Hally: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
BalcellsD: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
TakaAomidoro: RT @StatsPapers: Neural Tangents: Fast and Easy Infinite Neural Networks in Python. https://t.co/yi2WBveWY2
dyitry: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
Wind_Xiaoli: RT @Montreal_AI: Neural Tangents: Fast and Easy Infinite Neural Networks in Python Novak et al.: https://t.co/bt7WzIoihH #DeepLearning #N…
shivamsaboo17: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
UPPALANSHUK: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
FallintTree: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
dave_co_dev: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
IgiArd: RT @sschoenholz: After a ton of work by a bunch of people, we're releasing an entirely new Neural Tangents. Paper: https://t.co/2KqBv44KJt…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 0
Unqiue Words: 0

2.083 Mikeys
#2. AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan
Modern deep neural networks can achieve high accuracy when the training distribution and test distribution are identically distributed, but this assumption is frequently violated in practice. When the train and test distributions are mismatched, accuracy can plummet. Currently there are few techniques that improve robustness to unforeseen data shifts encountered during deployment. In this work, we propose a technique to improve the robustness and uncertainty estimates of image classifiers. We propose AugMix, a data processing technique that is simple to implement, adds limited computational overhead, and helps models withstand unforeseen corruptions. AugMix significantly improves robustness and uncertainty measures on challenging image classification benchmarks, closing the gap between previous methods and the best possible performance in some cases by more than half.
more | pdf | html
Figures
Tweets
CarlRioux: [R] AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty: Paper: https://t.co/XDp5PukD6w Code: https://t.co/23Yh3ncwm8 We propose AugMix, a data processing technique that mixes augmented images and enforces consistent embeddings… https://t.co/WXDm7Peg7k
arxivml: "AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty", Dan Hendrycks, Norman Mu, Ekin D. … https://t.co/TlUdca9lhA
arxiv_cs_LG: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, and Balaji Lakshminarayanan https://t.co/Yq1BzDhKBY
StatsPapers: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. https://t.co/Au2myvfICS
arxiv_cs_cv_pr: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, and Balaji Lakshminarayanan https://t.co/f68O5xKqqt
arxiv_cscv: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty https://t.co/g4sdqZdOl9
arxiv_cscv: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty https://t.co/g4sdqZdOl9
hitoriblog: RT @TheNormanMu: @balajiln @DanHendrycks @ekindogus @barret_zoph @jmgilmer Code: https://t.co/ZR0fS5MC6w Paper: https://t.co/I7NOfHRP1v
yu4u: RT @TheNormanMu: @balajiln @DanHendrycks @ekindogus @barret_zoph @jmgilmer Code: https://t.co/ZR0fS5MC6w Paper: https://t.co/I7NOfHRP1v
nickschurch: RT @StatsPapers: AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. https://t.co/Au2myvfICS
shinmura0: RT @TheNormanMu: @balajiln @DanHendrycks @ekindogus @barret_zoph @jmgilmer Code: https://t.co/ZR0fS5MC6w Paper: https://t.co/I7NOfHRP1v
matsuko_std: RT @TheNormanMu: @balajiln @DanHendrycks @ekindogus @barret_zoph @jmgilmer Code: https://t.co/ZR0fS5MC6w Paper: https://t.co/I7NOfHRP1v
Github

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

Repository: augmix
User: google-research
Language: Python
Stargazers: 79
Subscribers: 7
Forks: 2
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 6732
Unqiue Words: 2342

2.078 Mikeys
#3. Normalizing Flows for Probabilistic Modeling and Inference
George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, Balaji Lakshminarayanan
Normalizing flows provide a general mechanism for defining expressive probability distributions, only requiring the specification of a (usually simple) base distribution and a series of bijective transformations. There has been much recent work on normalizing flows, ranging from improving their expressive power to expanding their application. We believe the field has now matured and is in need of a unified perspective. In this review, we attempt to provide such a perspective by describing flows through the lens of probabilistic modeling and inference. We place special emphasis on the fundamental principles of flow design, and discuss foundational topics such as expressive power and computational trade-offs. We also broaden the conceptual framing of flows by relating them to more general probability transformations. Lastly, we summarize the use of flows for tasks such as generative modeling, approximate inference, and supervised learning.
more | pdf | html
Figures
None.
Tweets
DeepSpiker: Looking for something to read in your flight to #NeurIPS2019? Read about Normalizing Flows from our extensive review paper (also with new insights on how to think about and derive new flows) https://t.co/cPjQjZn3uf with @gpapamak @eric_nalisnick @DeepSpiker @balajiln @shakir_za https://t.co/EWh8Aui7n0
gpapamak: Check out our extensive review paper on normalizing flows! This paper is the product of years of thinking about flows: it contains everything we know about them, and many new insights. With @eric_nalisnick, @DeepSpiker, @shakir_za, @balajiln. https://t.co/BBymd1uSwx Thread 👇 https://t.co/er8QebcPS2
arxivml: "Normalizing Flows for Probabilistic Modeling and Inference", George Papamakarios, Eric Nalisnick, Danilo Jimenez R… https://t.co/gbvVIxPwuo
reddit_ml: [1912.02762] Normalizing Flows for Probabilistic Modeling and Inference https://t.co/nwoO44Zs2I
arxiv_cs_LG: Normalizing Flows for Probabilistic Modeling and Inference. George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, and Balaji Lakshminarayanan https://t.co/TQmlbVp0Je
hereticreader: Normalizing Flows for Probabilistic Modeling and Inference - https://t.co/9D1COPFWPY https://t.co/NqFK3hewOc
StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
ari_seff: And for an extensive review, check out the just-released "Normalizing Flows for Probabilistic Modeling and Inference" (https://t.co/iAOCgNh7Ch) from @gpapamak @eric_nalisnick @DeepSpiker @balajiln @shakir_za
ballforest: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
jd_mashiro: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
mxwlj: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
tak_yamm: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
morioka: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
ShirotaShin: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
GiseopK: RT @StatsPapers: Normalizing Flows for Probabilistic Modeling and Inference. https://t.co/Hc0w5Bx6yR
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.054 Mikeys
#4. Deep Ensembles: A Loss Landscape Perspective
Stanislav Fort, Huiyi Hu, Balaji Lakshminarayanan
Deep ensembles have been empirically shown to be a promising approach for improving accuracy, uncertainty and out-of-distribution robustness of deep learning models. While deep ensembles were theoretically motivated by the bootstrap, non-bootstrap ensembles trained with just random initialization also perform well in practice, which suggests that there could be other explanations for why deep ensembles work well. Bayesian neural networks, which learn distributions over the parameters of the network, are theoretically well-motivated by Bayesian principles, but do not perform as well as deep ensembles in practice, particularly under dataset shift. One possible explanation for this gap between theory and practice is that popular scalable approximate Bayesian methods tend to focus on a single mode, whereas deep ensembles tend to explore diverse modes in function space. We investigate this hypothesis by building on recent work on understanding the loss landscape of neural networks and adding our own exploration to measure the...
more | pdf | html
Figures
Tweets
balajiln: @stanislavfort If you'd like to learn more, check out our paper https://t.co/pnvqezb7a9 :) @stanislavfort will also be giving a contributed talk about our work on Dec 13 (Friday) 9-915 AM and presenting a poster at the Bayesian deep learning workshop (https://t.co/OyPfyWua8Z) at #NeurIPS2019 https://t.co/NrpnmTNlDv
balajiln: Why do deep ensembles trained with just random initialization work surprisingly well in practice?  In our recent paper https://t.co/pnvqezb7a9 with @stanislavfort & Huiyi Hu, we investigate this by using insights from recent work on loss landscape of neural nets.  More below:
arxivml: "Deep Ensembles: A Loss Landscape Perspective", Stanislav Fort, Huiyi Hu, Balaji Lakshminarayanan https://t.co/8M3S3xpC2R
stanislavfort: Our newest work /Deep Ensembles: A Loss Landscape Perspective/ on connecting neural network loss landscapes, Bayesian approaches, and ensembling https://t.co/dIJCZg4iQQ. Joint effort with the amazing @balajiln and Huiyi Hu from @DeepMindAI done during my @GoogleAI Residency. https://t.co/BqwfcGLwF1 https://t.co/LrUG0VKf3X
arxiv_cs_LG: Deep Ensembles: A Loss Landscape Perspective. Stanislav Fort, Huiyi Hu, and Balaji Lakshminarayanan https://t.co/yQL4E8PFXT
StatsPapers: Deep Ensembles: A Loss Landscape Perspective. https://t.co/nenHXm7APq
stanislavfort: RT @balajiln: @stanislavfort If you'd like to learn more, check out our paper https://t.co/pnvqezb7a9 :) @stanislavfort will also be givin…
MarkTan57229491: RT @balajiln: @stanislavfort If you'd like to learn more, check out our paper https://t.co/pnvqezb7a9 :) @stanislavfort will also be givin…
ShirotaShin: RT @StatsPapers: Deep Ensembles: A Loss Landscape Perspective. https://t.co/nenHXm7APq
_dongkwan_kim: RT @balajiln: @stanislavfort If you'd like to learn more, check out our paper https://t.co/pnvqezb7a9 :) @stanislavfort will also be givin…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 5478
Unqiue Words: 1783

2.026 Mikeys
#5. Label-Consistent Backdoor Attacks
Alexander Turner, Dimitris Tsipras, Aleksander Madry
Deep neural networks have been demonstrated to be vulnerable to backdoor attacks. Specifically, by injecting a small number of maliciously constructed inputs into the training set, an adversary is able to plant a backdoor into the trained model. This backdoor can then be activated during inference by a backdoor trigger to fully control the model's behavior. While such attacks are very effective, they crucially rely on the adversary injecting arbitrary inputs that are---often blatantly---mislabeled. Such samples would raise suspicion upon human inspection, potentially revealing the attack. Thus, for backdoor attacks to remain undetected, it is crucial that they maintain label-consistency---the condition that injected inputs are consistent with their labels. In this work, we leverage adversarial perturbations and generative models to execute efficient, yet label-consistent, backdoor attacks. Our approach is based on injecting inputs that appear plausible, yet are hard to classify, hence causing the model to rely on the...
more | pdf | html
Figures
None.
Tweets
aleks_madry: Can backdoor attacks be successful without using incorrect labels? Yes, you just need to make poisoned inputs harder! Check out our work with @alex_m_turner and @tsiprasd https://t.co/IrxJh5KU6N https://t.co/bSM7y1Pu4B
arxivml: "Label-Consistent Backdoor Attacks", Alexander Turner, Dimitris Tsipras, Aleksander Madry https://t.co/Q9h1CnAJ0D
arxiv_cs_LG: Label-Consistent Backdoor Attacks. Alexander Turner, Dimitris Tsipras, and Aleksander Madry https://t.co/YfL86F2AMX
StatsPapers: Label-Consistent Backdoor Attacks. https://t.co/ZxGzlmPfHy
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.025 Mikeys
#6. Representing Closed Transformation Paths in Encoded Network Latent Space
Marissa Connor, Christopher Rozell
Deep generative networks have been widely used for learning mappings from a low-dimensional latent space to a high-dimensional data space. In many cases, data transformations are defined by linear paths in this latent space. However, the Euclidean structure of the latent space may be a poor match for the underlying latent structure in the data. In this work, we incorporate a generative manifold model into the latent space of an autoencoder in order to learn the low-dimensional manifold structure from the data and adapt the latent space to accommodate this structure. In particular, we focus on applications in which the data has closed transformation paths which extend from a starting point and return to nearly the same point. Through experiments on data with natural closed transformation paths, we show that this model introduces the ability to learn the latent dynamics of complex systems, generate transformation paths, and classify samples that belong on the same transformation path.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Representing Closed Transformation Paths in Encoded Network Latent Space. Marissa Connor and Christopher Rozell https://t.co/HUqimWu17I
arxivml: "Representing Closed Transformation Paths in Encoded Network Latent Space", Marissa Connor, Christopher Rozell https://t.co/quPQVhHaoL
arxiv_cs_LG: Representing Closed Transformation Paths in Encoded Network Latent Space. Marissa Connor and Christopher Rozell https://t.co/c1FMro2yHF
StatsPapers: Representing Closed Transformation Paths in Encoded Network Latent Space. https://t.co/oQlBQVDz8P
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.024 Mikeys
#7. MetaFun: Meta-Learning with Iterative Functional Updates
Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, Yee Whye Teh
Few-shot supervised learning leverages experience from previous learning tasks to solve new tasks where only a few labelled examples are available. One successful line of approach to this problem is to use an encoder-decoder meta-learning pipeline, whereby labelled data in a task is encoded to produce task representation, and this representation is used to condition the decoder to make predictions on unlabelled data. We propose an approach that uses this pipeline with two important features. 1) We use infinite-dimensional functional representations of the task rather than fixed-dimensional representations. 2) We iteratively apply functional updates to the representation. We show that our approach can be interpreted as extending functional gradient descent, and delivers performance that is comparable to or outperforms previous state-of-the-art on few-shot classification benchmarks such as miniImageNet and tieredImageNet.
more | pdf | html
Figures
Tweets
arxivml: "MetaFun: Meta-Learning with Iterative Functional Updates", Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiore… https://t.co/0Q3toQyiJW
arxiv_cs_LG: MetaFun: Meta-Learning with Iterative Functional Updates. Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, and Yee Whye Teh https://t.co/xbD0jZpR0Y
jinxu06: Learning to learn in function space by extending functional gradient descent. We introduce MetaFun, a SOTA meta-learning approach: https://t.co/hwhALseIb6 With @jeanfrancois287 @hyunjik11 @arkosiorek @yeewhye https://t.co/Uc0ezDaFY9
StatsPapers: MetaFun: Meta-Learning with Iterative Functional Updates. https://t.co/HzfCfnBmxP
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 8773
Unqiue Words: 2680

2.019 Mikeys
#8. Causal structure based root cause analysis of outliers
Dominik Janzing, Kailash Budhathoki, Lenon Minorics, Patrick Blöbaum
We describe a formal approach to identify 'root causes' of outliers observed in $n$ variables $X_1,\dots,X_n$ in a scenario where the causal relation between the variables is a known directed acyclic graph (DAG). To this end, we first introduce a systematic way to define outlier scores. Further, we introduce the concept of 'conditional outlier score' which measures whether a value of some variable is unexpected *given the value of its parents* in the DAG, if one were to assume that the causal structure and the corresponding conditional distributions are also valid for the anomaly. Finally, we quantify to what extent the high outlier score of some target variable can be attributed to outliers of its ancestors. This quantification is defined via Shapley values from cooperative game theory.
more | pdf | html
Figures
None.
Tweets
arxivml: "Causal structure based root cause analysis of outliers", Dominik Janzing, Kailash Budhathoki, Lenon Minorics, Patr… https://t.co/Mh3KSqdjVG
arxiv_cs_LG: Causal structure based root cause analysis of outliers. Dominik Janzing, Kailash Budhathoki, Lenon Minorics, and Patrick Blöbaum https://t.co/vSJS29xXKe
StatsPapers: Causal structure based root cause analysis of outliers. https://t.co/z1ZJvrzltE
ml_unam: RT @StatsPapers: Causal structure based root cause analysis of outliers. https://t.co/z1ZJvrzltE
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.019 Mikeys
#9. A sparse negative binomial mixture model for clustering RNA-seq count data
Tanbin Rahman, Yujia Li, Tianzhou Ma, Lu Tang, George Tseng
Clustering with variable selection is a challenging but critical task for modern small-n-large-p data. Existing methods based on Gaussian mixture models or sparse K-means provide solutions to continuous data. With the prevalence of RNA-seq technology and lack of count data modeling for clustering, the current practice is to normalize count expression data into continuous measures and apply existing models with Gaussian assumption. In this paper, we develop a negative binomial mixture model with gene regularization to cluster samples (small $n$) with high-dimensional gene features (large $p$). EM algorithm and Bayesian information criterion are used for inference and determining tuning parameters. The method is compared with sparse Gaussian mixture model and sparse K-means using extensive simulations and two real transcriptomic applications in breast cancer and rat brain studies. The result shows superior performance of the proposed count data model in clustering accuracy, feature selection and biological interpretation by pathway...
more | pdf | html
Figures
None.
Tweets
arxivml: "A sparse negative binomial mixture model for clustering RNA-seq count data", Tanbin Rahman, Yujia Li, Tianzhou Ma,… https://t.co/lQtLC6oivb
arxiv_cs_LG: A sparse negative binomial mixture model for clustering RNA-seq count data. Tanbin Rahman, Yujia Li, Tianzhou Ma, Lu Tang, and George Tseng https://t.co/NRBShOFH1F
StatsPapers: A sparse negative binomial mixture model for clustering RNA-seq count data. https://t.co/o8Ixmh0EHA
Github
Repository: snbClust
User: mdr56
Language: None
Stargazers: 0
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 6242
Unqiue Words: 1823

2.017 Mikeys
#10. Ordinal Bayesian Optimisation
Victor Picheny, Sattar Vakili, Artem Artemev
Bayesian optimisation is a powerful tool to solve expensive black-box problems, but fails when the stationary assumption made on the objective function is strongly violated, which is the case in particular for ill-conditioned or discontinuous objectives. We tackle this problem by proposing a new Bayesian optimisation framework that only considers the ordering of variables, both in the input and output spaces, to fit a Gaussian process in a latent space. By doing so, our approach is agnostic to the original metrics on the original spaces. We propose two algorithms, respectively based on an optimistic strategy and on Thompson sampling. For the optimistic strategy we prove an optimal performance under the measure of regret in the latent space. We illustrate the capability of our framework on several challenging toy problems.
more | pdf | html
Figures
Tweets
arxivml: "Ordinal Bayesian Optimisation", Victor Picheny, Sattar Vakili, Artem Artemev https://t.co/jriXwByJWO
arxiv_cs_LG: Ordinal Bayesian Optimisation. Victor Picheny, Sattar Vakili, and Artem Artemev https://t.co/1RulpX8vM7
StatsPapers: Ordinal Bayesian Optimisation. https://t.co/fz7aMhe8zk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6703
Unqiue Words: 1993

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 234,442 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 234,442 papers.