Top 10 Arxiv Papers Today in Machine Learning


2.643 Mikeys
#1. Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem
Christian Schroeder de Witt, Thomas Hornigold
As global greenhouse gas emissions continue to rise, the use of stratospheric aerosol injection (SAI), a form of solar geoengineering, is increasingly considered in order to artificially mitigate climate change effects. However, initial research in simulation suggests that naive SAI can have catastrophic regional consequences, which may induce serious geostrategic conflicts. Current geo-engineering research treats SAI control in low-dimensional approximation only. We suggest treating SAI as a high-dimensional control problem, with policies trained according to a context-sensitive reward function within the Deep Reinforcement Learning (DRL) paradigm. In order to facilitate training in simulation, we suggest to emulate HadCM3, a widely used General Circulation Model, using deep learning techniques. We believe this is the first application of DRL to the climate sciences.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/S33lznkoxi
geschichtenpost: #ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu
arxivml: "Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem", Christian Schroeder de Witt, Thomas Hor… https://t.co/gkzXfEsBx4
kushnerbomb: for all the talk of skynet or whatever i think the actual way artificial intelligence kills us is a fucking moron billionaire (sam a****n probably) gets it in his head to use deep RL for geoengineering and doesn't initialize the network exactly right https://t.co/SQxkp6Wqjm
arxiv_cs_LG: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/5Jl0QPRUrj
StatsPapers: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60
JohnSam57668631: RT @StatsPapers: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60
JohnSam57668631: RT @geschichtenpost: #ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.61 Mikeys
#2. POPQORN: Quantifying Robustness of Recurrent Neural Networks
Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, Dahua Lin
The vulnerability to adversarial attacks has been a critical issue for deep neural networks. Addressing this issue requires a reliable way to evaluate the robustness of a network. Recently, several methods have been developed to compute $\textit{robustness quantification}$ for neural networks, namely, certified lower bounds of the minimum adversarial perturbation. Such methods, however, were devised for feed-forward networks, e.g. multi-layer perceptron or convolutional networks. It remains an open problem to quantify robustness for recurrent networks, especially LSTM and GRU. For such networks, there exist additional challenges in computing the robustness quantification, such as handling the inputs at multiple steps and the interaction between gates and states. In this work, we propose $\textit{POPQORN}$ ($\textbf{P}$ropagated-$\textbf{o}$ut$\textbf{p}$ut $\textbf{Q}$uantified R$\textbf{o}$bustness for $\textbf{RN}$Ns), a general algorithm to quantify robustness of RNNs, including vanilla RNNs, LSTMs, and GRUs. We demonstrate...
more | pdf | html
Figures
None.
Tweets
BrundageBot: POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/bAmWwtXYOs
arxivml: "POPQORN: Quantifying Robustness of Recurrent Neural Networks", Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Dan… https://t.co/is6Rbw9K6h
arxiv_cs_LG: POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/JV6yfcLPbj
StatsPapers: POPQORN: Quantifying Robustness of Recurrent Neural Networks. https://t.co/1joHMvKfl6
arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ
arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqH0Ez
sei_shinagawa: RT @arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

2.428 Mikeys
#3. DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence
Edvinas Byla, Wei Pang
In this paper we propose DeepSwarm, a novel neural architecture search (NAS) method based on Swarm Intelligence principles. At its core DeepSwarm uses Ant Colony Optimization (ACO) to generate ant population which uses the pheromone information to collectively search for the best neural architecture. Furthermore, by using local and global pheromone update rules our method ensures the balance between exploitation and exploration. On top of this, to make our method more efficient we combine progressive neural architecture search with weight reusability. Furthermore, due to the nature of ACO our method can incorporate heuristic information which can further speed up the search process. After systematic and extensive evaluation, we discover that on three different datasets (MNIST, Fashion-MNIST, and CIFAR-10) when compared to existing systems our proposed method demonstrates competitive performance. Finally, we open source DeepSwarm as a NAS library and hope it can be used by more deep learning researchers and practitioners.
more | pdf | html
Figures
None.
Tweets
BrundageBot: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/qkum9pgggQ
arxivml: "DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence", Edvinas Byla, Wei Pang https://t.co/sGND1bEupX
arxiv_cs_LG: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/0Bw1G2FUlO
StatsPapers: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9
cd_fuller: RT @StatsPapers: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.408 Mikeys
#4. Contrastive Fairness in Machine Learning
Tapabrata Chakraborti, Arijit Patra, Alison Noble
We present contrastive fairness, a new direction in causal inference applied to algorithmic fairness. Earlier methods dealt with the "what if?" question (counterfactual fairness, NeurIPS'17). We establish the theoretical and mathematical implications of the contrastive question "why this and not that?" in context of algorithmic fairness in machine learning. This is essential to defend the fairness of algorithmic decisions in tasks where a person or sub-group of people is chosen over another (job recruitment, university admission, company layovers, etc). This development is also helpful to institutions to ensure or defend the fairness of their automated decision making processes. A test case of employee job location allocation is provided as an illustrative example.
more | pdf | html
Figures
None.
Tweets
arxivml: "Contrastive Fairness in Machine Learning", Tapabrata Chakraborti, Arijit Patra, Alison Noble https://t.co/Pq5X3byd1c
arxiv_cs_LG: Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr
StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
SantchiWeb: RT @arxiv_cs_LG: Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr
cd_fuller: RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
minsuk_chang: RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.37 Mikeys
#5. EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices
Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane
In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring. As a result, an increasing number of AI-enabled applications are being developed targeting ubiquitous and mobile devices. While deep neural networks (DNNs) are getting bigger and more complex, they also impose a heavy computational and energy burden on the host devices, which has led to the integration of various specialized processors in commodity devices. Given the broad range of competing DNN architectures and the heterogeneity of the target hardware, there is an emerging need to understand the compatibility between DNN-platform pairs and the expected performance benefits on each platform. This work attempts to demystify this landscape by systematically evaluating a collection of state-of-the-art DNNs on a wide variety of commodity devices. In this respect, we identify potential bottlenecks in each architecture and provide important guidelines...
more | pdf | html
Figures
None.
Tweets
BrundageBot: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/ECtyjhRztD
arxivml: "EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices", Mario Almeid… https://t.co/t3xuZNbsIT
arxiv_cs_LG: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/CCMV6vn30G
StatsPapers: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. https://t.co/bTHxpUucIF
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 5230
Unqiue Words: 2017

2.37 Mikeys
#6. Stochastically Dominant Distributional Reinforcement Learning
John D. Martin, Michal Lyskawinski, Xiaohu Li, Brendan Englot
We describe a new approach for mitigating risk in the Reinforcement Learning paradigm. Instead of reasoning about expected utility, we use second-order stochastic dominance (SSD) to directly compare the inherent risk of random returns induced by different actions. We frame the RL optimization within the space of probability measures to accommodate the SSD relation, treating Bellman's equation as a potential energy functional. This brings us to Wasserstein gradient flows, for which the optimality and convergence are well understood. We propose a discrete-measure approximation algorithm called the Dominant Particle Agent (DPA), and we demonstrate how safety and performance are better balanced with DPA than with existing baselines.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/C2ReFuPcH2
arxivml: "Stochastically Dominant Distributional Reinforcement Learning", John D. Martin, Michal Lyskawinski, Xiaohu Li, Bre… https://t.co/rOZZkuyawi
arxiv_cs_LG: Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/99JjpBIIOy
StatsPapers: Stochastically Dominant Distributional Reinforcement Learning. https://t.co/FJJKfWghJV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.37 Mikeys
#7. Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces
Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, Gerhard Neumann
In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors. We propose a new deep approach to Kalman filtering which can be learned directly in an end-to-end manner using backpropagation without additional approximations. Our approach uses a high-dimensional factorized latent state representation for which the Kalman updates simplify to scalar operations and thus avoids hard to backpropagate, computationally heavy and potentially unstable matrix inversions. Moreover, we use locally linear dynamic models to efficiently propagate the latent state to the next time step. The resulting network architecture, which we call Recurrent Kalman Network (RKN), can be used for any time-series data, similar to a LSTM (Hochreiter & Schmidhuber, 1997)...
more | pdf | html
Figures
None.
Tweets
BrundageBot: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/Avte1erREq
arxivml: "Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces", Philipp Becker, Harit Pa… https://t.co/kpW4JNdzfT
arxiv_cs_LG: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/4OFctcdu4Y
StatsPapers: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. https://t.co/BdMeiN2R3c
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

2.37 Mikeys
#8. Weakly-Supervised Temporal Localization via Occurrence Count Learning
Julien Schroeter, Kirill Sidorov, David Marshall
We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise and time-consuming process of annotating event locations in temporal data. Unlike existing methods, in which localization is explicitly achieved by design, our model learns localization implicitly as a byproduct of learning to count instances. This unique feature is a direct consequence of the model's theoretical properties. We validate the effectiveness of our approach in a number of experiments (drum hit and piano onset detection in audio, digit detection in images) and demonstrate performance comparable to that of fully-supervised state-of-the-art methods, despite much weaker training requirements.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/9NBB7OiEI0
arxivml: "Weakly-Supervised Temporal Localization via Occurrence Count Learning", Julien Schroeter, Kirill Sidorov, David Ma… https://t.co/UvJPJnTePH
arxiv_cs_LG: Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/QolU8zIL46
StatsPapers: Weakly-Supervised Temporal Localization via Occurrence Count Learning. https://t.co/s3f4saAhcM
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.335 Mikeys
#9. Integer Discrete Flows and Lossless Compression
Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, Max Welling
Lossless compression methods shorten the expected representation size of data without loss of information, using a statistical model. Flow-based models are attractive in this setting because they admit exact likelihood optimization, which is equivalent to minimizing the expected number of bits per message. However, conventional flows assume continuous data, which may lead to reconstruction errors when quantized for compression. For that reason, we introduce a generative flow for ordinal discrete data called Integer Discrete Flow (IDF): a bijective integer map that can learn rich transformations on high-dimensional data. As building blocks for IDFs, we introduce flexible transformation layers called integer discrete coupling and lower triangular coupling. Our experiments show that IDFs are competitive with other flow-based generative models. Furthermore, we demonstrate that IDF based compression achieves state-of-the-art lossless compression rates on CIFAR10, ImageNet32, and ImageNet64.
more | pdf | html
Figures
Tweets
arxivml: "Integer Discrete Flows and Lossless Compression", Emiel Hoogeboom, Jorn W.T. Peters, Rianne van den Berg, Max Well… https://t.co/E0ZiE9UZdB
arxiv_cs_LG: Integer Discrete Flows and Lossless Compression. Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, and Max Welling https://t.co/GKWwCGi5zY
StatsPapers: Integer Discrete Flows and Lossless Compression. https://t.co/nyTFsG7aP7
arxiv_cscv: Integer Discrete Flows and Lossless Compression https://t.co/WJMrvplprf
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 5137
Unqiue Words: 1940

2.297 Mikeys
#10. TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan
Off-policy reinforcement learning with eligibility traces is challenging because of the discrepancy between target policy and behavior policy. One common approach is to measure the difference between two policies in a probabilistic way, such as importance sampling and tree-backup. However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems. The traces are cut immediately when a non-greedy action is taken, which may lose the advantage of eligibility traces and slow down the learning process. Alternatively, some non-probabilistic measurement methods such as General Q($\lambda$) and Naive Q($\lambda$) never cut traces, but face convergence problems in practice. To address the above issues, this paper introduces a new method named TBQ($\sigma$), which effectively unifies the tree-backup algorithm and Naive Q($\lambda$). By introducing a new parameter $\sigma$ to illustrate the \emph{degree}...
more | pdf | html
Figures
None.
Tweets
SciFi: TBQ($\sigma$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. https://t.co/XD08R2M2pe
BrundageBot: TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/tvdMAkkIp9
arxivml: "TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning", Longxiang Shi, Shijian… https://t.co/sz3zSsxSkm
arxiv_cs_LG: TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/a2f9UuEdOc
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 128,326 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 128,326 papers.