### Top 10 Arxiv Papers Today in Machine Learning

##### #1. Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem
###### Christian Schroeder de Witt, Thomas Hornigold
As global greenhouse gas emissions continue to rise, the use of stratospheric aerosol injection (SAI), a form of solar geoengineering, is increasingly considered in order to artificially mitigate climate change effects. However, initial research in simulation suggests that naive SAI can have catastrophic regional consequences, which may induce serious geostrategic conflicts. Current geo-engineering research treats SAI control in low-dimensional approximation only. We suggest treating SAI as a high-dimensional control problem, with policies trained according to a context-sensitive reward function within the Deep Reinforcement Learning (DRL) paradigm. In order to facilitate training in simulation, we suggest to emulate HadCM3, a widely used General Circulation Model, using deep learning techniques. We believe this is the first application of DRL to the climate sciences.
more | pdf | html
None.
###### Tweets
BrundageBot: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/S33lznkoxi
geschichtenpost: #ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu
arxivml: "Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem", Christian Schroeder de Witt, Thomas Hor… https://t.co/gkzXfEsBx4
kushnerbomb: for all the talk of skynet or whatever i think the actual way artificial intelligence kills us is a fucking moron billionaire (sam a****n probably) gets it in his head to use deep RL for geoengineering and doesn't initialize the network exactly right https://t.co/SQxkp6Wqjm
arxiv_cs_LG: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/5Jl0QPRUrj
StatsPapers: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60
JohnSam57668631: RT @StatsPapers: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60
JohnSam57668631: RT @geschichtenpost: #ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

##### #2. POPQORN: Quantifying Robustness of Recurrent Neural Networks
###### Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, Dahua Lin
The vulnerability to adversarial attacks has been a critical issue for deep neural networks. Addressing this issue requires a reliable way to evaluate the robustness of a network. Recently, several methods have been developed to compute $\textit{robustness quantification}$ for neural networks, namely, certified lower bounds of the minimum adversarial perturbation. Such methods, however, were devised for feed-forward networks, e.g. multi-layer perceptron or convolutional networks. It remains an open problem to quantify robustness for recurrent networks, especially LSTM and GRU. For such networks, there exist additional challenges in computing the robustness quantification, such as handling the inputs at multiple steps and the interaction between gates and states. In this work, we propose $\textit{POPQORN}$ ($\textbf{P}$ropagated-$\textbf{o}$ut$\textbf{p}$ut $\textbf{Q}$uantified R$\textbf{o}$bustness for $\textbf{RN}$Ns), a general algorithm to quantify robustness of RNNs, including vanilla RNNs, LSTMs, and GRUs. We demonstrate...
more | pdf | html
None.
###### Tweets
BrundageBot: POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/bAmWwtXYOs
arxivml: "POPQORN: Quantifying Robustness of Recurrent Neural Networks", Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Dan… https://t.co/is6Rbw9K6h
arxiv_cs_LG: POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/JV6yfcLPbj
StatsPapers: POPQORN: Quantifying Robustness of Recurrent Neural Networks. https://t.co/1joHMvKfl6
arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ
arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqH0Ez
sei_shinagawa: RT @arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

##### #3. DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence
###### Edvinas Byla, Wei Pang
In this paper we propose DeepSwarm, a novel neural architecture search (NAS) method based on Swarm Intelligence principles. At its core DeepSwarm uses Ant Colony Optimization (ACO) to generate ant population which uses the pheromone information to collectively search for the best neural architecture. Furthermore, by using local and global pheromone update rules our method ensures the balance between exploitation and exploration. On top of this, to make our method more efficient we combine progressive neural architecture search with weight reusability. Furthermore, due to the nature of ACO our method can incorporate heuristic information which can further speed up the search process. After systematic and extensive evaluation, we discover that on three different datasets (MNIST, Fashion-MNIST, and CIFAR-10) when compared to existing systems our proposed method demonstrates competitive performance. Finally, we open source DeepSwarm as a NAS library and hope it can be used by more deep learning researchers and practitioners.
more | pdf | html
None.
###### Tweets
BrundageBot: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/qkum9pgggQ
arxivml: "DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence", Edvinas Byla, Wei Pang https://t.co/sGND1bEupX
arxiv_cs_LG: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/0Bw1G2FUlO
StatsPapers: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9
cd_fuller: RT @StatsPapers: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

##### #4. Contrastive Fairness in Machine Learning
###### Tapabrata Chakraborti, Arijit Patra, Alison Noble
We present contrastive fairness, a new direction in causal inference applied to algorithmic fairness. Earlier methods dealt with the "what if?" question (counterfactual fairness, NeurIPS'17). We establish the theoretical and mathematical implications of the contrastive question "why this and not that?" in context of algorithmic fairness in machine learning. This is essential to defend the fairness of algorithmic decisions in tasks where a person or sub-group of people is chosen over another (job recruitment, university admission, company layovers, etc). This development is also helpful to institutions to ensure or defend the fairness of their automated decision making processes. A test case of employee job location allocation is provided as an illustrative example.
more | pdf | html
None.
###### Tweets
arxivml: "Contrastive Fairness in Machine Learning", Tapabrata Chakraborti, Arijit Patra, Alison Noble https://t.co/Pq5X3byd1c
arxiv_cs_LG: Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr
StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
SantchiWeb: RT @arxiv_cs_LG: Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr
cd_fuller: RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
minsuk_chang: RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

##### #5. EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices
###### Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane
In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring. As a result, an increasing number of AI-enabled applications are being developed targeting ubiquitous and mobile devices. While deep neural networks (DNNs) are getting bigger and more complex, they also impose a heavy computational and energy burden on the host devices, which has led to the integration of various specialized processors in commodity devices. Given the broad range of competing DNN architectures and the heterogeneity of the target hardware, there is an emerging need to understand the compatibility between DNN-platform pairs and the expected performance benefits on each platform. This work attempts to demystify this landscape by systematically evaluating a collection of state-of-the-art DNNs on a wide variety of commodity devices. In this respect, we identify potential bottlenecks in each architecture and provide important guidelines...
more | pdf | html
None.
###### Tweets
BrundageBot: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/ECtyjhRztD
arxivml: "EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices", Mario Almeid… https://t.co/t3xuZNbsIT
arxiv_cs_LG: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/CCMV6vn30G
StatsPapers: EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. https://t.co/bTHxpUucIF
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 5230
Unqiue Words: 2017

##### #6. Stochastically Dominant Distributional Reinforcement Learning
###### John D. Martin, Michal Lyskawinski, Xiaohu Li, Brendan Englot
We describe a new approach for mitigating risk in the Reinforcement Learning paradigm. Instead of reasoning about expected utility, we use second-order stochastic dominance (SSD) to directly compare the inherent risk of random returns induced by different actions. We frame the RL optimization within the space of probability measures to accommodate the SSD relation, treating Bellman's equation as a potential energy functional. This brings us to Wasserstein gradient flows, for which the optimality and convergence are well understood. We propose a discrete-measure approximation algorithm called the Dominant Particle Agent (DPA), and we demonstrate how safety and performance are better balanced with DPA than with existing baselines.
more | pdf | html
None.
###### Tweets
BrundageBot: Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/C2ReFuPcH2
arxivml: "Stochastically Dominant Distributional Reinforcement Learning", John D． Martin, Michal Lyskawinski, Xiaohu Li, Bre… https://t.co/rOZZkuyawi
arxiv_cs_LG: Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/99JjpBIIOy
StatsPapers: Stochastically Dominant Distributional Reinforcement Learning. https://t.co/FJJKfWghJV
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #7. Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces
###### Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, Gerhard Neumann
In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors. We propose a new deep approach to Kalman filtering which can be learned directly in an end-to-end manner using backpropagation without additional approximations. Our approach uses a high-dimensional factorized latent state representation for which the Kalman updates simplify to scalar operations and thus avoids hard to backpropagate, computationally heavy and potentially unstable matrix inversions. Moreover, we use locally linear dynamic models to efficiently propagate the latent state to the next time step. The resulting network architecture, which we call Recurrent Kalman Network (RKN), can be used for any time-series data, similar to a LSTM (Hochreiter & Schmidhuber, 1997)...
more | pdf | html
None.
###### Tweets
BrundageBot: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/Avte1erREq
arxivml: "Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces", Philipp Becker, Harit Pa… https://t.co/kpW4JNdzfT
arxiv_cs_LG: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/4OFctcdu4Y
StatsPapers: Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. https://t.co/BdMeiN2R3c
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

##### #8. Weakly-Supervised Temporal Localization via Occurrence Count Learning
###### Julien Schroeter, Kirill Sidorov, David Marshall
We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise and time-consuming process of annotating event locations in temporal data. Unlike existing methods, in which localization is explicitly achieved by design, our model learns localization implicitly as a byproduct of learning to count instances. This unique feature is a direct consequence of the model's theoretical properties. We validate the effectiveness of our approach in a number of experiments (drum hit and piano onset detection in audio, digit detection in images) and demonstrate performance comparable to that of fully-supervised state-of-the-art methods, despite much weaker training requirements.
more | pdf | html
None.
###### Tweets
BrundageBot: Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/9NBB7OiEI0
arxivml: "Weakly-Supervised Temporal Localization via Occurrence Count Learning", Julien Schroeter, Kirill Sidorov, David Ma… https://t.co/UvJPJnTePH
arxiv_cs_LG: Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/QolU8zIL46
StatsPapers: Weakly-Supervised Temporal Localization via Occurrence Count Learning. https://t.co/s3f4saAhcM
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

##### #9. Integer Discrete Flows and Lossless Compression
###### Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, Max Welling
Lossless compression methods shorten the expected representation size of data without loss of information, using a statistical model. Flow-based models are attractive in this setting because they admit exact likelihood optimization, which is equivalent to minimizing the expected number of bits per message. However, conventional flows assume continuous data, which may lead to reconstruction errors when quantized for compression. For that reason, we introduce a generative flow for ordinal discrete data called Integer Discrete Flow (IDF): a bijective integer map that can learn rich transformations on high-dimensional data. As building blocks for IDFs, we introduce flexible transformation layers called integer discrete coupling and lower triangular coupling. Our experiments show that IDFs are competitive with other flow-based generative models. Furthermore, we demonstrate that IDF based compression achieves state-of-the-art lossless compression rates on CIFAR10, ImageNet32, and ImageNet64.
more | pdf | html
###### Tweets
arxivml: "Integer Discrete Flows and Lossless Compression", Emiel Hoogeboom, Jorn W．T． Peters, Rianne van den Berg, Max Well… https://t.co/E0ZiE9UZdB
arxiv_cs_LG: Integer Discrete Flows and Lossless Compression. Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, and Max Welling https://t.co/GKWwCGi5zY
StatsPapers: Integer Discrete Flows and Lossless Compression. https://t.co/nyTFsG7aP7
arxiv_cscv: Integer Discrete Flows and Lossless Compression https://t.co/WJMrvplprf
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 5137
Unqiue Words: 1940

##### #10. TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
###### Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan
Off-policy reinforcement learning with eligibility traces is challenging because of the discrepancy between target policy and behavior policy. One common approach is to measure the difference between two policies in a probabilistic way, such as importance sampling and tree-backup. However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems. The traces are cut immediately when a non-greedy action is taken, which may lose the advantage of eligibility traces and slow down the learning process. Alternatively, some non-probabilistic measurement methods such as General Q($\lambda$) and Naive Q($\lambda$) never cut traces, but face convergence problems in practice. To address the above issues, this paper introduces a new method named TBQ($\sigma$), which effectively unifies the tree-backup algorithm and Naive Q($\lambda$). By introducing a new parameter $\sigma$ to illustrate the \emph{degree}...
more | pdf | html
None.
###### Tweets
SciFi: TBQ($\sigma$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. https://t.co/XD08R2M2pe
BrundageBot: TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/tvdMAkkIp9
arxivml: "TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning", Longxiang Shi, Shijian… https://t.co/sz3zSsxSkm
arxiv_cs_LG: TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/a2f9UuEdOc
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 128,326 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 128,326 papers.