As global greenhouse gas emissions continue to rise, the use of stratospheric
aerosol injection (SAI), a form of solar geoengineering, is increasingly
considered in order to artificially mitigate climate change effects. However,
initial research in simulation suggests that naive SAI can have catastrophic
regional consequences, which may induce serious geostrategic conflicts. Current
geo-engineering research treats SAI control in low-dimensional approximation
only. We suggest treating SAI as a high-dimensional control problem, with
policies trained according to a context-sensitive reward function within the
Deep Reinforcement Learning (DRL) paradigm. In order to facilitate training in
simulation, we suggest to emulate HadCM3, a widely used General Circulation
Model, using deep learning techniques. We believe this is the first application
of DRL to the climate sciences.

more |
pdf
| html
None.

BrundageBot:
Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/S33lznkoxi

geschichtenpost:
#ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu

arxivml:
"Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem",
Christian Schroeder de Witt, Thomas Hor…
https://t.co/gkzXfEsBx4

kushnerbomb:
for all the talk of skynet or whatever i think the actual way artificial intelligence kills us is a fucking moron billionaire (sam a****n probably) gets it in his head to use deep RL for geoengineering and doesn't initialize the network exactly right https://t.co/SQxkp6Wqjm

arxiv_cs_LG:
Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. Christian Schroeder de Witt and Thomas Hornigold https://t.co/5Jl0QPRUrj

StatsPapers:
Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60

JohnSam57668631:
RT @StatsPapers: Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem. https://t.co/sxcsnefg60

JohnSam57668631:
RT @geschichtenpost: #ArXiv Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem https://t.co/ETscCKuGfu

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

The vulnerability to adversarial attacks has been a critical issue for deep
neural networks. Addressing this issue requires a reliable way to evaluate the
robustness of a network. Recently, several methods have been developed to
compute $\textit{robustness quantification}$ for neural networks, namely,
certified lower bounds of the minimum adversarial perturbation. Such methods,
however, were devised for feed-forward networks, e.g. multi-layer perceptron or
convolutional networks. It remains an open problem to quantify robustness for
recurrent networks, especially LSTM and GRU. For such networks, there exist
additional challenges in computing the robustness quantification, such as
handling the inputs at multiple steps and the interaction between gates and
states. In this work, we propose $\textit{POPQORN}$
($\textbf{P}$ropagated-$\textbf{o}$ut$\textbf{p}$ut $\textbf{Q}$uantified
R$\textbf{o}$bustness for $\textbf{RN}$Ns), a general algorithm to quantify
robustness of RNNs, including vanilla RNNs, LSTMs, and GRUs. We demonstrate...

more |
pdf
| html
None.

BrundageBot:
POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/bAmWwtXYOs

arxivml:
"POPQORN: Quantifying Robustness of Recurrent Neural Networks",
Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Dan…
https://t.co/is6Rbw9K6h

arxiv_cs_LG:
POPQORN: Quantifying Robustness of Recurrent Neural Networks. Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, and Dahua Lin https://t.co/JV6yfcLPbj

StatsPapers:
POPQORN: Quantifying Robustness of Recurrent Neural Networks. https://t.co/1joHMvKfl6

arxiv_cscv:
POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ

arxiv_cscv:
POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqH0Ez

sei_shinagawa:
RT @arxiv_cscv: POPQORN: Quantifying Robustness of Recurrent Neural Networks https://t.co/4PaxAqppfZ

None.

None.

Sample Sizes : None.

Authors: 6

Total Words: 0

Unqiue Words: 0

In this paper we propose DeepSwarm, a novel neural architecture search (NAS)
method based on Swarm Intelligence principles. At its core DeepSwarm uses Ant
Colony Optimization (ACO) to generate ant population which uses the pheromone
information to collectively search for the best neural architecture.
Furthermore, by using local and global pheromone update rules our method
ensures the balance between exploitation and exploration. On top of this, to
make our method more efficient we combine progressive neural architecture
search with weight reusability. Furthermore, due to the nature of ACO our
method can incorporate heuristic information which can further speed up the
search process. After systematic and extensive evaluation, we discover that on
three different datasets (MNIST, Fashion-MNIST, and CIFAR-10) when compared to
existing systems our proposed method demonstrates competitive performance.
Finally, we open source DeepSwarm as a NAS library and hope it can be used by
more deep learning researchers and practitioners.

more |
pdf
| html
None.

BrundageBot:
DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/qkum9pgggQ

arxivml:
"DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence",
Edvinas Byla, Wei Pang
https://t.co/sGND1bEupX

arxiv_cs_LG:
DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. Edvinas Byla and Wei Pang https://t.co/0Bw1G2FUlO

StatsPapers:
DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9

cd_fuller:
RT @StatsPapers: DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence. https://t.co/1lQf0cOTX9

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

We present contrastive fairness, a new direction in causal inference applied
to algorithmic fairness. Earlier methods dealt with the "what if?" question
(counterfactual fairness, NeurIPS'17). We establish the theoretical and
mathematical implications of the contrastive question "why this and not that?"
in context of algorithmic fairness in machine learning. This is essential to
defend the fairness of algorithmic decisions in tasks where a person or
sub-group of people is chosen over another (job recruitment, university
admission, company layovers, etc). This development is also helpful to
institutions to ensure or defend the fairness of their automated decision
making processes. A test case of employee job location allocation is provided
as an illustrative example.

more |
pdf
| html
None.

arxivml:
"Contrastive Fairness in Machine Learning",
Tapabrata Chakraborti, Arijit Patra, Alison Noble
https://t.co/Pq5X3byd1c

arxiv_cs_LG:
Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr

StatsPapers:
Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ

SantchiWeb:
RT @arxiv_cs_LG: Contrastive Fairness in Machine Learning. Tapabrata Chakraborti, Arijit Patra, and Alison Noble https://t.co/xSphGRryzr

cd_fuller:
RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ

minsuk_chang:
RT @StatsPapers: Contrastive Fairness in Machine Learning. https://t.co/aCkHhKsdmJ

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

In recent years, advances in deep learning have resulted in unprecedented
leaps in diverse tasks spanning from speech and object recognition to context
awareness and health monitoring. As a result, an increasing number of
AI-enabled applications are being developed targeting ubiquitous and mobile
devices. While deep neural networks (DNNs) are getting bigger and more complex,
they also impose a heavy computational and energy burden on the host devices,
which has led to the integration of various specialized processors in commodity
devices. Given the broad range of competing DNN architectures and the
heterogeneity of the target hardware, there is an emerging need to understand
the compatibility between DNN-platform pairs and the expected performance
benefits on each platform. This work attempts to demystify this landscape by
systematically evaluating a collection of state-of-the-art DNNs on a wide
variety of commodity devices. In this respect, we identify potential
bottlenecks in each architecture and provide important guidelines...

more |
pdf
| html
None.

BrundageBot:
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/ECtyjhRztD

arxivml:
"EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices",
Mario Almeid…
https://t.co/t3xuZNbsIT

arxiv_cs_LG:
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, and Nicholas D. Lane https://t.co/CCMV6vn30G

StatsPapers:
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices. https://t.co/bTHxpUucIF

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 5230

Unqiue Words: 2017

We describe a new approach for mitigating risk in the Reinforcement Learning
paradigm. Instead of reasoning about expected utility, we use second-order
stochastic dominance (SSD) to directly compare the inherent risk of random
returns induced by different actions. We frame the RL optimization within the
space of probability measures to accommodate the SSD relation, treating
Bellman's equation as a potential energy functional. This brings us to
Wasserstein gradient flows, for which the optimality and convergence are well
understood. We propose a discrete-measure approximation algorithm called the
Dominant Particle Agent (DPA), and we demonstrate how safety and performance
are better balanced with DPA than with existing baselines.

more |
pdf
| html
None.

BrundageBot:
Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/C2ReFuPcH2

arxivml:
"Stochastically Dominant Distributional Reinforcement Learning",
John D． Martin, Michal Lyskawinski, Xiaohu Li, Bre…
https://t.co/rOZZkuyawi

arxiv_cs_LG:
Stochastically Dominant Distributional Reinforcement Learning. John D. Martin, Michal Lyskawinski, Xiaohu Li, and Brendan Englot https://t.co/99JjpBIIOy

StatsPapers:
Stochastically Dominant Distributional Reinforcement Learning. https://t.co/FJJKfWghJV

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 0

Unqiue Words: 0

In order to integrate uncertainty estimates into deep time-series modelling,
Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep
learning models, however, such approaches typically rely on approximate
inference techniques such as variational inference which makes learning more
complex and often less scalable due to approximation errors. We propose a new
deep approach to Kalman filtering which can be learned directly in an
end-to-end manner using backpropagation without additional approximations. Our
approach uses a high-dimensional factorized latent state representation for
which the Kalman updates simplify to scalar operations and thus avoids hard to
backpropagate, computationally heavy and potentially unstable matrix
inversions. Moreover, we use locally linear dynamic models to efficiently
propagate the latent state to the next time step. The resulting network
architecture, which we call Recurrent Kalman Network (RKN), can be used for any
time-series data, similar to a LSTM (Hochreiter & Schmidhuber, 1997)...

more |
pdf
| html
None.

BrundageBot:
Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/Avte1erREq

arxivml:
"Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces",
Philipp Becker, Harit Pa…
https://t.co/kpW4JNdzfT

arxiv_cs_LG:
Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, and Gerhard Neumann https://t.co/4OFctcdu4Y

StatsPapers:
Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces. https://t.co/BdMeiN2R3c

None.

None.

Sample Sizes : None.

Authors: 6

Total Words: 0

Unqiue Words: 0

We propose a novel model for temporal detection and localization which allows
the training of deep neural networks using only counts of event occurrences as
training labels. This powerful weakly-supervised framework alleviates the
burden of the imprecise and time-consuming process of annotating event
locations in temporal data. Unlike existing methods, in which localization is
explicitly achieved by design, our model learns localization implicitly as a
byproduct of learning to count instances. This unique feature is a direct
consequence of the model's theoretical properties. We validate the
effectiveness of our approach in a number of experiments (drum hit and piano
onset detection in audio, digit detection in images) and demonstrate
performance comparable to that of fully-supervised state-of-the-art methods,
despite much weaker training requirements.

more |
pdf
| html
None.

BrundageBot:
Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/9NBB7OiEI0

arxivml:
"Weakly-Supervised Temporal Localization via Occurrence Count Learning",
Julien Schroeter, Kirill Sidorov, David Ma…
https://t.co/UvJPJnTePH

arxiv_cs_LG:
Weakly-Supervised Temporal Localization via Occurrence Count Learning. Julien Schroeter, Kirill Sidorov, and David Marshall https://t.co/QolU8zIL46

StatsPapers:
Weakly-Supervised Temporal Localization via Occurrence Count Learning. https://t.co/s3f4saAhcM

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

Lossless compression methods shorten the expected representation size of data
without loss of information, using a statistical model. Flow-based models are
attractive in this setting because they admit exact likelihood optimization,
which is equivalent to minimizing the expected number of bits per message.
However, conventional flows assume continuous data, which may lead to
reconstruction errors when quantized for compression. For that reason, we
introduce a generative flow for ordinal discrete data called Integer Discrete
Flow (IDF): a bijective integer map that can learn rich transformations on
high-dimensional data. As building blocks for IDFs, we introduce flexible
transformation layers called integer discrete coupling and lower triangular
coupling. Our experiments show that IDFs are competitive with other flow-based
generative models. Furthermore, we demonstrate that IDF based compression
achieves state-of-the-art lossless compression rates on CIFAR10, ImageNet32,
and ImageNet64.

more |
pdf
| html
arxivml:
"Integer Discrete Flows and Lossless Compression",
Emiel Hoogeboom, Jorn W．T． Peters, Rianne van den Berg, Max Well…
https://t.co/E0ZiE9UZdB

arxiv_cs_LG:
Integer Discrete Flows and Lossless Compression. Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, and Max Welling https://t.co/GKWwCGi5zY

StatsPapers:
Integer Discrete Flows and Lossless Compression. https://t.co/nyTFsG7aP7

arxiv_cscv:
Integer Discrete Flows and Lossless Compression https://t.co/WJMrvplprf

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 5137

Unqiue Words: 1940

Off-policy reinforcement learning with eligibility traces is challenging
because of the discrepancy between target policy and behavior policy. One
common approach is to measure the difference between two policies in a
probabilistic way, such as importance sampling and tree-backup. However,
existing off-policy learning methods based on probabilistic policy measurement
are inefficient when utilizing traces under a greedy target policy, which is
ineffective for control problems. The traces are cut immediately when a
non-greedy action is taken, which may lose the advantage of eligibility traces
and slow down the learning process. Alternatively, some non-probabilistic
measurement methods such as General Q($\lambda$) and Naive Q($\lambda$) never
cut traces, but face convergence problems in practice. To address the above
issues, this paper introduces a new method named TBQ($\sigma$), which
effectively unifies the tree-backup algorithm and Naive Q($\lambda$). By
introducing a new parameter $\sigma$ to illustrate the \emph{degree}...

more |
pdf
| html
None.

SciFi:
TBQ($\sigma$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. https://t.co/XD08R2M2pe

BrundageBot:
TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/tvdMAkkIp9

arxivml:
"TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning",
Longxiang Shi, Shijian…
https://t.co/sz3zSsxSkm

arxiv_cs_LG:
TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, and Gang Pan https://t.co/a2f9UuEdOc

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 128,326 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible