### Top 10 arXiv Papers Today in Distributed, Parallel, and Cluster Computing

##### #1. Introduction to the Tezos Blockchain
###### Victor Allombert, Mathias Bourgoin, Julien Tesson
Tezos is an innovative blockchain that improves on several aspects compared to more established blockchains. It offers an original proof-of-stake consensus algorithm and can be used as a decentralized smart contract platform. It has the capacity to amend its own economic protocol through a voting mechanism and focuses on formal methods to improve safety.
###### Tweets
cryptoassetco: #cryptocurrency Tezos https://t.co/hEH8ZjnXXE
m0h1can: A Tezos explainer has been posted https://t.co/GcpkLQI7Kl
LabosNomades: Victor Allombert, Mathias Bourgoin and Julien Tesson from Nomadic Labs just published an introduction to the Tezos blockchain, a detailed summary about the technology behind #tezos #XTZ #blockchain #SmartContracts https://t.co/LYRsEfywgH
CitezenB: @BlackmonTrader @GemCrypto "Introduction to the Tezos Blockchain" https://t.co/JQmUQymA2F #tezos
###### Other stats
Sample Sizes: None.
Authors: 3
Total Words: 7689
Unique Words: 2353

##### #2. Heterogeneity-Aware Asynchronous Decentralized Training
###### Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian
Distributed deep learning training usually adopts All-Reduce as the synchronization mechanism for data-parallel algorithms due to its high performance in homogeneous environments. However, its performance is bounded by the slowest worker among all workers, and it is significantly slower in heterogeneous situations. AD-PSGD, a newly proposed synchronization method that provides numerically fast convergence and heterogeneity tolerance, suffers from deadlock issues and high synchronization overhead. Is it possible to get the best of both worlds: a distributed training method that has both the high performance of All-Reduce in homogeneous environments and the good heterogeneity tolerance of AD-PSGD? In this paper, we propose Ripples, a high-performance heterogeneity-aware asynchronous decentralized training approach. We achieve the above goal with intensive synchronization optimization, emphasizing the interplay between algorithm and system implementation. To reduce synchronization cost, we propose a novel communication primitive...
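The claim that All-Reduce is bounded by the slowest worker can be illustrated with a toy cost model. This is only a sketch and is not Ripples or AD-PSGD: the function names and the per-worker timings are hypothetical, and it assumes one synchronous step costs the slowest worker's compute time plus a fixed communication time.

```python
def allreduce_step_time(compute_times, comm_time=0.0):
    """Toy model: a synchronous step ends only when the slowest worker is done.

    compute_times: hypothetical per-worker compute times for one step.
    comm_time: assumed fixed cost of the collective itself.
    """
    return max(compute_times) + comm_time


def idle_times(compute_times):
    """Time each worker spends waiting for the slowest worker in one step."""
    slowest = max(compute_times)
    return [slowest - t for t in compute_times]
```

With three workers that need 1.0 time units and one straggler that needs 5.0, `allreduce_step_time([1.0, 1.0, 1.0, 5.0], 0.5)` is governed entirely by the straggler, and `idle_times` shows how long the fast workers sit idle.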
###### Tweets
arxiv_org: Heterogeneity-Aware Asynchronous Decentralized Training. https://t.co/hmJ7tfA6bR https://t.co/nCAJrgVFmd
arxivml: "Heterogeneity-Aware Asynchronous Decentralized Training", Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian https://t.co/JSAukMBFnC
arxiv_cs_LG: Heterogeneity-Aware Asynchronous Decentralized Training. Qinyi Luo, Jiaao He, Youwei Zhuo, and Xuehai Qian https://t.co/xNuzmg2l23
Memoirs: Heterogeneity-Aware Asynchronous Decentralized Training. https://t.co/jggA98umC1
###### Other stats
Sample Sizes: None.
Authors: 4
Total Words: 0
Unique Words: 0

##### #3. Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring
###### Marco De Bortoli
Answer Set Programming (ASP) is a well-known logic language for knowledge representation that has been very successful in recent years, as shown by the strong interest in developing efficient ASP solvers. Yet the heavy resource demands of certain problem types, such as planning, still limit problem solving. In particular, when the program is grounded before the solving phase, an exponential blow-up of the grounding can produce a huge ground file that is infeasible for single machines with limited resources, preventing the discovery of even a single non-optimal solution. To address this problem, we present a distributed approach to ASP solving that exploits distributed computation to overcome these limitations. The tool presented here, called Distributed Answer Set Coloring (DASC), is a pure solver based on the well-known Graph Coloring algorithm. DASC is part of a bigger project aiming to bring logic programming...
###### Tweets
arxivml: "Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring", Marco De Bortoli https://t.co/4HbgtctELI
okateim: 2019/09/19 [7] Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring (https://t.co/FF8OIwDil5)
SciFi: Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring. https://t.co/yD0EIx9yJA
arxiv_cslo: Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring https://t.co/XNj4YVKYdF
###### Other stats
Sample Sizes: None.
Authors: 1
Total Words: 0
Unique Words: 0

##### #4. Message Reduction in the Local Model is a Free Lunch
###### Shimon Bitton, Yuval Emek, Taisuke Izumi, Shay Kutten
A new *spanner* construction algorithm is presented, working under the *LOCAL* model with unique edge IDs. Given an $n$-node communication graph, a spanner with constant stretch and $O(n^{1 + \varepsilon})$ edges (for an arbitrarily small constant $\varepsilon > 0$) is constructed in a constant number of rounds, sending $O(n^{1 + \varepsilon})$ messages whp. Consequently, we conclude that every $t$-round LOCAL algorithm can be transformed into an $O(t)$-round LOCAL algorithm that sends $O(t \cdot n^{1 + \varepsilon})$ messages whp. This improves upon all previous message-reduction schemes for LOCAL algorithms that incur a $\log^{\Omega(1)} n$ blow-up of the round complexity.
###### Tweets
okateim: 2019/09/19 [6] Message Reduction in the Local Model is a Free Lunch (https://t.co/eBMLNI73IR)
###### Other stats
Sample Sizes: None.
Authors: 4
Total Words: 7932
Unique Words: 2137

##### #5. DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding
###### Hyungro Lee, Heng Ma, Matteo Turilli, Debsindhu Bhowmik, Shantenu Jha, Arvind Ramanathan
Simulations of biological macromolecules play an important role in understanding the physical basis of a number of complex processes such as protein folding. Even with increasing computational power and the evolution of specialized architectures, simulating protein folding at atomistic scales remains challenging. This stems from two factors: the high dimensionality of protein conformational landscapes, and the inability of atomistic molecular dynamics (MD) simulations to sample these landscapes sufficiently to observe folding events. Machine learning/deep learning (ML/DL) techniques, when combined with atomistic MD simulations, offer the opportunity to overcome these limitations by: (1) effectively reducing the dimensionality of MD simulations to automatically build latent representations that correspond to biophysically relevant reaction coordinates (RCs), and (2) driving MD simulations to automatically sample potentially novel conformational states based on these RCs. We examine how coupling DL...
###### Tweets
arxiv_cs_LG: DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding. Hyungro Lee, Heng Ma, Matteo Turilli, Debsindhu Bhowmik, Shantenu Jha, and Arvind Ramanathan https://t.co/yIAC6UR4Vf
###### Other stats
Sample Sizes: None.
Authors: 6
Total Words: 0
Unique Words: 0

##### #6. Network-Aware Container Scheduling in Multi-Tenant Data Center
###### Leonardo R. Rodrigues, Marcelo Pasin, Omir C. Alves Jr., Charles C. Miers, Mauricio A. Pillon, Pascal Felber, Guilherme P. Koslovski
Network management in multi-tenant container-based data centers has a critical impact on performance. Tenants encapsulate applications in containers, abstracting away details of the hosting infrastructure, and entrust the data center's management framework with provisioning network QoS requirements. In this paper, we propose a network-aware multi-criteria container scheduler that jointly processes container and network requirements. We introduce a new Mixed Integer Linear Programming formulation for network-aware scheduling that encompasses both tenant and provider metrics. We describe two GPU-accelerated modules that address the complexity barrier of the problem and efficiently process scheduling requests. Our experiments show that our scheduling approach, which accounts for both network and containers, outperforms traditional algorithms used by container orchestrators.
###### Other stats
Sample Sizes: None.
Authors: 7
Total Words: 0
Unique Words: 0

##### #7. Certifying Blockchain Byzantine Fault Tolerance
###### Pierre Tholoniat, Vincent Gramoli
To implement a blockchain, the trend is now to integrate a non-trivial Byzantine fault tolerant consensus algorithm instead of the seminal idea of waiting to receive blocks to decide upon the longest branch. After a decade of existence, blockchains now trade large amounts of valuable assets, and a simple disagreement could lead to disastrous losses. Unfortunately, Byzantine consensus solutions used in blockchains are at best proved correct "by hand", as we are not aware of any of them having been certified. In this paper, we propose two contributions: (i) we illustrate the severity of the problem by listing six vulnerabilities of blockchain consensus, including two new counter-examples; (ii) we then certify two Byzantine fault tolerant components of Red Belly Blockchain using the ByMC model checker: first, we specify a simple broadcast primitive in 116 lines that is certified in 40 seconds on a 2-core Intel machine, and second, a blockchain consensus algorithm written in 309 lines of code and certified, using MPI, in 17 minutes on a...
###### Other stats
Sample Sizes: None.
Authors: 2
Total Words: 10646
Unique Words: 2839

##### #8. HERALD: Optimizing Heterogeneous DNN Accelerators for Edge Devices
###### Hyoukjun Kwon, Liangzhen Lai, Tushar Krishna, Vikas Chandra
Recent advances in deep neural networks (DNNs) have made DNNs the backbone of many applications on edge devices, such as face recognition and object detection. To deal with the massive computation requirements of DNN inferences within stringent energy and latency constraints, DNN accelerators (i.e., hardware specialized for DNN inferences) have emerged as a promising solution. Such advancement of hardware supporting DNNs has led to multiple DNN-based applications running at the same time on edge devices. They often run in parallel as background processes or as sub-tasks of a complex application. Thus, DNN workloads on a DNN accelerator now include a variety of layer operations and sizes from DNN models for diverse applications, making them heterogeneous in layer granularity. Such heterogeneous workloads introduce a new major challenge for monolithic DNN accelerators because the efficiency of a DNN accelerator relies on its dataflow, and different DNN layer types and shapes prefer different dataflows. In this work, we propose to...
###### Other stats
Sample Sizes: None.
Authors: 4
Total Words: 8562
Unique Words: 2312

##### #9. Mitigating Network Noise on Dragonfly Networks through Application-Aware Routing
###### Daniele De Sensi, Salvatore Di Girolamo, Torsten Hoefler
System noise can negatively impact the performance of HPC systems, and the interconnection network is one of the main factors contributing to this problem. To mitigate this effect, adaptive routing sends packets on non-minimal paths if they are less congested. However, while this may mitigate interference caused by congestion, it also generates more traffic since packets traverse additional hops, causing in turn congestion on other applications and on the application itself. In this paper, we first describe how to estimate network noise. By following these guidelines, we show how noise can be reduced by using routing algorithms which select minimal paths with a higher probability. We exploit this knowledge to design an algorithm which changes the probability of selecting minimal paths according to the application characteristics. We validate our solution on microbenchmarks and real-world applications on two systems relying on a Dragonfly interconnection network, showing noise reduction and performance improvement.
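The trade-off in this abstract, where non-minimal paths avoid congestion but add hops, can be sketched with a probabilistic path choice. This is a minimal illustration and not the paper's algorithm, which the abstract does not spell out: `choose_path`, `expected_hops`, and the hop counts are hypothetical names and values.

```python
import random


def choose_path(p_minimal, rng):
    """Pick a path class; p_minimal is the probability of the minimal path."""
    return "minimal" if rng.random() < p_minimal else "non_minimal"


def expected_hops(p_minimal, minimal_hops, non_minimal_hops):
    """Average hops per packet for a given minimal-path probability."""
    return p_minimal * minimal_hops + (1.0 - p_minimal) * non_minimal_hops
```

Under these assumptions, raising `p_minimal` lowers the average hop count, and therefore the extra traffic injected into the network, at the cost of less freedom to route around congested links.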
###### Other stats
Sample Sizes: None.
Authors: 3
Total Words: 11766
Unique Words: 3076

##### #10. When Two is Worse Than One
###### R. Guerin
This note is concerned with the impact on job latency of splitting a token bucket into multiple sub-token buckets with equal aggregate parameters and offered the same job arrival process. The situation commonly arises in distributed computing environments where job arrivals are rate controlled (each job needs one token to enter the system), but capacity limitations call for distributing jobs across multiple compute resources with scalability considerations preventing the use of a centralized rate control component (each compute resource is responsible for monitoring and enforcing that the job stream it receives conforms to a certain traffic envelope). The question we address is to what extent splitting a token bucket into multiple sub-token buckets that individually rate control a subset of the original arrival process affects job latency, when jobs wait for a token whenever the token bucket is empty upon their arrival. Our contribution is to establish that independent of the job arrival process and how jobs are distributed across...
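The setup in this abstract can be explored with a small simulation. The sketch below is an illustration under stated assumptions, not the note's analysis: the function names are hypothetical, buckets start full, each job needs one token and waits in FIFO order, jobs are split round-robin, and each sub-bucket gets an equal share of the rate and depth (assumed to be at least one token deep).

```python
def token_bucket_waits(arrivals, rate, depth):
    """Waiting time of each job at one token bucket (FIFO, one token per job).

    arrivals: sorted, non-negative arrival times.
    rate: tokens added per unit time; depth: bucket capacity (assumed >= 1).
    """
    tokens = float(depth)   # the bucket starts full
    last = 0.0              # time of the most recent token grant
    waits = []
    for t in arrivals:
        now = max(t, last)  # a job cannot overtake the one ahead of it
        tokens = min(depth, tokens + rate * (now - last))
        if tokens >= 1.0:
            release = now
            tokens -= 1.0
        else:
            # wait until the missing fraction of a token has accumulated
            release = now + (1.0 - tokens) / rate
            tokens = 0.0
        waits.append(release - t)
        last = release
    return waits


def split_waits(arrivals, rate, depth, k):
    """Same jobs spread round-robin over k sub-buckets of rate/k and depth/k."""
    waits = [0.0] * len(arrivals)
    for b in range(k):
        idx = list(range(b, len(arrivals), k))
        sub = token_bucket_waits([arrivals[i] for i in idx], rate / k, depth / k)
        for i, w in zip(idx, sub):
            waits[i] = w
    return waits
```

For a burst of four simultaneous jobs with `rate=1.0` and `depth=2`, comparing `sum(token_bucket_waits(...))` with `sum(split_waits(..., k=2))` shows the total wait under one bucket versus two half-size buckets for this particular input.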
###### Other stats
Sample Sizes: None.
Authors: 1
Total Words: 0
Unique Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter: @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

Tracking 192,914 papers.