### Top 10 Arxiv Papers Today in Distributed, Parallel, And Cluster Computing

##### #1. Virtual Net: a Decentralized Architecture for Interaction in Mobile Virtual Worlds
###### Bingqing Shen, Jingzhi Guo
With the development of mobile technology, mobile virtual worlds have attracted massive users. To improve scalability, a peer-to-peer virtual world provides the solution to accommodate more users without increasing hardware investment. In mobile settings, however, existing P2P solutions are not applicable due to the unreliability of mobile devices and the instability of mobile networks. To address the issue, a novel infrastructure model, called Virtual Net, is proposed to provide fault-tolerance in managing user content and object state. In this paper, the key problem, namely object state update, is resolved to maintain state consistency and high interaction responsiveness. This work is important in implementing a scalable mobile virtual world.
more | pdf | html
###### Tweets
ComputerPapers: Virtual Net: a Decentralized Architecture for Interaction in Mobile Virtual Worlds. https://t.co/5ZwBWyV4Us
###### Github

Evaluation code for the paper "Virtual Net: a Decentralized Architecture for Interaction in Mobile Virtual Worlds"

Repository: VirtualNetEventHandling
User: sunniel
Language: C++
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 16327
Unqiue Words: 3274

##### #2. A Performance Vocabulary for Affine Loop Transformations
###### Martin Kong, Louis-Noël Pouchet
Modern polyhedral compilers excel at aggressively optimizing codes with static control parts, but the state-of-practice to find high-performance polyhedral transformations especially for different hardware targets still largely involves auto-tuning. In this work we propose a novel polyhedral scheduling technique, with the aim to reduce the need for auto-tuning while allowing to build customizable and specific transformation strategies. We design constraints and objectives that model several crucial aspects of performance such as stride optimization or the trade-off between parallelism and reuse, while taking into account important architectural features of the target machine. The developed set of objectives embody a Performance Vocabulary for loop transformations. The goal is to use this vocabulary, consisting of performance idioms, to construct transformation recipes adapted to a number of program classes. We evaluate our work using the PolyBench/C benchmark suite and experimentally validate it against large optimization spaces...
more | pdf | html
None.
###### Tweets
ComputerPapers: A Performance Vocabulary for Affine Loop Transformations. https://t.co/8VhXEx6Ffx
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 11398
Unqiue Words: 3434

##### #3. Decentralized Data Storages: Technologies of Construction
###### Alexander Kryukov, Andrey Demichev
The paper presents a comparative overview of decentralized data storages of various types. It is shown that although they have a number of common properties that are typical of all peer-to-peer (P2P) networks, the problems to be solved and, accordingly, the technologies used to build different types of storages differ significantly.
more | pdf | html
###### Tweets
ComputerPapers: Decentralized Data Storages: Technologies of Construction. https://t.co/7GLWENyygy
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 3785
Unqiue Words: 1355

##### #4. The Amortized Analysis of a Non-blocking Chromatic Tree
###### Jeremy Ko
A non-blocking chromatic tree is a type of balanced binary search tree where multiple processes can concurrently perform search and update operations. We prove that a certain implementation has amortized cost $O(\dot{c} + \log n)$ for each operation, where $\dot{c}$ is the maximum number of concurrent operations during the execution and $n$ is the maximum number of keys in the tree during the operation. This amortized analysis presents new challenges compared to existing analyses of other non-blocking data structures.
more | pdf | html
None.
###### Tweets
ComputerPapers: The Amortized Analysis of a Non-blocking Chromatic Tree. https://t.co/3r5LyqqJTz
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 50624
Unqiue Words: 3309

##### #5. AMGCL: an Efficient, Flexible, and Extensible Algebraic Multigrid Implementation
###### Denis Demidov
The paper presents AMGCL -- an opensource C++ library implementing the algebraic multigrid method (AMG) for solution of large sparse linear systems of equations, usually arising from discretization of partial differential equations on an unstructured grid. The library supports both shared and distributed memory computation, allows to utilize modern massively parallel processors via OpenMP, OpenCL, or CUDA technologies, has minimal dependencies, and is easily extensible. The design principles behind AMGCL are discussed and it is shown that the code performance is on par with alternative implementations.
more | pdf | html
None.
###### Tweets
ComputerPapers: AMGCL: an Efficient, Flexible, and Extensible Algebraic Multigrid Implementation. https://t.co/sfVHo6N7IE
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 4964
Unqiue Words: 1811

##### #6. Large-Scale Distributed Algorithms for Facility Location with Outliers
###### Tanmay Inamdar, Shreyas Pai, Sriram V. Pemmaraju
This paper presents fast, distributed, $O(1)$-approximation algorithms for metric facility location problems with outliers in the Congested Clique model, Massively Parallel Computation (MPC) model, and in the $k$-machine model. The paper considers Robust Facility Location and Facility Location with Penalties, two versions of the facility location problem with outliers proposed by Charikar et al. (SODA 2001). The paper also considers two alternatives for specifying the input: the input metric can be provided explicitly (as an $n \times n$ matrix distributed among the machines) or implicitly as the shortest path metric of a given edge-weighted graph. The results in the paper are: - Implicit metric: For both problems, $O(1)$-approximation algorithms running in $O(\mbox{poly}(\log n))$ rounds in the Congested Clique and the MPC model and $O(1)$-approximation algorithms running in $\tilde{O}(n/k)$ rounds in the $k$-machine model. - Explicit metric: For both problems, $O(1)$-approximation algorithms running in $O(\log\log\log n)$...
more | pdf | html
None.
###### Tweets
ComputerPapers: Large-Scale Distributed Algorithms for Facility Location with Outliers. https://t.co/P9UofDQQwD
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 20806
Unqiue Words: 2845

##### #7. Latecomers Help to Meet: Deterministic Anonymous Gathering in the Plane
###### Andrzej Pelc, Ram Narayan Yadav
A team of anonymous mobile agents represented by points freely moving in the plane have to gather at a single point and stop. Agents start at different points of the plane and at possibly different times chosen by the adversary. They are equipped with compasses, a common unit of distance and clocks. They execute the same deterministic algorithm and travel at speed 1. When agents are at distance at most $\epsilon$, for some positive constant $\epsilon$ unknown to them, they can exchange all information. Due to the anonymity of the agents and the symmetry of the plane, gathering is impossible, e.g., if agents start simultaneously at distances larger than $\epsilon$. However, if some agents start with a delay with respect to others, gathering may become possible. In which situations such latecomers can enable gathering? To answer this question we consider initial configurations formalized as sets of pairs $\{(p_1,t_1), (p_2,t_2),\dots , (p_n,t_n)\}$, for $n\geq 2$ where $p_i$ is the starting point of the $i$-th agent and $t_i$ is its...
more | pdf | html
None.
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 10486
Unqiue Words: 1903

##### #8. Iterated local search and very large neighborhoods for the parallel-machines total tardiness problem
###### F Croce, Thierry Garaix, A. Grosso
We present computational results with a heuristic algorithm for the parallel machines total weighted tardiness problem. The algorithm combines generalized pairwise interchange neighborhoods, dynasearch optimization and a new machine-based neighborhood whose size is non-polynomial in the number of machines. The computational results significantly improve over the current state of the art for this problem.
more | pdf | html
None.
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 4742
Unqiue Words: 1529

##### #9. SCALE-Sim: Systolic CNN Accelerator
###### Ananda Samajdar, Yuhao Zhu, Paul Whatmough, Matthew Mattina, Tushar Krishna
Systolic Arrays are one of the most popular compute substrates within Deep Learning accelerators today, as they provide extremely high efficiency for running dense matrix multiplications. However, the research community lacks tools to insights on both the design trade-offs and efficient mapping strategies for systolic-array based accelerators. We introduce Systolic CNN Accelerator Simulator (SCALE-Sim), which is a configurable systolic array based cycle accurate DNN accelerator simulator. SCALE-Sim exposes various micro-architectural features as well as system integration parameters to the designer to enable comprehensive design space exploration. This is the first systolic-array simulator tuned for running DNNs to the best of our knowledge. Using SCALE-Sim, we conduct a suite of case studies and demonstrate the effect of bandwidth, data flow and aspect ratio on the overall runtime and energy of Deep Learning kernels across vision, speech, text, and games. We believe that these insights will be highly beneficial to architects and...
more | pdf | html
None.
###### Tweets
nmfeeds: [O] https://t.co/8UgZZEcq1W SCALE-Sim: Systolic CNN Accelerator. Systolic Arrays are one of the most popular compute subst...
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 9339
Unqiue Words: 2767

##### #10. Collaboratively Learning the Best Option on Graphs, Using Bounded Local Memory
###### Lili Su, Martin Zubeldia, Nancy Lynch
We consider multi-armed bandit problems in social groups wherein each individual has bounded memory and shares the common goal of learning the best arm/option. We say an individual learns the best option if eventually (as $t\to \infty$) it pulls only the arm with the highest expected reward. While this goal is provably impossible for an isolated individual due to bounded memory, we show that, in social groups, this goal can be achieved easily with the aid of social persuasion (i.e., communication) as long as the communication networks/graphs satisfy some mild conditions. To deal with the interplay between the randomness in the rewards and in the social interaction, we employ the {\em mean-field approximation} method. Considering the possibility that the individuals in the networks may not be exchangeable when the communication networks are not cliques, we go beyond the classic mean-field techniques and apply a refined version of mean-field approximation: (1) Using coupling we show that, if the communication graph is connected...
more | pdf | html
###### Tweets
MathPaper: Collaboratively Learning the Best Option on Graphs, Using Bounded Local Memory. https://t.co/XaweHJoLCP
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 12970
Unqiue Words: 2488

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 57,756 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 57,756 papers.