Top 10 Arxiv Papers Today in Robotics


2.129 Mikeys
#1. Multi-Robot Deep Reinforcement Learning with Macro-Actions
Yuchen Xiao, Joshua Hoffman, Tian Xia, Christopher Amato
In many real-world multi-robot tasks, high-quality solutions often require a team of robots to perform asynchronous actions under decentralized control. Multi-agent reinforcement learning methods have difficulty learning decentralized policies because the environment appearing to be non-stationary due to other agents also learning at the same time. In this paper, we address this challenge by proposing a macro-action-based decentralized multi-agent double deep recurrent Q-net (MacDec-MADDRQN) which creates a new double Q-updating rule to train each decentralized Q-net using a centralized Q-net for action selection. A generalized version of MacDec-MADDRQN with two separate training environments, called Parallel-MacDec-MADDRQN, is also presented to cope with the uncertainty in adopting either centralized or decentralized exploration. The advantages and the practical nature of our methods are demonstrated by achieving near-centralized results in simulation experiments and permitting real robots to accomplish a warehouse tool delivery...
more | pdf | html
Figures
None.
Tweets
BrundageBot: Multi-Robot Deep Reinforcement Learning with Macro-Actions. Yuchen Xiao, Joshua Hoffman, Tian Xia, and Christopher Amato https://t.co/U7D5Ph20MC
SciFi: Multi-Robot Deep Reinforcement Learning with Macro-Actions. https://t.co/iZIHDSR45s
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.091 Mikeys
#2. DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning
Vassilios Tsounis, Mitja Alge, Joonho Lee, Farbod Farshidian, Marco Hutter
This paper addresses the problem of legged locomotion in non-flat terrain. As legged robots such as quadrupeds are to be deployed in terrains with geometries which are difficult to model and predict, the need arises to equip them with the capability to generalize well to unforeseen situations. In this work, we propose a novel technique for training neural-network policies for terrain-aware locomotion, which combines state-of-the-art methods for model-based motion planning and reinforcement learning. Our approach is centered on formulating Markov decision processes using the evaluation of dynamic feasibility criteria in place of physical simulation. We thus employ policy-gradient methods to independently train policies which respectively plan and execute foothold and base motions in 3D environments using both proprioceptive and exteroceptive measurements. We apply our method within a challenging suite of simulated terrain scenarios which contain features such as narrow bridges, gaps and stepping-stones, and train policies which...
more | pdf | html
Figures
Tweets
BrundageBot: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning. Vassilios Tsounis, Mitja Alge, Joonho Lee, Farbod Farshidian, and Marco Hutter https://t.co/wX1SO7cq8J
sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://t.co/dqqomFwBXj
arxivml: "DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning", Vassilios Tsounis, Mitja A… https://t.co/ClZxtcqnah
Memoirs: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning. https://t.co/lT5fcLQGA6
ceobillionaire: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
blessingyuki: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
porizou1: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
__tmats__: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
KouroshMeshgi: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
beduffy1: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
mir_k: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
cighos: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
elfeleven11: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
sagarpath: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
sagarpath: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
JianweiLiu93: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
ratneshmadaan: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
viktor_m81: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
MAAATT__11235: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
junja941: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
HwangboJemin: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
dave_co_dev: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
erickTorneroT: RT @sim2realAIorg: DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning https://t.co/jt8Yfch9io https://…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 7603
Unqiue Words: 2363

2.087 Mikeys
#3. Vision-Based Proprioceptive Sensing for Soft Inflatable Actuators
Peter Werner, Matthias Hofer, Carmelo Sferrazza, Raffaello D'Andrea
This paper presents a vision-based sensing approach for a soft linear actuator, which is equipped with an integrated camera. The proposed vision-based sensing pipeline predicts the three-dimensional position of a point of interest on the actuator. To train and evaluate the algorithm, predictions are compared to ground truth data from an external motion capture system. An off-the-shelf distance sensor is integrated in a similar actuator and its performance is used as a baseline for comparison. The resulting sensing pipeline runs at 40 Hz in real-time on a standard laptop and is additionally used for closed loop elongation control of the actuator. It is shown that the approach can achieve comparable accuracy to the distance sensor.
more | pdf | html
Figures
None.
Tweets
arxiv_cscv: Vision-Based Proprioceptive Sensing for Soft Inflatable Actuators https://t.co/c2i21U7RGP
arxiv_cs_cv_pr: Vision-Based Proprioceptive Sensing for Soft Inflatable Actuators. Peter Werner, Matthias Hofer, Carmelo Sferrazza, and Raffaello D'Andrea https://t.co/OGAbkyMpvN
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.087 Mikeys
#4. Assembly of randomly placed parts realized by using only one robot arm with a general parallel-jaw gripper
Jie Zhao, Xin Jiang, Xiaoman Wang, Shengfan Wang, Yunhui Liu
In industry assembly lines, parts feeding machines are widely employed as the prologue of the whole procedure. They play the role of sorting the parts randomly placed in bins to the state with specified pose. With the help of the parts feeding machines, the subsequent assembly processes by robot arm can always start from the same condition. Thus it is expected that function of parting feeding machine and the robotic assembly can be integrated with one robot arm. This scheme can provide great flexibility and can also contribute to reduce the cost. The difficulties involved in this scheme lie in the fact that in the part feeding phase, the pose of the part after grasping may be not proper for the subsequent assembly. Sometimes it can not even guarantee a stable grasp. In this paper, we proposed a method to integrate parts feeding and assembly within one robot arm. This proposal utilizes a specially designed gripper tip mounted on the jaws of a two-fingered gripper. With the modified gripper, in-hand manipulation of the grasped...
more | pdf | html
Figures
None.
Tweets
arxiv_cscv: Assembly of randomly placed parts realized by using only one robot arm with a general parallel-jaw gripper https://t.co/rPYnps9mUe
arxiv_cs_cv_pr: Assembly of randomly placed parts realized by using only one robot arm with a general parallel-jaw gripper. Jie Zhao, Xin Jiang, Xiaoman Wang, Shengfan Wang, and Yunhui Liu https://t.co/Bx49Fbk6ZY
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.048 Mikeys
#5. Graph Neural Networks for Human-aware Social Navigation
Luis J. Manso, Ronit R. Jorvekar, Diego R. Faria, Pablo Bustos, Pilar Bachiller
Autonomous navigation is a key skill for assistive and service robots. To be successful, robots have to navigate avoiding going through the personal spaces of the people surrounding them. Complying with social rules such as not getting in the middle of human-to-human and human-to-object interactions is also important. This paper suggests using Graph Neural Networks to model how inconvenient the presence of a robot would be in a particular scenario according to learned human conventions so that it can be used by path planning algorithms. To do so, we propose two ways of modelling social interactions using graphs and benchmark them with different Graph Neural Networks using the SocNav1 dataset. We achieve close-to-human performance in the dataset and argue that, in addition to promising results, the main advantage of the approach is its scalability in terms of the number of social factors that can be considered and easily embedded in code, in comparison with model-based approaches. The code used to train and test the resulting graph...
more | pdf | html
Figures
None.
Tweets
BrundageBot: Graph Neural Networks for Human-aware Social Navigation. Luis J. Manso, Ronit R. Jorvekar, Diego R. Faria, Pablo Bustos, and Pilar Bachiller https://t.co/vbe4F5aP9J
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.032 Mikeys
#6. Split Deep Q-Learning for Robust Object Singulation
Iason Sarantopoulos, Marios Kiatos, Zoe Doulgeri, Sotiris Malassiotis
Extracting a known target object from a pile of other objects in a cluttered environment is a challenging robotic manipulation task encountered in many applications of robotics. In such conditions, the target object touches or is covered by adjacent obstacle objects, thus rendering traditional grasping techniques ineffective. In this paper, we propose a pushing policy aiming at singulating the target object from its surrounding clutter, by means of lateral pushing movements of both the neighboring objects and the target object until sufficient 'grasping room' has been achieved. To achieve the above goal we employ reinforcement learning and particularly Deep Q-learning (DQN) to learn optimal push policies by trial and error. A novel Split DQN is proposed to improve the learning rate and increase the modularity of the algorithm. Experiments show that although learning is performed in a simulated environment the transfer of learned policies to a real environment is effective thanks to robust feature selection and learning. Finally,...
more | pdf | html
Figures
Tweets
BrundageBot: Split Deep Q-Learning for Robust Object Singulation. Iason Sarantopoulos, Marios Kiatos, Zoe Doulgeri, and Sotiris Malassiotis https://t.co/CSsH27cVfL
arxivml: "Split Deep Q-Learning for Robust Object Singulation", Iason Sarantopoulos, Marios Kiatos, Zoe Doulgeri, Sotiris Ma… https://t.co/TKNPg5CjSZ
arxiv_cs_LG: Split Deep Q-Learning for Robust Object Singulation. Iason Sarantopoulos, Marios Kiatos, Zoe Doulgeri, and Sotiris Malassiotis https://t.co/Y5oOJ7OobI
Memoirs: Split Deep Q-Learning for Robust Object Singulation. https://t.co/MZtgPDdfA5
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 5656
Unqiue Words: 1734

2.026 Mikeys
#7. Agent Prioritization for Autonomous Navigation
Khaled S. Refaat, Kai Ding, Natalia Ponomareva, Stéphane Ross
In autonomous navigation, a planning system reasons about other agents to plan a safe and plausible trajectory. Before planning starts, agents are typically processed with computationally intensive models for recognition, tracking, motion estimation and prediction. With limited computational resources and a large number of agents to process in real time, it becomes important to efficiently rank agents according to their impact on the decision making process. This allows spending more time processing the most important agents. We propose a system to rank agents around an autonomous vehicle (AV) in real time. We automatically generate a ranking data set by running the planner in simulation on real-world logged data, where we can afford to run more accurate and expensive models on all the agents. The causes of various planner actions are logged and used for assigning ground truth importance scores. The generated data set can be used to learn ranking models. In particular, we show the utility of combining learned features, via a...
more | pdf | html
Figures
None.
Tweets
SciFi: Agent Prioritization for Autonomous Navigation. https://t.co/7YjRHJSz4O
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.009 Mikeys
#8. Visual Measurement Integrity Monitoring for UAV Localization
Chengyao Li, Steven L. Waslander
Unmanned aerial vehicles (UAVs) have increasingly been adopted for safety, security, and rescue missions, for which they need precise and reliable pose estimates relative to their environment. To ensure mission safety when relying on visual perception, it is essential to have an approach to assess the integrity of the visual localization solution. However, to the best of our knowledge, such an approach does not exist for optimization-based visual localization. Receiver autonomous integrity monitoring (RAIM) has been widely used in global navigation satellite systems (GNSS) applications such as automated aircraft landing. In this paper, we propose a novel approach inspired by RAIM to monitor the integrity of optimization-based visual localization and calculate the protection level of a state estimate, i.e. the largest possible translational error in each direction. We also propose a metric that quantitatively evaluates the performance of the error bounds. Finally, we validate the protection level using the EuRoC dataset and...
more | pdf | html
Figures
None.
Tweets
arxiv_cscv: Visual Measurement Integrity Monitoring for UAV Localization https://t.co/e3WLhpSqRb
arxiv_cs_cv_pr: Visual Measurement Integrity Monitoring for UAV Localization. Chengyao Li and Steven L. Waslander https://t.co/FT0r6wpKC0
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.004 Mikeys
#9. Adversarial Feature Training for Generalizable Robotic Visuomotor Control
Xi Chen, Ali Ghadirzadeh, Mårten Björkman, Patric Jensfelt
Deep reinforcement learning (RL) has enabled training action-selection policies, end-to-end, by learning a function which maps image pixels to action outputs. However, it's application to visuomotor robotic policy training has been limited because of the challenge of large-scale data collection when working with physical hardware. A suitable visuomotor policy should perform well not just for the task-setup it has been trained for, but also for all varieties of the task, including novel objects at different viewpoints surrounded by task-irrelevant objects. However, it is impractical for a robotic setup to sufficiently collect interactive samples in a RL framework to generalize well to novel aspects of a task. In this work, we demonstrate that by using adversarial training for domain transfer, it is possible to train visuomotor policies based on RL frameworks, and then transfer the acquired policy to other novel task domains. We propose to leverage the deep RL capabilities to learn complex visuomotor skills for uncomplicated task...
more | pdf | html
Figures
None.
Tweets
arxiv_cs_LG: Adversarial Feature Training for Generalizable Robotic Visuomotor Control. Xi Chen, Ali Ghadirzadeh, Mårten Björkman, and Patric Jensfelt https://t.co/mbKAFeWOW6
arxiv_cscv: Adversarial Feature Training for Generalizable Robotic Visuomotor Control https://t.co/oVnDbdMxjm
arxiv_cscv: Adversarial Feature Training for Generalizable Robotic Visuomotor Control https://t.co/oVnDbdMxjm
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.004 Mikeys
#10. Learning to Manipulate Object Collections Using Grounded State Representations
Matthew Wilson, Tucker Hermans
We propose a method for sim-to-real robot learning which exploits simulator state information in a way that scales to many objects. First, we train a pair of encoders on raw object pose targets to learn representations that accurately capture the state information of a multi-object environment. Second, we use these encoders in a reinforcement learning algorithm to train image-based policies capable of manipulating many objects. Our pair of encoders consists of one which consumes RGB images and is used in our policy network, and one which directly consumes a set of raw object poses and is used for reward calculation and value estimation. We evaluate our method on the task of pushing a collection of objects to desired tabletop regions. Compared to methods which rely only on images or use fixed-length state encodings, our method achieves higher success rates, performs well in the real world without fine tuning, and generalizes to different numbers and types of objects not seen during training.
more | pdf | html
Figures
Tweets
BrundageBot: Learning to Manipulate Object Collections Using Grounded State Representations. Matthew Wilson and Tucker Hermans https://t.co/MXZgOTIP2G
arxiv_cs_LG: Learning to Manipulate Object Collections Using Grounded State Representations. Matthew Wilson and Tucker Hermans https://t.co/gIKtU1fNXY
matwilso: How to learn policies that can reason about and manipulate many objects simultaneously? New sim-to-real manipulation paper, accepted to Conference on Robot Learning (CoRL 2019)! w/ Tucker Hermans pdf: https://t.co/rAObiOmtNr website: https://t.co/MdglEODqUz https://t.co/16vDx4suu6
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 8230
Unqiue Words: 2578

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 192,914 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 192,914 papers.