Top 10 Arxiv Papers Today in Computer Vision And Pattern Recognition


2.92 Mikeys
#1. Efficient Video Generation on Complex Datasets
Aidan Clark, Jeff Donahue, Karen Simonyan
Generative models of natural images have progressed towards high fidelity samples by the strong leveraging of scale. We attempt to carry this success to the field of video modeling by showing that large Generative Adversarial Networks trained on the complex Kinetics-600 dataset are able to produce video samples of substantially higher complexity than previous work. Our proposed network, Dual Video Discriminator GAN (DVD-GAN), scales to longer and higher resolution videos by leveraging a computationally efficient decomposition of its discriminator. We evaluate on the related tasks of video synthesis and video prediction, and achieve new state of the art Frechet Inception Distance on prediction for Kinetics-600, as well as state of the art Inception Score for synthesis on the UCF-101 dataset, alongside establishing a number of strong baselines on Kinetics-600.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Efficient Video Generation on Complex Datasets. Aidan Clark, Jeff Donahue, and Karen Simonyan https://t.co/vD1qhGuRpg
udoooom: DVD-GAN読んでなるほどって言いながら涙流してる https://t.co/SPyWgf4BBA
arxiv_cs_LG: Efficient Video Generation on Complex Datasets. Aidan Clark, Jeff Donahue, and Karen Simonyan https://t.co/MyjEhgkRX2
StatsPapers: Efficient Video Generation on Complex Datasets. https://t.co/JYANzYN2UC
arxiv_cscv: Efficient Video Generation on Complex Datasets https://t.co/zOKhqPBy6m
Miles_Brundage: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
ceobillionaire: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
negar_rz: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
cvondrick: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
HirokatuKataoka: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
nazifberat: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
mosko_mule: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
abhshkdz: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
ayirpelle: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
mawsonguy: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
yshhrknmr: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
maggie_albrecht: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
blockchen0x: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
KrAbhinavGupta: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
chaosgonewrong: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
MummyComic: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
xuetal: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
talia_konkle: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
peacefulcyborg: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
nova77t: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
ytamaazousti: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
StevenDakin: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
AnantaTama: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
bhargavbardipur: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
iamknighton: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
jbohnslav: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
AISC_TO: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
alessandroleite: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
yassersouri: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
ZimMatthias: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
mathwrath: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
cighos: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
GuptaRajat033: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
samdutter: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
whitesiro1107: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Lefiish: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Xelfor: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
KrishaMehta2: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
eree_bay: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
alsombra7: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
matt_vowels: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
MohammadOtoofi: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
lishuai800: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
MC_Bezuidenhout: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
gabeibagon: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
adn_twitts: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
yktktcy: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
d10nator: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
swapp19902: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Missuaedz: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
khushjammu: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
oskaus: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
HuiHsuX: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
7VLZ7: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
qkisw: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Nimashiri: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
dmn001: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Tsingggg: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
iamx9000: RT @roadrunning01: Efficient Video Generation on Complex Datasets pdf: https://t.co/ngwdxDK42E abs: https://t.co/WhfJKvKtLG https://t.co/Yl…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.162 Mikeys
#2. Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis
Yueming Jin, Huaxia Li, Qi Dou, Hao Chen, Jing Qin, Chi-Wing Fu, Pheng-Ann Heng
Surgical tool presence detection and surgical phase recognition are two fundamental yet challenging tasks in surgical video analysis and also very essential components in various applications in modern operating rooms. While these two analysis tasks are highly correlated in clinical practice as the surgical process is well-defined, most previous methods tackled them separately, without making full use of their relatedness. In this paper, we present a novel method by developing a multi-task recurrent convolutional network with correlation loss (MTRCNet-CL) to exploit their relatedness to simultaneously boost the performance of both tasks. Specifically, our proposed MTRCNet-CL model has an end-to-end architecture with two branches, which share earlier feature encoders to extract general visual features while holding respective higher layers targeting for specific tasks. Given that temporal information is crucial for phase recognition, long-short term memory (LSTM) is explored to model the sequential dependencies in the phase...
more | pdf | html
Figures
Tweets
BrundageBot: Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis. Yueming Jin, Huaxia Li, Qi Dou, Hao Chen, Jing Qin, Chi-Wing Fu, and Pheng-Ann Heng https://t.co/scQ7IdI5HA
arxiv_cs_LG: Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis. Yueming Jin, Huaxia Li, Qi Dou, Hao Chen, Jing Qin, Chi-Wing Fu, and Pheng-Ann Heng https://t.co/xjhY7QxjkZ
Memoirs: Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis. https://t.co/KUNdrNVQ46
arxiv_cscv: Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis https://t.co/RNglVYHboy
Github
Repository: MTRCNet-CL
User: YuemingJin
Language: Python
Stargazers: 0
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 13163
Unqiue Words: 3246

2.161 Mikeys
#3. Understanding Deep Learning Techniques for Image Segmentation
Swarnendu Ghosh, Nibaran Das, Ishita Das, Ujjwal Maulik
The machine learning community has been overwhelmed by a plethora of deep learning based approaches. Many challenging computer vision tasks such as detection, localization, recognition and segmentation of objects in unconstrained environment are being efficiently addressed by various types of deep neural networks like convolutional neural networks, recurrent networks, adversarial networks, autoencoders and so on. While there have been plenty of analytical studies regarding the object detection or recognition domain, many new deep learning techniques have surfaced with respect to image segmentation techniques. This paper approaches these various deep learning techniques of image segmentation from an analytical perspective. The main goal of this work is to provide an intuitive understanding of the major techniques that has made significant contribution to the image segmentation domain. Starting from some of the traditional image segmentation approaches, the paper progresses describing the effect deep learning had on the image...
more | pdf | html
Figures
Tweets
BrundageBot: Understanding Deep Learning Techniques for Image Segmentation. Swarnendu Ghosh, Nibaran Das, Ishita Das, and Ujjwal Maulik https://t.co/AhbxlTbX1h
arxivml: "Understanding Deep Learning Techniques for Image Segmentation", Swarnendu Ghosh, Nibaran Das, Ishita Das, Ujjwal M… https://t.co/e7qk4Ri7bm
arxiv_cs_LG: Understanding Deep Learning Techniques for Image Segmentation. Swarnendu Ghosh, Nibaran Das, Ishita Das, and Ujjwal Maulik https://t.co/qcwOU4Q2ib
Memoirs: Understanding Deep Learning Techniques for Image Segmentation. https://t.co/drivL8LZPy
disigandalf: RT @arxiv_cscv: Understanding Deep Learning Techniques for Image Segmentation https://t.co/HCx50t3qTW
yuliangxiu: RT @arxiv_cscv: Understanding Deep Learning Techniques for Image Segmentation https://t.co/HCx50t3qTW
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 20440
Unqiue Words: 5252

2.13 Mikeys
#4. Multimodal deep networks for text and image-based document classification
Nicolas Audebert, Catherine Herold, Kuider Slimani, Cédric Vidal
Classification of document images is a critical step for archival of old manuscripts, online subscription and administrative procedures. Computer vision and deep learning have been suggested as a first solution to classify documents based on their visual appearance. However, achieving the fine-grained classification that is required in real-world setting cannot be achieved by visual analysis alone. Often, the relevant information is in the actual text content of the document. We design a multimodal neural network that is able to learn from word embeddings, computed on text extracted by OCR, and from the image. We show that this approach boosts pure image accuracy by 3% on Tobacco3482 and RVL-CDIP augmented by our new QS-OCR text dataset (https://github.com/Quicksign/ocrized-text-dataset), even without clean text information.
more | pdf | html
Figures
Tweets
BrundageBot: Multimodal deep networks for text and image-based document classification. Nicolas Audebert, Catherine Herold, Kuider Slimani, and Cédric Vidal https://t.co/oGO9KGMSTL
Github

Quicksign OCRized Text Dataset (QS-OCR)

Repository: ocrized-text-dataset
User: Quicksign
Language: Python
Stargazers: 4
Subscribers: 5
Forks: 1
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 6367
Unqiue Words: 2322

2.121 Mikeys
#5. FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks
Rohan Lekhwani
Hand pose estimation from monocular depth images has been an important and challenging problem in the Computer Vision community. In this paper, we present a novel approach to estimate 3D hand joint locations from 2D depth images. Unlike most of the previous methods, our model captures the 3D spatial information from a depth image thereby giving it a greater understanding of the input. We voxelize the input depth map to capture the 3D features of the input and perform 3D data augmentations to make our network robust to real-world images. Our network is trained in an end-to-end manner which reduces time and space complexity significantly when compared to other methods. Through extensive experiments, we show that our model outperforms state-of-the-art methods with respect to the time it takes to train and predict 3D hand joint locations. This makes our method more suitable for real-world hand pose estimation scenarios.
more | pdf | html
Figures
None.
Tweets
BrundageBot: FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks. Rohan Lekhwani https://t.co/nWUAGyBRmR
arxiv_cs_LG: FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks. Rohan Lekhwani https://t.co/NFSBjP8rIN
Memoirs: FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks. https://t.co/7h9ti5L3L4
arxiv_cshc: FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks https://t.co/pH6L85sKuD
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

2.116 Mikeys
#6. Recovery Guarantees for Compressible Signals with Adversarial Noise
Jasjeet Dhaliwal, Kyle Hambrook
We provide recovery guarantees for compressible signals that have been corrupted with noise and extend the framework introduced in [1] to defend neural networks against $\ell_0$-norm and $\ell_2$-norm attacks. Concretely, for a signal that is approximately sparse in some transform domain and has been perturbed with noise, we provide guarantees for accurately recovering the signal in the transform domain. We can then use the recovered signal to reconstruct the signal in its original domain while largely removing the noise. Our results are general as they can be directly applied to most unitary transforms used in practice and hold for both $\ell_0$-norm bounded noise and $\ell_2$-norm bounded noise. In the case of $\ell_0$-norm bounded noise, we prove recovery guarantees for Iterative Hard Thresholding (IHT) and Basis Pursuit (BP). For the case of $\ell_2$-norm bounded noise, we provide recovery guarantees for BP. These guarantees theoretically bolster the defense framework introduced in [1] for defending neural networks against...
more | pdf | html
Figures
None.
Tweets
okateim: 2019/07/16 [12] Recovery Guarantees for Compressible Signals with Adversarial Noise (https://t.co/NiGawyLPMJ)
arxiv_cs_LG: Recovery Guarantees for Compressible Signals with Adversarial Noise. Jasjeet Dhaliwal and Kyle Hambrook https://t.co/EhvmkB6agG
StatsPapers: Recovery Guarantees for Compressible Signals with Adversarial Noise. https://t.co/mR7pBYOhbo
arxiv_cscv: Recovery Guarantees for Compressible Signals with Adversarial Noise https://t.co/bFLBxCgODF
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.094 Mikeys
#7. SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds
Minghui Liao, Boyu Song, Minghang He, Shangbang Long, Cong Yao, Xiang Bai
With the development of deep neural networks, the demand for a significant amount of annotated training data becomes the performance bottlenecks in many fields of research and applications. Image synthesis can generate annotated images automatically and freely, which gains increasing attention recently. In this paper, we propose to synthesize scene text images from the 3D virtual worlds, where the precise descriptions of scenes, editable illumination/visibility, and realistic physics are provided. Different from the previous methods which paste the rendered text on static 2D images, our method can render the 3D virtual scene and text instances as an entirety. In this way, complex perspective transforms, various illuminations, and occlusions can be realized in our synthesized scene text images. Moreover, the same text instances with various viewpoints can be produced by randomly moving and rotating the virtual camera, which acts as human eyes. The experiments on the standard scene text detection benchmarks using the generated...
more | pdf | html
Figures
Tweets
BrundageBot: SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds. Minghui Liao, Boyu Song, Minghang He, Shangbang Long, Cong Yao, and Xiang Bai https://t.co/znzugbUy70
Github
Repository: SynthText3D
User: MhLiao
Language: None
Stargazers: 3
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 6165
Unqiue Words: 2016

2.093 Mikeys
#8. FMRI data augmentation via synthesis
Peiye Zhuang, Alexander G. Schwing, Sanmi Koyejo
We present an empirical evaluation of fMRI data augmentation via synthesis. For synthesis we use generative mod-els trained on real neuroimaging data to produce novel task-dependent functional brain images. Analyzed generative mod-els include classic approaches such as the Gaussian mixture model (GMM), and modern implicit generative models such as the generative adversarial network (GAN) and the variational auto-encoder (VAE). In particular, the proposed GAN and VAE models utilize 3-dimensional convolutions, which enables modeling of high-dimensional brain image tensors with structured spatial correlations. The synthesized datasets are then used to augment classifiers designed to predict cognitive and behavioural outcomes. Our results suggest that the proposed models are able to generate high-quality synthetic brain images which are diverse and task-dependent. Perhaps most importantly, the performance improvements of data aug-mentation via synthesis are shown to be complementary to the choice of the predictive model. Thus, our...
more | pdf | html
Figures
None.
Tweets
BrundageBot: FMRI data augmentation via synthesis. Peiye Zhuang, Alexander G. Schwing, and Sanmi Koyejo https://t.co/XqriaX2oNJ
arxiv_cs_LG: FMRI data augmentation via synthesis. Peiye Zhuang, Alexander G. Schwing, and Sanmi Koyejo https://t.co/TBvV136iIj
Memoirs: FMRI data augmentation via synthesis. https://t.co/t15931LtVX
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.088 Mikeys
#9. Using dynamic routing to extract intermediate features for developing scalable capsule networks
Bodhisatwa Mandal, Swarnendu Ghosh, Ritesh Sarkhel, Nibaran Das, Mita Nasipuri
Capsule networks have gained a lot of popularity in short time due to its unique approach to model equivariant class specific properties as capsules from images. However the dynamic routing algorithm comes with a steep computational complexity. In the proposed approach we aim to create scalable versions of the capsule networks that are much faster and provide better accuracy in problems with higher number of classes. By using dynamic routing to extract intermediate features instead of generating output class specific capsules, a large increase in the computational speed has been observed. Moreover, by extracting equivariant feature capsules instead of class specific capsules, the generalization capability of the network has also increased as a result of which there is a boost in accuracy.
more | pdf | html
Figures
None.
Tweets
arxivml: "Using dynamic routing to extract intermediate features for developing scalable capsule networks", Bodhisatwa Manda… https://t.co/fH3Yxbn27R
arxiv_cs_LG: Using dynamic routing to extract intermediate features for developing scalable capsule networks. Bodhisatwa Mandal, Swarnendu Ghosh, Ritesh Sarkhel, Nibaran Das, and Mita Nasipuri https://t.co/OFEqxMf0m1
Memoirs: Using dynamic routing to extract intermediate features for developing scalable capsule networks. https://t.co/wj3hr3hZeS
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.065 Mikeys
#10. Color Cerberus
A. ~Savchik, E. ~Ershov, S. ~Karpenko
Simple convolutional neural network was able to win ISISPA color constancy competition. Partial reimplementation of (Bianco, 2017) neural architecture would have shown even better results in this setup.
more | pdf | html
Figures
Tweets
arxiv_cs_LG: Color Cerberus. A. ~Savchik, E. ~Ershov, and S. ~Karpenko https://t.co/cr7rADDyIk
Memoirs: Color Cerberus. https://t.co/zD0F3k8BQo
muktabh: RT @arxiv_cs_LG: Color Cerberus. A. ~Savchik, E. ~Ershov, and S. ~Karpenko https://t.co/cr7rADDyIk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 2532
Unqiue Words: 1141

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 158,360 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 158,360 papers.