Top 10 Arxiv Papers Today in Computer Vision And Pattern Recognition


2.379 Mikeys
#1. On transfer learning using a MAC model variant
Vincent Marois, T. S. Jayram, Vincent Albouy, Tomasz Kornuta, Younes Bouhadjar, Ahmet S. Ozcan
We introduce a variant of the MAC model (Hudson and Manning, CVPR 2018) with a simplified set of equations that achieves comparable accuracy, while training faster. We evaluate both models on CLEVR and CoGenT, and show that, transfer learning with fine-tuning results in a 15 point increase in accuracy, matching the state of the art. Finally, in contrast, we demonstrate that improper fine-tuning can actually reduce a model's accuracy as well.
more | pdf | html
Figures
Tweets
BrundageBot: On transfer learning using a MAC model variant. Vincent Marois, T. S. Jayram, Vincent Albouy, Tomasz Kornuta, Younes Bouhadjar, and Ahmet S. Ozcan https://t.co/dDqwySI9PU
arxivml: "On transfer learning using a MAC model variant", Vincent Marois, T.S. Jayram, Vincent Albouy, Tomasz Kornuta, Youn… https://t.co/peFrXYb8y9
arxiv_cscv: On transfer learning using a MAC model variant https://t.co/RzyKAKfNxH
Github

Enabling reproducible Machine Learning research

Repository: mi-prometheus
User: IBM
Language: Python
Stargazers: 11
Subscribers: 10
Forks: 7
Open Issues: 30
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 4609
Unqiue Words: 1632

2.172 Mikeys
#2. Guiding the One-to-one Mapping in CycleGAN via Optimal Transport
Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu
CycleGAN is capable of learning a one-to-one mapping between two data distributions without paired examples, achieving the task of unsupervised data translation. However, there is no theoretical guarantee on the property of the learned one-to-one mapping in CycleGAN. In this paper, we experimentally find that, under some circumstances, the one-to-one mapping learned by CycleGAN is just a random one within the large feasible solution space. Based on this observation, we explore to add extra constraints such that the one-to-one mapping is controllable and satisfies more properties related to specific tasks. We propose to solve an optimal transport mapping restrained by a task-specific cost function that reflects the desired properties, and use the barycenters of optimal transport mapping to serve as references for CycleGAN. Our experiments indicate that the proposed algorithm is capable of learning a one-to-one mapping with the desired properties.
more | pdf | html
Figures
Tweets
BrundageBot: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport. Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, and Yong Yu https://t.co/4nKOoGq0Vo
arxivml: "Guiding the One-to-one Mapping in CycleGAN via Optimal Transport", Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren… https://t.co/BXNF1TeRry
SciFi: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport. https://t.co/bFLKp4Iq0a
arxiv_cscv: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport https://t.co/mVoujK4ejz
GIS_Sharer: RT @SciFi: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport. https://t.co/bFLKp4Iq0a
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 6368
Unqiue Words: 1738

2.162 Mikeys
#3. GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition
Hanqing Chao, Yiwei He, Junping Zhang, Jianfeng Feng
As a unique biometric feature that can be recognized at a distance, gait has broad applications in crime prevention, forensic identification and social security. To portray a gait, existing gait recognition methods utilize either a gait template, where temporal information is hard to preserve, or a gait sequence, which must keep unnecessary sequential constraints and thus loses the flexibility of gait recognition. In this paper we present a novel perspective, where a gait is regarded as a set consisting of independent frames. We propose a new network named GaitSet to learn identity information from the set. Based on the set perspective, our method is immune to permutation of frames, and can naturally integrate frames from different videos which have been filmed under different scenarios, such as diverse viewing angles, different clothes/carrying conditions. Experiments show that under normal walking conditions, our single-model method achieves an average rank-1 accuracy of 95.0% on the CASIA-B gait dataset and an 87.1% accuracy on...
more | pdf | html
Figures
Tweets
arxiv_cscv: GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition https://t.co/GdZtYYce8X
arxiv_cscv: GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition https://t.co/GdZtYYtP0v
ComputerPapers: GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition. https://t.co/YPiaKlUdBc
Github

A set-based model for gait recognition.

Repository: GaitSet
User: AbnerHqC
Language: Python
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : [31, 31]
Authors: 4
Total Words: 7012
Unqiue Words: 2240

2.155 Mikeys
#4. Adjusting for Confounding in Unsupervised Latent Representations of Images
Craig A. Glastonbury, Michael Ferlaino, Christoffer Nellåker, Cecilia M. Lindgren
Biological imaging data are often partially confounded or contain unwanted variability. Examples of such phenomena include variable lighting across microscopy image captures, stain intensity variation in histological slides, and batch effects for high throughput drug screening assays. Therefore, to develop "fair" models which generalise well to unseen examples, it is crucial to learn data representations that are insensitive to nuisance factors of variation. In this paper, we present a strategy based on adversarial training, capable of learning unsupervised representations invariant to confounders. As an empirical validation of our method, we use deep convolutional autoencoders to learn unbiased cellular representations from microscopy imaging.
more | pdf | html
Figures
Tweets
BrundageBot: Adjusting for Confounding in Unsupervised Latent Representations of Images. Craig A. Glastonbury, Michael Ferlaino, Christoffer Nellåker, and Cecilia M. Lindgren https://t.co/qqtk486Mlm
C_Glastonbury: @DrAnneCarpenter @michaelferlaino @ceclindgren @CNellaker Here you go! It would be great if our paper could be linked to from the dataset site. https://t.co/yspdrGGfrb
arxiv_cscv: Adjusting for Confounding in Unsupervised Latent Representations of Images https://t.co/ZXswZHrSBk
ComputerPapers: Adjusting for Confounding in Unsupervised Latent Representations of Images. https://t.co/nUsCTc8jzf
C_Glastonbury: RT @arxiv_cscv: Adjusting for Confounding in Unsupervised Latent Representations of Images https://t.co/ZXswZHrSBk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 2887
Unqiue Words: 1319

2.127 Mikeys
#5. Exploring the Deep Feature Space of a Cell Classification Neural Network
Ezra Webb, Cheng Lei, Chun-Jung Huang, Hirofumi Kobayashi, Hideharu Mikami, Keisuke Goda
In this paper, we present contemporary techniques for visualising the feature space of a deep learning image classification neural network. These techniques are viewed in the context of a feed-forward network trained to classify low resolution fluorescence images of white blood cells captured using optofluidic imaging. The model has two output classes corresponding to two different cell types, which are often difficult to distinguish by eye. This paper has two major sections. The first looks to develop the information space presented by dimension reduction techniques, such as t-SNE, used to embed high-dimensional pre-softmax layer activations into a two-dimensional plane. The second section looks at feature visualisation by optimisation to generate feature images representing the learned features of the network. Using and developing these techniques we visualise class separation and structures within the dataset at various depths using clustering algorithms and feature images; track the development of feature complexity as we...
more | pdf | html
Figures
Tweets
arxivml: "Exploring the Deep Feature Space of a Cell Classification Neural Network", Ezra Webb, Cheng Lei, Chun-Jung Huang, … https://t.co/z13k5gOLLT
arxiv_cscv: Exploring the Deep Feature Space of a Cell Classification Neural Network https://t.co/FxKNOtiw6F
arxiv_cscv: Exploring the Deep Feature Space of a Cell Classification Neural Network https://t.co/FxKNOtA6Yd
Soul: Exploring the Deep Feature Space of a Cell Classification Neural Network. https://t.co/6heKlER4JO
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 7479
Unqiue Words: 2192

2.125 Mikeys
#6. A Neurodynamical model of Saliency prediction in V1
David Berga, Xavier Otazu
Computations in the primary visual cortex (area V1 or striate cortex) have long been hypothesized to be responsible, among several visual processing mechanisms, of bottom-up visual attention (also named saliency). In order to validate this hypothesis, images from eye tracking datasets are processed with a biologically plausible model of V1 able to reproduce other visual processes such as brightness, chromatic induction and visual discomfort. Following Li's neurodynamical model, we define V1's lateral connections with a network of firing rate neurons, sensitive to visual features such as brightness, color, orientation and scale. The resulting saliency maps are generated from the model output, representing the neuronal activity of V1 projections towards brain areas involved in eye movement control. Our predictions are supported with eye tracking experimentation and results show an improvement with respect to previous models as well as consistency with human psychophysics. We propose a unified computational architecture of the...
more | pdf | html
Figures
Tweets
arxiv_cscv: A Neurodynamical model of Saliency prediction in V1 https://t.co/EwIAh2Y9VT
ComputerPapers: A Neurodynamical model of Saliency prediction in V1. https://t.co/TntRPJl7b2
Github

Neurodynamical Saliency WAvelet Model

Repository: NSWAM
User: dberga
Language: None
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 8454
Unqiue Words: 3091

2.125 Mikeys
#7. Psychophysical evaluation of individual low-level feature influences on visual attention
David Berga, Xosé Ramón Fdez-Vidal, Xavier Otazu, Víctor Leborán, Xosé María Pardo
In this study we provide the analysis of eye movement behavior elicited by low-level feature distinctiveness with a dataset of synthetically-generated image patterns. Design of visual stimuli was inspired by the ones used in previous psychophysical experiments, namely in free-viewing and visual searching tasks, to provide a total of 15 types of stimuli, divided according to the task and feature to be analyzed. Our interest is to analyze the influences of low-level feature contrast between a salient region and the rest of distractors, providing fixation localization characteristics and reaction time of landing inside the salient region. Eye-tracking data was collected from 34 participants during the viewing of a 230 images dataset. Results show that saliency is predominantly and distinctively influenced by: 1. feature type, 2. feature contrast, 3. temporality of fixations, 4. task difficulty and 5. center bias. This experimentation proposes a new psychophysical basis for saliency model evaluation using synthetic images.
more | pdf | html
Figures
Tweets
arxiv_cscv: Psychophysical evaluation of individual low-level feature influences on visual attention https://t.co/AkbbrnIdOI
ComputerPapers: Psychophysical evaluation of individual low-level feature influences on visual attention. https://t.co/zFgIloQX5u
Github

This is a code for generating synthetic stimuli

Repository: sig4vam
User: dberga
Language: Matlab
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 19350
Unqiue Words: 4547

2.104 Mikeys
#8. From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition
Mojtaba Heidarysafa, James Reed, Kamran Kowsari, April Celeste R. Leviton, Janet I. Warren, Donald E. Brown
Tracking users' activities on the World Wide Web (WWW) allows researchers to analyze each user's internet behavior as time passes and for the amount of time spent on a particular domain. This analysis can be used in research design, as researchers may access to their participant's behaviors while browsing the web. Web search behavior has been a subject of interest because of its real-world applications in marketing, digital advertisement, and identifying potential threats online. In this paper, we present an image-processing based method to extract domains which are visited by a participant over multiple browsers during a lab session. This method could provide another way to collect users' activities during an online session given that the session recorder collected the data. The method can also be used to collect the textual content of web-pages that an individual visits for later analysis
more | pdf | html
Figures
Tweets
arxiv_cscv: From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition https://t.co/Av3W3g25ay
ComputerPapers: From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition. https://t.co/I1g7KB3ew6
Github

extract visited website domain using OCR on screenshots

Repository: OCR-browser-domain-extractor
User: mojtaba-Hsafa
Language: Python
Stargazers: 0
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 4222
Unqiue Words: 1559

2.091 Mikeys
#9. Face Verification and Forgery Detection for Ophthalmic Surgery Images
Kaushal Bhogale, Nishant Shankar, Adheesh Juvekar, Asutosh Padhi
Although modern face verification systems are accessible and accurate, they are not always robust to pose variance and occlusions. Moreover, accurate models require a large amount of data to train. We structure our experiments to operate on small amounts of data obtained from an NGO that funds ophthalmic surgeries. We set up our face verification task as that of verifying pre-operation and post-operation images of a patient that undergoes ophthalmic surgery, and as such the post-operation images have occlusions like an eye patch. In this paper, we present a system that performs the face verification task using one-shot learning. To this end, our paper uses deep convolutional networks and compares different model architectures and loss functions. Our best model achieves 85% test accuracy. During inference time, we also attempt to detect image forgeries in addition to performing face verification. To achieve this, we use Error Level Analysis. Finally, we propose an inference pipeline that demonstrates how these techniques can be...
more | pdf | html
Figures
Tweets
BrundageBot: Face Verification and Forgery Detection for Ophthalmic Surgery Images. Kaushal Bhogale, Nishant Shankar, Adheesh Juvekar, and Asutosh Padhi https://t.co/BdRKB0vmLT
arxiv_cscv: Face Verification and Forgery Detection for Ophthalmic Surgery Images https://t.co/jfEL9lpxtC
ComputerPapers: Face Verification and Forgery Detection for Ophthalmic Surgery Images. https://t.co/O852l9NDbN
Github
None.
Youtube
None.
Other stats
Sample Sizes : [20, 20]
Authors: 4
Total Words: 3784
Unqiue Words: 1433

2.091 Mikeys
#10. Deep Template Matching for Offline Handwritten Chinese Character Recognition
Zhiyuan Li, Min Jin, Qi Wu, Huaxiang Lu
Just like its remarkable achievements in many computer vision tasks, the convolutional neural networks (CNN) provide an end-to-end solution in handwritten Chinese character recognition (HCCR) with great success. However, the process of learning discriminative features for image recognition is difficult in cases where little data is available. In this paper, we propose a novel method for learning siamese neural network which employ a special structure to predict the similarity between handwritten Chinese characters and template images. The optimization of siamese neural network can be treated as a simple binary classification problem. When the training process has been finished, the powerful discriminative features help us to generalize the predictive power not just to new data, but to entirely new classes that never appear in the training set. Experiments performed on the ICDAR-2013 offline HCCR datasets have shown that the proposed method has a very promising generalization ability to the new classes that never appear in the training set.
more | pdf | html
Figures
Tweets
BrundageBot: Deep Template Matching for Offline Handwritten Chinese Character Recognition. Zhiyuan Li, Min Jin, Qi Wu, and Huaxiang Lu https://t.co/6yMZmDuGvg
arxiv_cscv: Deep Template Matching for Offline Handwritten Chinese Character Recognition https://t.co/Nn9SIKorl8
ComputerPapers: Deep Template Matching for Offline Handwritten Chinese Character Recognition. https://t.co/sXjypaL0iD
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 3343
Unqiue Words: 1325

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 57,756 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 57,756 papers.