Top 10 Arxiv Papers Today in Computer Vision And Pattern Recognition


0.0 Mikeys
#1. FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing
Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, Yu Qiao
Recently, convolutional neural networks (CNNs) have achieved great improvements in single image dehazing and attained much attention in research. Most existing learning-based dehazing methods are not fully end-to-end, which still follow the traditional dehazing procedure: first estimate the medium transmission and the atmospheric light, then recover the haze-free image based on the atmospheric scattering model. However, in practice, due to lack of priors and constraints, it is hard to precisely estimate these intermediate parameters. Inaccurate estimation further degrades the performance of dehazing, resulting in artifacts, color distortion and insufficient haze removal. To address this, we propose a fully end-to-end Generative Adversarial Networks with Fusion-discriminator (FD-GAN) for image dehazing. With the proposed Fusion-discriminator which takes frequency information as additional priors, our model can generator more natural and realistic dehazed images with less color distortion and fewer artifacts. Moreover, we synthesize...
more | pdf | html
Figures
None.
Tweets
BrundageBot: FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing. Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, and Yu Qiao https://t.co/HGThVFOIA6
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#2. A hybrid algorithm for disparity calculation from sparse disparity estimates based on stereo vision
Subhayan Mukherjee, Ram Mohana Reddy Guddeti
In this paper, we have proposed a novel method for stereo disparity estimation by combining the existing methods of block based and region based stereo matching. Our method can generate dense disparity maps from disparity measurements of only 18% pixels of either the left or the right image of a stereo image pair. It works by segmenting the lightness values of image pixels using a fast implementation of K-Means clustering. It then refines those segment boundaries by morphological filtering and connected components analysis, thus removing a lot of redundant boundary pixels. This is followed by determining the boundaries' disparities by the SAD cost function. Lastly, we reconstruct the entire disparity map of the scene from the boundaries' disparities through disparity propagation along the scan lines and disparity prediction of regions of uncertainty by considering disparities of the neighboring regions. Experimental results on the Middlebury stereo vision dataset demonstrate that the proposed method outperforms traditional...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#3. SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation
Jesse Sun, Fatemeh Darbeha, Mark Zaidi, Bo Wang
Medical image segmentation is a difficult but important task for many clinical operations such as cardiac bi-ventricular volume estimation. More recently, there has been a shift to utilizing deep learning and fully convolutional neural networks (CNNs) to perform image segmentation that has yielded state-of-the-art results in many public benchmark datasets. Despite the progress of deep learning in medical image segmentation, standard CNNs are still not fully adopted in clinical settings as they lack robustness and interpretability. Shapes are generally more meaningful features than solely textures of images, which are features regular CNNs learn, causing a lack of robustness. Likewise, previous works surrounding model interpretability have been focused on post hoc gradient-based saliency methods. However, gradient-based saliency methods typically require additional computations post hoc and have been shown to be unreliable for interpretability. Thus, we present a new architecture called Shape Attentive U-Net (SAUNet) which focuses...
more | pdf | html
Figures
None.
Tweets
arxivml: "SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation", Jesse Sun, Fatemeh Darbeha, Mark Zaid… https://t.co/jD5it3CsYS
arxiv_cs_cv_pr: SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation. Jesse Sun, Fatemeh Darbeha, Mark Zaidi, and Bo Wang https://t.co/WgwUmMfGjY
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#4. G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data
Chao Zhang, Xuequan Lu, Katsuya Hotta, Xi Yang
In this paper we attempt to address the problem of geometric multi-model fitting with resorting to a few weakly annotated (WA) data points, which has been sparsely studied so far. In weak annotating, most of the manual annotations are supposed to be correct yet inevitably mixed with incorrect ones. The WA data can be naturally obtained in an interactive way for specific tasks, for example, in the case of homography estimation, one can easily annotate points on the same plane/object with a single label by observing the image. Motivated by this, we propose a novel method to make full use of the WA data to boost the multi-model fitting performance. Specifically, a graph for model proposal sampling is first constructed using the WA data, given the prior that the WA data annotated with the same weak label has a high probability of being assigned to the same model. By incorporating this prior knowledge into the calculation of edge probabilities, vertices (i.e., data points) lie on/near the latent model are likely to connect together and...
more | pdf | html
Figures
None.
Tweets
arxiv_cscv: G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data https://t.co/xnOGSl3jqP
arxiv_cscv: G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data https://t.co/xnOGSlkUin
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#5. Multimodal Deep Unfolding for Guided Image Super-Resolution
Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis, Nikos Deligiannis
The reconstruction of a high resolution image given a low resolution observation is an ill-posed inverse problem in imaging. Deep learning methods rely on training data to learn an end-to-end mapping from a low-resolution input to a high-resolution output. Unlike existing deep multimodal models that do not incorporate domain knowledge about the problem, we propose a multimodal deep learning design that incorporates sparse priors and allows the effective integration of information from another image modality into the network architecture. Our solution relies on a novel deep unfolding operator, performing steps similar to an iterative algorithm for convolutional sparse coding with side information; therefore, the proposed neural network is interpretable by design. The deep unfolding architecture is used as a core component of a multimodal framework for guided image super-resolution. An alternative multimodal design is investigated by employing residual learning to improve the training efficiency. The presented multimodal approach is...
more | pdf | html
Figures
None.
Tweets
arxivml: "Multimodal Deep Unfolding for Guided Image Super-Resolution", Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis… https://t.co/FXEu7WUJHz
arxiv_cs_cv_pr: Multimodal Deep Unfolding for Guided Image Super-Resolution. Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis, and Nikos Deligiannis https://t.co/TDKvOBG4hv
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#6. P$^2$-GAN: Efficient Style Transfer Using Single Style Image
Zhentan Zheng, Jianyi Liu
Style transfer is a useful image synthesis technique that can re-render given image into another artistic style while preserving its content information. Generative Adversarial Network (GAN) is a widely adopted framework toward this task for its better representation ability on local style patterns than the traditional Gram-matrix based methods. However, most previous methods rely on sufficient amount of pre-collected style images to train the model. In this paper, a novel Patch Permutation GAN (P$^2$-GAN) network that can efficiently learn the stroke style from a single style image is proposed. We use patch permutation to generate multiple training samples from the given style image. A patch discriminator that can simultaneously process patch-wise images and natural images seamlessly is designed. We also propose a local texture descriptor based criterion to quantitatively evaluate the style transfer quality. Experimental results showed that our method can produce finer quality re-renderings from single style image with improved...
more | pdf | html
Figures
None.
Tweets
ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.co/GxplAOvIyE
arxivml: "P$^2$-GAN: Efficient Style Transfer Using Single Style Image", Zhentan Zheng, Jianyi Liu https://t.co/4lU8XU3Kys
arxiv_cs_cv_pr: P$^2$-GAN: Efficient Style Transfer Using Single Style Image. Zhentan Zheng and Jianyi Liu https://t.co/jd073dJNTH
arxiv_cscv: P$^2$-GAN: Efficient Style Transfer Using Single Style Image https://t.co/lwGk0b2M9L
arxiv_cscv: P$^2$-GAN: Efficient Style Transfer Using Single Style Image https://t.co/lwGk0b2M9L
ceobillionaire: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
hrs1985: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
sato_neet: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
KouroshMeshgi: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
jp_axs4ll: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
shimoke4869: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
KenzenAccount: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
UkiwhY: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#7. Detecting Face2Face Facial Reenactment in Videos
Prabhat Kumar, Mayank Vatsa, Richa Singh
Visual content has become the primary source of information, as evident in the billions of images and videos, shared and uploaded on the Internet every single day. This has led to an increase in alterations in images and videos to make them more informative and eye-catching for the viewers worldwide. Some of these alterations are simple, like copy-move, and are easily detectable, while other sophisticated alterations like reenactment based DeepFakes are hard to detect. Reenactment alterations allow the source to change the target expressions and create photo-realistic images and videos. While technology can be potentially used for several applications, the malicious usage of automatic reenactment has a very large social implication. It is therefore important to develop detection techniques to distinguish real images and videos with the altered ones. This research proposes a learning-based algorithm for detecting reenactment based alterations. The proposed algorithm uses a multi-stream network that learns regional artifacts and...
more | pdf | html
Figures
None.
Tweets
arxivml: "Detecting Face2Face Facial Reenactment in Videos", Prabhat Kumar, Mayank Vatsa, Richa Singh https://t.co/5J8w20HAD0
arxiv_cs_cv_pr: Detecting Face2Face Facial Reenactment in Videos. Prabhat Kumar, Mayank Vatsa, and Richa Singh https://t.co/NVRubR54rw
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#8. Learning Diverse Features with Part-Level Resolution for Person Re-Identification
Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, Ming Li
Learning diverse features is key to the success of person re-identification. Various part-based methods have been extensively proposed for learning local representations, which, however, are still inferior to the best-performing methods for person re-identification. This paper proposes to construct a strong lightweight network architecture, termed PLR-OSNet, based on the idea of Part-Level feature Resolution over the Omni-Scale Network (OSNet) for achieving feature diversity. The proposed PLR-OSNet has two branches, one branch for global feature representation and the other branch for local feature representation. The local branch employs a uniform partition strategy for part-level feature resolution but produces only a single identity-prediction loss, which is in sharp contrast to the existing part-based methods. Empirical evidence demonstrates that the proposed PLR-OSNet achieves state-of-the-art performance on popular person Re-ID datasets, including Market1501, DukeMTMC-reID and CUHK03, despite its small model size.
more | pdf | html
Figures
None.
Tweets
arxivml: "Learning Diverse Features with Part-Level Resolution for Person Re-Identification", Ben Xie, Xiaofu Wu, Suofei Zha… https://t.co/w8XHYqTid6
SciFi: Learning Diverse Features with Part-Level Resolution for Person Re-Identification. https://t.co/ceZxzsJrY7
arxiv_cs_cv_pr: Learning Diverse Features with Part-Level Resolution for Person Re-Identification. Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, and Ming Li https://t.co/dnFPIYV79g
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#9. Evaluating Weakly Supervised Object Localization Methods Right
Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk Chun, Zeynep Akata, Hyunjung Shim
Weakly-supervised object localization (WSOL) has gained popularity over the last years for its promise to train localization models with only image-level labels. Since the seminal WSOL work of class activation mapping (CAM), the field has focused on how to expand the attention regions to cover objects more broadly and localize them better. However, these strategies rely on full localization supervision to validate hyperparameters and for model selection, which is in principle prohibited under the WSOL setup. In this paper, we argue that WSOL task is ill-posed with only image-level labels, and propose a new evaluation protocol where full supervision is limited to only a small held-out set not overlapping with the test set. We observe that, under our protocol, the five most recent WSOL methods have not made a major improvement over the CAM baseline. Moreover, we report that existing WSOL methods have not reached the few-shot learning baseline, where the full-supervision at validation time is used for model training instead. Based on...
more | pdf | html
Figures
None.
Tweets
arxivml: "Evaluating Weakly Supervised Object Localization Methods Right", Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk… https://t.co/ti7msBRnTI
Memoirs: Evaluating Weakly Supervised Object Localization Methods Right. https://t.co/r6vPvznb8B
JungWooHa2: New fair #WSOL work of Clova AI was released with the code and the data! Congrats to @junsukchoe, @Joon09098593, @SanghyukChun ! - arXiv: https://t.co/DaH4NGwByp -Youtube: https://t.co/esQJbkYwej - Reddit: https://t.co/oA1k8y5UAI -GitHub: https://t.co/NoiIzTY7FY
arxiv_cs_cv_pr: Evaluating Weakly Supervised Object Localization Methods Right. Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk Chun, Zeynep Akata, and Hyunjung Shim https://t.co/i1nFYuF3Qi
adamoprogresso: RT @Memoirs: Evaluating Weakly Supervised Object Localization Methods Right. https://t.co/r6vPvznb8B
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#10. Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image
Subhayan Mukherjee, Guan-Ming Su, Irene Cheng
High Dynamic Range (HDR) imaging is gaining increased attention due to its realistic content, for not only regular displays but also smartphones. Before sufficient HDR content is distributed, HDR visualization still relies mostly on converting Standard Dynamic Range (SDR) content. SDR images are often quantized, or bit depth reduced, before SDR-to-HDR conversion, e.g. for video transmission. Quantization can easily lead to banding artefacts. In some computing and/or memory I/O limited environment, the traditional solution using spatial neighborhood information is not feasible. Our method includes noise generation (offline) and noise injection (online), and operates on pixels of the quantized image. We vary the magnitude and structure of the noise pattern adaptively based on the luma of the quantized pixel and the slope of the inverse-tone mapping function. Subjective user evaluations confirm the superior performance of our technique.
more | pdf | html
Figures
None.
Tweets
arxiv_cscv: Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image https://t.co/671SLz1Yto
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 257,111 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 257,111 papers.