### Top 10 Arxiv Papers Today in Computer Vision And Pattern Recognition

##### #1. FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing
###### Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, Yu Qiao
Recently, convolutional neural networks (CNNs) have achieved great improvements in single image dehazing and attained much attention in research. Most existing learning-based dehazing methods are not fully end-to-end, which still follow the traditional dehazing procedure: first estimate the medium transmission and the atmospheric light, then recover the haze-free image based on the atmospheric scattering model. However, in practice, due to lack of priors and constraints, it is hard to precisely estimate these intermediate parameters. Inaccurate estimation further degrades the performance of dehazing, resulting in artifacts, color distortion and insufficient haze removal. To address this, we propose a fully end-to-end Generative Adversarial Networks with Fusion-discriminator (FD-GAN) for image dehazing. With the proposed Fusion-discriminator which takes frequency information as additional priors, our model can generator more natural and realistic dehazed images with less color distortion and fewer artifacts. Moreover, we synthesize...
more | pdf | html
None.
###### Tweets
BrundageBot: FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing. Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, and Yu Qiao https://t.co/HGThVFOIA6
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

##### #2. A hybrid algorithm for disparity calculation from sparse disparity estimates based on stereo vision
###### Subhayan Mukherjee, Ram Mohana Reddy Guddeti
In this paper, we have proposed a novel method for stereo disparity estimation by combining the existing methods of block based and region based stereo matching. Our method can generate dense disparity maps from disparity measurements of only 18% pixels of either the left or the right image of a stereo image pair. It works by segmenting the lightness values of image pixels using a fast implementation of K-Means clustering. It then refines those segment boundaries by morphological filtering and connected components analysis, thus removing a lot of redundant boundary pixels. This is followed by determining the boundaries' disparities by the SAD cost function. Lastly, we reconstruct the entire disparity map of the scene from the boundaries' disparities through disparity propagation along the scan lines and disparity prediction of regions of uncertainty by considering disparities of the neighboring regions. Experimental results on the Middlebury stereo vision dataset demonstrate that the proposed method outperforms traditional...
more | pdf | html
None.
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

##### #3. SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation
###### Jesse Sun, Fatemeh Darbeha, Mark Zaidi, Bo Wang
Medical image segmentation is a difficult but important task for many clinical operations such as cardiac bi-ventricular volume estimation. More recently, there has been a shift to utilizing deep learning and fully convolutional neural networks (CNNs) to perform image segmentation that has yielded state-of-the-art results in many public benchmark datasets. Despite the progress of deep learning in medical image segmentation, standard CNNs are still not fully adopted in clinical settings as they lack robustness and interpretability. Shapes are generally more meaningful features than solely textures of images, which are features regular CNNs learn, causing a lack of robustness. Likewise, previous works surrounding model interpretability have been focused on post hoc gradient-based saliency methods. However, gradient-based saliency methods typically require additional computations post hoc and have been shown to be unreliable for interpretability. Thus, we present a new architecture called Shape Attentive U-Net (SAUNet) which focuses...
more | pdf | html
None.
###### Tweets
arxivml: "SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation", Jesse Sun, Fatemeh Darbeha, Mark Zaid… https://t.co/jD5it3CsYS
arxiv_cs_cv_pr: SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation. Jesse Sun, Fatemeh Darbeha, Mark Zaidi, and Bo Wang https://t.co/WgwUmMfGjY
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #4. G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data
###### Chao Zhang, Xuequan Lu, Katsuya Hotta, Xi Yang
In this paper we attempt to address the problem of geometric multi-model fitting with resorting to a few weakly annotated (WA) data points, which has been sparsely studied so far. In weak annotating, most of the manual annotations are supposed to be correct yet inevitably mixed with incorrect ones. The WA data can be naturally obtained in an interactive way for specific tasks, for example, in the case of homography estimation, one can easily annotate points on the same plane/object with a single label by observing the image. Motivated by this, we propose a novel method to make full use of the WA data to boost the multi-model fitting performance. Specifically, a graph for model proposal sampling is first constructed using the WA data, given the prior that the WA data annotated with the same weak label has a high probability of being assigned to the same model. By incorporating this prior knowledge into the calculation of edge probabilities, vertices (i.e., data points) lie on/near the latent model are likely to connect together and...
more | pdf | html
None.
###### Tweets
arxiv_cscv: G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data https://t.co/xnOGSl3jqP
arxiv_cscv: G2MF-WA: Geometric Multi-Model Fitting with Weakly Annotated Data https://t.co/xnOGSlkUin
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #5. Multimodal Deep Unfolding for Guided Image Super-Resolution
###### Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis, Nikos Deligiannis
The reconstruction of a high resolution image given a low resolution observation is an ill-posed inverse problem in imaging. Deep learning methods rely on training data to learn an end-to-end mapping from a low-resolution input to a high-resolution output. Unlike existing deep multimodal models that do not incorporate domain knowledge about the problem, we propose a multimodal deep learning design that incorporates sparse priors and allows the effective integration of information from another image modality into the network architecture. Our solution relies on a novel deep unfolding operator, performing steps similar to an iterative algorithm for convolutional sparse coding with side information; therefore, the proposed neural network is interpretable by design. The deep unfolding architecture is used as a core component of a multimodal framework for guided image super-resolution. An alternative multimodal design is investigated by employing residual learning to improve the training efficiency. The presented multimodal approach is...
more | pdf | html
None.
###### Tweets
arxivml: "Multimodal Deep Unfolding for Guided Image Super-Resolution", Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis… https://t.co/FXEu7WUJHz
arxiv_cs_cv_pr: Multimodal Deep Unfolding for Guided Image Super-Resolution. Iman Marivani, Evaggelia Tsiligianni, Bruno Cornelis, and Nikos Deligiannis https://t.co/TDKvOBG4hv
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #6. P$^2$-GAN: Efficient Style Transfer Using Single Style Image
###### Zhentan Zheng, Jianyi Liu
Style transfer is a useful image synthesis technique that can re-render given image into another artistic style while preserving its content information. Generative Adversarial Network (GAN) is a widely adopted framework toward this task for its better representation ability on local style patterns than the traditional Gram-matrix based methods. However, most previous methods rely on sufficient amount of pre-collected style images to train the model. In this paper, a novel Patch Permutation GAN (P$^2$-GAN) network that can efficiently learn the stroke style from a single style image is proposed. We use patch permutation to generate multiple training samples from the given style image. A patch discriminator that can simultaneously process patch-wise images and natural images seamlessly is designed. We also propose a local texture descriptor based criterion to quantitatively evaluate the style transfer quality. Experimental results showed that our method can produce finer quality re-renderings from single style image with improved...
more | pdf | html
None.
###### Tweets
ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.co/GxplAOvIyE
arxivml: "P$^2$-GAN: Efficient Style Transfer Using Single Style Image", Zhentan Zheng, Jianyi Liu https://t.co/4lU8XU3Kys
arxiv_cs_cv_pr: P$^2$-GAN: Efficient Style Transfer Using Single Style Image. Zhentan Zheng and Jianyi Liu https://t.co/jd073dJNTH
arxiv_cscv: P$^2$-GAN: Efficient Style Transfer Using Single Style Image https://t.co/lwGk0b2M9L
arxiv_cscv: P$^2$-GAN: Efficient Style Transfer Using Single Style Image https://t.co/lwGk0b2M9L
ceobillionaire: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
hrs1985: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
sato_neet: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
KouroshMeshgi: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
jp_axs4ll: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
shimoke4869: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
KenzenAccount: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
UkiwhY: RT @ak92501: P2-GAN: Efficient Style Transfer Using Single Style Image pdf: https://t.co/RweFgtn5Wn abs: https://t.co/xGPWWHHOrx https://t.…
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

##### #7. Detecting Face2Face Facial Reenactment in Videos
###### Prabhat Kumar, Mayank Vatsa, Richa Singh
Visual content has become the primary source of information, as evident in the billions of images and videos, shared and uploaded on the Internet every single day. This has led to an increase in alterations in images and videos to make them more informative and eye-catching for the viewers worldwide. Some of these alterations are simple, like copy-move, and are easily detectable, while other sophisticated alterations like reenactment based DeepFakes are hard to detect. Reenactment alterations allow the source to change the target expressions and create photo-realistic images and videos. While technology can be potentially used for several applications, the malicious usage of automatic reenactment has a very large social implication. It is therefore important to develop detection techniques to distinguish real images and videos with the altered ones. This research proposes a learning-based algorithm for detecting reenactment based alterations. The proposed algorithm uses a multi-stream network that learns regional artifacts and...
more | pdf | html
None.
###### Tweets
arxivml: "Detecting Face2Face Facial Reenactment in Videos", Prabhat Kumar, Mayank Vatsa, Richa Singh https://t.co/5J8w20HAD0
arxiv_cs_cv_pr: Detecting Face2Face Facial Reenactment in Videos. Prabhat Kumar, Mayank Vatsa, and Richa Singh https://t.co/NVRubR54rw
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

##### #8. Learning Diverse Features with Part-Level Resolution for Person Re-Identification
###### Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, Ming Li
Learning diverse features is key to the success of person re-identification. Various part-based methods have been extensively proposed for learning local representations, which, however, are still inferior to the best-performing methods for person re-identification. This paper proposes to construct a strong lightweight network architecture, termed PLR-OSNet, based on the idea of Part-Level feature Resolution over the Omni-Scale Network (OSNet) for achieving feature diversity. The proposed PLR-OSNet has two branches, one branch for global feature representation and the other branch for local feature representation. The local branch employs a uniform partition strategy for part-level feature resolution but produces only a single identity-prediction loss, which is in sharp contrast to the existing part-based methods. Empirical evidence demonstrates that the proposed PLR-OSNet achieves state-of-the-art performance on popular person Re-ID datasets, including Market1501, DukeMTMC-reID and CUHK03, despite its small model size.
more | pdf | html
None.
###### Tweets
arxivml: "Learning Diverse Features with Part-Level Resolution for Person Re-Identification", Ben Xie, Xiaofu Wu, Suofei Zha… https://t.co/w8XHYqTid6
SciFi: Learning Diverse Features with Part-Level Resolution for Person Re-Identification. https://t.co/ceZxzsJrY7
arxiv_cs_cv_pr: Learning Diverse Features with Part-Level Resolution for Person Re-Identification. Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, and Ming Li https://t.co/dnFPIYV79g
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

##### #9. Evaluating Weakly Supervised Object Localization Methods Right
###### Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk Chun, Zeynep Akata, Hyunjung Shim
Weakly-supervised object localization (WSOL) has gained popularity over the last years for its promise to train localization models with only image-level labels. Since the seminal WSOL work of class activation mapping (CAM), the field has focused on how to expand the attention regions to cover objects more broadly and localize them better. However, these strategies rely on full localization supervision to validate hyperparameters and for model selection, which is in principle prohibited under the WSOL setup. In this paper, we argue that WSOL task is ill-posed with only image-level labels, and propose a new evaluation protocol where full supervision is limited to only a small held-out set not overlapping with the test set. We observe that, under our protocol, the five most recent WSOL methods have not made a major improvement over the CAM baseline. Moreover, we report that existing WSOL methods have not reached the few-shot learning baseline, where the full-supervision at validation time is used for model training instead. Based on...
more | pdf | html
None.
###### Tweets
arxivml: "Evaluating Weakly Supervised Object Localization Methods Right", Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk… https://t.co/ti7msBRnTI
Memoirs: Evaluating Weakly Supervised Object Localization Methods Right. https://t.co/r6vPvznb8B
JungWooHa2: New fair #WSOL work of Clova AI was released with the code and the data! Congrats to @junsukchoe, @Joon09098593, @SanghyukChun ! - arXiv: https://t.co/DaH4NGwByp -Youtube: https://t.co/esQJbkYwej - Reddit: https://t.co/oA1k8y5UAI -GitHub: https://t.co/NoiIzTY7FY
arxiv_cs_cv_pr: Evaluating Weakly Supervised Object Localization Methods Right. Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk Chun, Zeynep Akata, and Hyunjung Shim https://t.co/i1nFYuF3Qi
adamoprogresso: RT @Memoirs: Evaluating Weakly Supervised Object Localization Methods Right. https://t.co/r6vPvznb8B
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

##### #10. Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image
###### Subhayan Mukherjee, Guan-Ming Su, Irene Cheng
High Dynamic Range (HDR) imaging is gaining increased attention due to its realistic content, for not only regular displays but also smartphones. Before sufficient HDR content is distributed, HDR visualization still relies mostly on converting Standard Dynamic Range (SDR) content. SDR images are often quantized, or bit depth reduced, before SDR-to-HDR conversion, e.g. for video transmission. Quantization can easily lead to banding artefacts. In some computing and/or memory I/O limited environment, the traditional solution using spatial neighborhood information is not feasible. Our method includes noise generation (offline) and noise injection (online), and operates on pixels of the quantized image. We vary the magnitude and structure of the noise pattern adaptively based on the luma of the quantized pixel and the slope of the inverse-tone mapping function. Subjective user evaluations confirm the superior performance of our technique.
more | pdf | html
None.
###### Tweets
arxiv_cscv: Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image https://t.co/671SLz1Yto
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 257,111 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 257,111 papers.