Speaker diarization has been mainly developed based on the clustering of
speaker embeddings. However, the clustering-based approach has two major
problems; i.e., (i) it is not optimized to minimize diarization errors
directly, and (ii) it cannot handle speaker overlaps correctly. To solve these
problems, the End-to-End Neural Diarization (EEND), in which a bidirectional
long short-term memory (BLSTM) network directly outputs speaker diarization
results given a multi-talker recording, was recently proposed. In this study,
we enhance EEND by introducing self-attention blocks instead of BLSTM blocks.
In contrast to BLSTM, which is conditioned only on its previous and next hidden
states, self-attention is directly conditioned on all the other frames, making
it much suitable for dealing with the speaker diarization problem. We evaluated
our proposed method on simulated mixtures, real telephone calls, and real
dialogue recordings. The experimental results revealed that the self-attention
was the key to achieving good performance and...

more |
pdf
| html
None.

BrundageBot:
End-to-End Neural Speaker Diarization with Self-attention. Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, and Shinji Watanabe https://t.co/KfBZFkUWB1

ballforest:
RT @arxiv_cscl: End-to-End Neural Speaker Diarization with Self-attention https://t.co/1E9YDmseXl

SythonUK:
RT @arxiv_cscl: End-to-End Neural Speaker Diarization with Self-attention https://t.co/1E9YDmseXl

chbalajitilak:
RT @arxiv_cscl: End-to-End Neural Speaker Diarization with Self-attention https://t.co/1E9YDmseXl

None.

None.

Sample Sizes : None.

Authors: 6

Total Words: 0

Unqiue Words: 0

Deep learning has largely reduced the need for manual feature selection in
image segmentation. Nevertheless, network architecture optimization and
hyperparameter tuning are mostly manual and time consuming. Although there are
increasing research efforts on network architecture search in computer vision,
most works concentrate on image classification but not segmentation, and there
are very limited efforts on medical image segmentation especially in 3D. To
remedy this, here we propose a framework, SegNAS3D, for network architecture
search of 3D image segmentation. In this framework, a network architecture
comprises interconnected building blocks that consist of operations such as
convolution and skip connection. By representing the block structure as a
learnable directed acyclic graph, hyperparameters such as the number of feature
channels and the option of using deep supervision can be learned together
through derivative-free global optimization. Experiments on 43 3D brain
magnetic resonance images with 19 structures achieved an...

more |
pdf
| html
BrundageBot:
SegNAS3D: Network Architecture Search with Derivative-Free Global Optimization for 3D Image Segmentation. Ken C. L. Wong and Mehdi Moradi https://t.co/nXZqeB4xZW

arxivml:
"SegNAS3D: Network Architecture Search with Derivative-Free Global Optimization for 3D Image Segmentation",
Ken C． …
https://t.co/BW0FLq3Bxn

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 3399

Unqiue Words: 1209

Background. The image-based identification of distinct tissues within
dermatological wounds enhances patients' care since it requires no intrusive
evaluations. This manuscript presents an approach, we named QTDU, that combines
deep learning models with superpixel-driven segmentation methods for assessing
the quality of tissues from dermatological ulcers.
Method. QTDU consists of a three-stage pipeline for the obtaining of ulcer
segmentation, tissues' labeling, and wounded area quantification. We set up our
approach by using a real and annotated set of dermatological ulcers for
training several deep learning models to the identification of ulcered
superpixels.
Results. Empirical evaluations on 179,572 superpixels divided into four
classes showed QTDU accurately spot wounded tissues (AUC = 0.986, sensitivity =
0.97, and specificity = 0.974) and outperformed machine-learning approaches in
up to 8.2% regarding F1-Score through fine-tuning of a ResNet-based model.
Last, but not least, experimental evaluations also showed QTDU...

more |
pdf
| html
None.

arxivml:
"A superpixel-driven deep learning approach for the analysis of dermatological wounds",
Gustavo Blanco, Agma J． M． …
https://t.co/mNKsOdddVT

arxiv_cs_LG:
A superpixel-driven deep learning approach for the analysis of dermatological wounds. Gustavo Blanco, Agma J. M. Traina, Caetano Traina Jr., Paulo M. Azevedo-Marques, Ana E. S. Jorge, Daniel de Oliveira, and Marcos V. N. Bedo https://t.co/Ym9xEc4rSj

None.

None.

Sample Sizes : None.

Authors: 7

Total Words: 0

Unqiue Words: 0

Deep neural networks are often called black-boxes due to their
difficult-to-interpret decisions. This is characteristic of a deeper trend in
machine learning, where predictive performance typically comes at the cost of
interpretability. In some domains, such as image-based diagnostic tasks,
understanding the reasons behind machine generated predictions is vital in
assessing trust. In this study, we introduce novel designs of capsule networks
to provide explainable diagnoses. Our proposed deep explainable capsule
architecture, called DX-Caps, can encode high-level visual attributes within
the vectors of capsules in order to simultaneously produce malignancy
predictions for lung cancer as well as approximations of six
visually-interpretable attributes, used by radiologists to explain their
predictions. To reduce parameter and memory burden of this deeper network, we
introduce a new capsule-average pooling function. With this simple, but
fundamental addition, capsule networks can be designed in a deeper fashion than
was possible...

more |
pdf
| html
None.

BrundageBot:
Encoding High-Level Visual Attributes in Capsules for Explainable Medical Diagnoses. Rodney LaLonde, Drew Torigian, and Ulas Bagci https://t.co/KlW39bT4Tf

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

We present a novel method for image anomaly detection, where algorithms that
use samples drawn from some distribution of "normal" data, aim to detect
out-of-distribution (abnormal) samples. Our approach includes a combination of
encoder and generator for mapping an image distribution to a predefined latent
distribution and vice versa. It leverages Generative Adversarial Networks to
learn these data distributions and uses perceptual loss for the detection of
image abnormality. To accomplish this goal, we introduce a new similarity
metric, which expresses the perceived similarity between images and is robust
to changes in image contrast. Secondly, we introduce a novel approach for the
selection of weights of a multi-objective loss function (image reconstruction
and distribution mapping) in the absence of a validation dataset for
hyperparameter tuning. After training, our model measures the abnormality of
the input image as the perceptual dissimilarity between it and the closest
generated image of the modeled data distribution. The...

more |
pdf
| html
BrundageBot:
Perceptual Image Anomaly Detection. Nina Tuluptceva, Bart Bakker, Irina Fedulova, and Anton Konushin https://t.co/9l9hfpuS82

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 7989

Unqiue Words: 2326

Any control law for aircraft asymptotic stabilization requires the existence
of an equilibrium condition, also called trim flight condition. At a constant
velocity flight, for instance, there must exist an aircraft orientation such
that aerodynamic forces oppose the plane's thrust plus weight, and the torque
balance equals zero. A closer look at the equations characterizing the trim
conditions point out that the existence of aircraft equilibrium configurations
cannot be in general claimed beforehand. By considering aircraft longitudinal
linear dynamics, this paper shows that the existence of flight trim conditions
is a consequence of the vehicle shape or aerodynamics. These results are
obtained independently from the aircraft flight envelope, and do not require
any explicit expression of the aerodynamics acting on the vehicle.

more |
pdf
| html
DanielePucci:
"Any symmetric aircraft can fly in any aerodynamic regime and along any desired trajectory. Shape bisymmetry also guarantees a positive thrust. If shape symmetry is not met, aerodynamic stall ensures the existence of trim conditions" Paper in IEEE CDC 2019 https://t.co/CEts53cxil https://t.co/RLS5d68keS

None.

None.

Sample Sizes : None.

Authors: 1

Total Words: 5590

Unqiue Words: 1543

Spectrum sensing is a key technology for cognitive radios. We present
spectrum sensing as a classification problem and propose a sensing method based
on deep learning classification. We normalize the received signal power to
overcome the effects of noise power uncertainty. We train the model with as
many types of signals as possible as well as noise data to enable the trained
network model to adapt to untrained new signals. We also use transfer learning
strategies to improve the performance for real-world signals. Extensive
experiments are conducted to evaluate the performance of this method. The
simulation results show that the proposed method performs better than two
traditional spectrum sensing methods, i.e., maximum-minimum eigenvalue
ratio-based method and frequency domain entropy-based method. In addition, the
experimental results of the new untrained signal types show that our method can
adapt to the detection of these new signals. Furthermore, the real-world signal
detection experiment results show that the detection...

more |
pdf
| html
None.

arxiv_cs_LG:
Spectrum Sensing Based on Deep Learning Classification for Cognitive Radios. Shilian Zheng, Shichuan Chen, Peihan Qi, Huaji Zhou, and Xiaoniu Yang https://t.co/uHqxDsmJcF

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

Solution of multi-year, dynamic AC Transmission network expansion planning
(TNEP) problem is gradually taking center stage of planning research owing to
its potential accuracy. However, computational burden for a security
constrained AC TNEP is huge compared to that with DC TNEP. For a dynamic,
security constrained AC TNEP problem, the computational burden becomes so very
excessive that solution for even moderately sized systems becomes almost
impossible. Hence, this paper presents an efficient, four-stage solution
methodology for multi-year, network N-1 contingency and voltage stability
constrained, dynamic ACTNEP problems. Several intelligent logical strategies
are developed and applied to reduce the computational burden of optimization
algorithms. The proposed methodology is applied to Garver 6, IEEE 24 and 118
bus systems to demonstrate its efficiency and ability to solve TNEP for varying
system sizes.

more |
pdf
| html
None.

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

Enhancing the frequency bandwidth of the seismic data is always the pursuance
at the geophysical community. High resolution of seismic data provides the key
resource to extract detailed stratigraphic knowledge. Here, a novel approach,
based on deep learning model, is introduced by extracting reflections from well
log data to broaden spectrum bandwidth of seismic data through boosting low and
high frequencies. The corresponding improvement is observed from the
enhancement of resolution of seismic data as well as elimination of sidelobe
artifacts from seismic wavelets. During the training stage of deep learning
model, geo-spatial information by taking consideration of multiple wells
simultaneously is fully guaranteed, which assures that laterally and vertically
geological information are constrained by and accurate away from the well
controls during the inversion procedure. Extensive experiments prove that the
enhanced seismic data is consistent with well log information, and honors rock
property relationships defined from the wells...

more |
pdf
| html
None.

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 0

Unqiue Words: 0

The important analytical control designs which are based on the state-space
model of the linear time-invariant system yield a controller whose order is
almost the same as that of the plant model. If a plant is described by a
high-order model, the resulting controller cannot be implemented without
reducing its order to a practically acceptable value. This is achieved using
weighted model order reduction wherein the weights represent a specific
closed-loop performance criterion. In this paper, we present a weighted model
order reduction algorithm, which is computationally efficient and ensures less
weighted error. The algorithm tends to achieve the weighted-H2 error optimality
and guarantee the stability of the reduced-order model, unlike the existing
weighted interpolation algorithms. The proposed algorithm is an effective
design tool to obtain a lower order controller for large-scale plants in a
computationally efficient way. The application of the proposed technique in
achieving this objective is also demonstrated on benchmark problems.

more |
pdf
| html
None.

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 189,566 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible