We propose a data-driven design method of perfect-reconstruction filterbank
(PRFB) for sound-source enhancement (SSE) based on deep neural network (DNN).
DNNs have been used to estimate a time-frequency (T-F) mask in the short-time
Fourier transform (STFT) domain. Their training is more stable when a simple
cost function as mean-squared error (MSE) is utilized comparing to some
advanced cost such as objective sound quality assessments. However, such a
simple cost function inherits strong assumptions on the statistics of the
target and/or noise which is often not satisfied, and the mismatch of
assumption results in degraded performance. In this paper, we propose to design
the frequency scale of PRFB from training data so that the assumption on MSE is
satisfied. For designing the frequency scale, the warped filterbank frame
(WFBF) is considered as PRFB. The frequency characteristic of learned WFBF was
in between STFT and the wavelet transform, and its effectiveness was confirmed
by comparison with a standard STFT-based DNN whose...

more |
pdf
| html
yatabe_:
DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 https://t.co/M9GOKxyqDa

ComputerPapers:
Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. https://t.co/nVFeY5lrRw

yatabe_:
RT @ComputerPapers: Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. https://t.co/nVFeY5lrRw

akinori_ito:
RT @yatabe_: DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 http…

fjt:
RT @yatabe_: DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 http…

SythonUK:
RT @ComputerPapers: Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. https://t.co/nVFeY5lrRw

ryo_masumura:
RT @yatabe_: DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 http…

yuma_koizumi:
RT @ComputerPapers: Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. https://t.co/nVFeY5lrRw

yuma_koizumi:
RT @yatabe_: DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 http…

IMUY_asakust:
RT @yatabe_: DNN音声強調に対し，データから周波数スケールを学習したフィルタバンクを用いることを提案し，スペクトログラムと比べて雑音除去性能が良くなることを示しました！これも５月のICASSPで発表します！
https://t.co/tCoW2qn2y0 http…

IMUY_asakust:
RT @ComputerPapers: Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. https://t.co/nVFeY5lrRw

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 4586

Unqiue Words: 1585

Tissue loss in the hippocampi has been heavily correlated with the
progression of Alzheimer's Disease (AD). The shape and structure of the
hippocampus are important factors in terms of early AD diagnosis and prognosis
by clinicians. However, manual segmentation of such subcortical structures in
MR studies is a challenging and subjective task. In this paper, we investigate
variants of the well known 3D U-Net, a type of convolution neural network (CNN)
for semantic segmentation tasks. We propose an alternative form of the 3D
U-Net, which uses dilated convolutions and deep supervision to incorporate
multi-scale information into the model. The proposed method is evaluated on the
task of hippocampus head and body segmentation in an MRI dataset, provided as
part of the MICCAI 2018 segmentation decathlon challenge. The experimental
results show that our approach outperforms other conventional methods in terms
of different segmentation accuracy metrics.

more |
pdf
| html
arxivml:
"Dilated deeply supervised networks for hippocampus segmentation in MRI",
Lukas Folle, Sulaiman Vesal, Nishant Ravi…
https://t.co/aHHWojIMXm

SciFi:
Dilated deeply supervised networks for hippocampus segmentation in MRI. https://t.co/x5MmH9L7mu

arxiv_cs_LG:
Dilated deeply supervised networks for hippocampus segmentation in MRI. Lukas Folle, Sulaiman Vesal, Nishant Ravikumar, and Andreas Maier https://t.co/5XUTpbSYfQ

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 2198

Unqiue Words: 972

Epileptic seizure detection and classification in clinical
electroencephalogram data still is a challenge, and only low sensitivity with a
high rate of false positives has been achieved with commercially available
seizure detection tools, which usually are patient non-specific. Epilepsy
patients suffer from severe detrimental effects like physical injury or
depression due to unpredictable seizures. However, even in hospitals due to the
high rate of false positives the seizure alert systems are of poor help for
patients as tools of seizure detection are mostly trained on unrealistically
clean data, containing little noise and obtained under controlled laboratory
conditions, where patient groups are homogeneous, e.g. in terms of age or type
of seizures. In this study authors present the approach for detection and
classification of a seizure using clinical data of electroencephalograms and a
convolutional neural network trained on features of brain synchronisation and
power spectrum. Various deep learning methods were applied, and...

more |
pdf
| html
arxivml:
"Convolutional neural network for detection and classification of seizures in clinical data",
Tomas Iesmantas, Robe…
https://t.co/sRvKPsaOwC

arxiv_cs_LG:
Convolutional neural network for detection and classification of seizures in clinical data. Tomas Iesmantas and Robertas Alzbutas https://t.co/9ihZdoGsCB

Memoirs:
Convolutional neural network for detection and classification of seizures in clinical data. https://t.co/ELB3xJDvef

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 8560

Unqiue Words: 2655

Convolutional neural networks (CNN) have shown state-of-the-art results for
low-level computer vision problems such as stereo and monocular disparity
estimations, but still, have much room to further improve their performance in
terms of accuracy, numbers of parameters, etc. Recent works have uncovered the
advantages of using an unsupervised scheme to train CNN's to estimate monocular
disparity, where only the relatively-easy-to-obtain stereo images are needed
for training. We propose a novel encoder-decoder architecture that outperforms
previous unsupervised monocular depth estimation networks by (i) taking into
account ambiguities, (ii) efficient fusion between encoder and decoder features
with rectangular convolutions and (iii) domain transformations between encoder
and decoder. Our architecture outperforms the Monodepth baseline in all
metrics, even with a considerable reduction of parameters. Furthermore, our
architecture is capable of estimating a full disparity map in a single forward
pass, whereas the baseline needs two...

more |
pdf
| html
arxivml:
"A Novel Monocular Disparity Estimation Network with Domain Transformation and Ambiguity Learning",
Juan Luis Gonza…
https://t.co/eUdbvwGPHV

arxiv_cs_LG:
A Novel Monocular Disparity Estimation Network with Domain Transformation and Ambiguity Learning. Juan Luis Gonzalez Bello and Munchurl Kim https://t.co/zIUfZEKnvl

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 3646

Unqiue Words: 1421

Massive multiple-input multiple-output (MIMO) is expected to be a vital
component in future 5G systems. As such, there is a need for new modeling in
order to investigate the performance of massive MIMO not only at the physical
layer, but also higher up the networking stack. In this paper, we present
general optimization models for massive MIMO, based on mixed-integer
programming and compatible sets, with both maximum ratio combing and zero
forcing precoding schemes. We then apply our models to the case of joint device
scheduling and power control for heterogeneous devices and traffic demands, in
contrast to existing power control schemes that consider only homogeneous users
and saturated scenarios. Our results show substantial benefits in terms of
energy usage can be achieved without sacrificing throughput, and that both
signalling overhead and the complexity of end devices can be reduced by
abrogating the need for uplink power control through efficient scheduling.

more |
pdf
| html
None.

arxiv_org:
Massive MIMO Optimization with Compatible Sets. https://t.co/HxoLK5BO6n https://t.co/4sfFplqil2

ComputerPapers:
Massive MIMO Optimization with Compatible Sets. https://t.co/nMijfTgE0x

MUKULBHALLA7:
https://t.co/WD41GbbXQx

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 17535

Unqiue Words: 3407

Recent progress in graph signal processing (GSP) has addressed a number of
problems, including sampling and filtering. Proposed methods have focused on
generic graphs and defined signals with certain characteristics, e.g.,
bandlimited signals, based on t he graph Fourier transform (GFT). However, the
effect of GFT properties (e.g., vertex localization) on the behavior of such
methods is not as well understood. In this paper, we propose novel GFT
visualization tools and provide some examples to illustrate certain GFT
properties and their impact on sampling or wavelet transforms.

more |
pdf
| html
elpenta:
Check out our new tool to visualize Graph Fourier Transforms. Paper: https://t.co/3ugESbkpOu Code: https://t.co/tO6LYNIAvO Makes it easier to visualize localization, sampling, filtering #graphsignalprocessing https://t.co/7srnTZrbPu

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 7008

Unqiue Words: 1927

This paper investigates the impact of the number of antennas (8 to 64) and
the array configuration on massive MIMO channel parameters estimation for
multiple propagation scenarios at 3.5 GHz. Different measurement environments
are artificially created by placing several reflectors and absorbers in an
anechoic chamber. Ground truth channel parameters, e.g, path angles, are
obtained by geometry and trigonometric rules. Then, these are compared to the
channel parameters extracted by the applying Space-Alternating Generalized
Expectation-Maximization (SAGE) algorithm on the measurements. Overall, the
estimation errors for various array configurations and the multiple
environments are compared. This paper will help to determine the appropriate
configuration of the antenna array and the parameter extraction algorithm for
outdoor massive MIMO channel sounding campaigns.

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 1787

Unqiue Words: 793

The paper addresses the problem of designing radar detectors more robust than
Kelly's detector to possible mismatches of the assumed target signature, but
with no performance degradation under matched conditions. The idea is to model
the received signal under the signal-plus-noise hypothesis by adding a random
component, parameterized via a design covariance matrix, that makes the
hypothesis more plausible in presence of mismatches. Moreover, an unknown power
of such component, to be estimated from the observables, can lead to no
performance loss. Derivation of the (one-step) GLRT is provided for two choices
of the design matrix, obtaining detectors with different complexity and
behavior. A third parametric detector is also obtained by an ad-hoc
generalization of one of such GLRTs. The analysis shows that the proposed
approach can cover a range of different robustness levels that is not
achievable by state-of-the-art with the same performance of Kelly's detector
under matched conditions.

more |
pdf
| html
None.

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 9324

Unqiue Words: 1924

Unlike terrestrial communications, unmanned aerial vehicle (UAV)
communications have some advantages such as the line-of-sight (LoS) environment
and flexible mobility. However, the interference will be still inevitable. In
this paper, we analyze the effect of an interfering node on the UAV
communications by considering the LoS probability and different channel fading
for LoS and non-line-of-sight (NLoS) links, which are affected by the elevation
angle of the communication link. We then derive a closed-form outage
probability in the presence of an interfering node for all the possible
scenarios and environments of main and interference links. After discussing the
impacts of transmitting and interfering node parameters on the outage
probability, we show the existence of the optimal height of the UAV that
minimize the outage probability. We also show the NLoS environment can be
better than the LoS environment if the average received power of the
interference is more dominant than that of the transmitting signal on UAV
communications....

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 10814

Unqiue Words: 1815

In the present paper we develop a Bayesian analysis of radar target detection
that uses the parameters of conventional radar analysis to provide a valid
prediction of target presence or absence when received signals cross or fail to
cross chosen threshold values. A Positive Predictive Value parameter is added
to the normal Receiver Operating Characteristic to provide information that
allows the radar operator to make an informed decision in the choice of
threshold.

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 1

Total Words: 2637

Unqiue Words: 724

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 99,586 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible