Dilated Convolutional Model for Melody Extraction [PDF]
Melody extraction is a challenging task in music information retrieval that enables many down-stream applications. In this paper we propose a simple dilated convolutional model for melody extraction. It takes variable-q transforms as inputs. It first uses consecutive layers of convolution to capture local temporal-frequency patterns.
Javen Shi, Lingqiao Liu, Xian Wang
openaire +1 more source
A theoretical insight into morphological operations in surface measurement by introducing the slope transform [PDF]
As one of the tools for surface analysis, morphological operations, although not as popular as linear convolution operations (e.g. the Gaussian filter), are really useful in mechanical surface reconstruction, surface filtration, functional simulation etc.
Jiang, Xiang +3 more
core +1 more source
Crowd Counting Via Perspective-Guided Fractional-Dilation Convolution [PDF]
Accepted by T-MM ...
Zhaoyi Yan +4 more
openaire +2 more sources
Noting the shortcomings of current methods in detecting small objects in image-based remote sensing applications, in this paper, we propose a novel implementation of single shot multibox detector (SSD) networks based on dilated convolution and feature ...
Junsuo Qu +3 more
doaj +1 more source
A U-Net Based Multi-Scale Deformable Convolution Network for Seismic Random Noise Suppression
Seismic data processing plays a key role in the field of geophysics. The collected seismic data are inevitably contaminated by various types of noise, which makes the effective signals difficult to discriminate accurately.
Haixia Zhao +3 more
doaj +1 more source
Multilevel feature fusion dilated convolutional network for semantic segmentation
Recently, convolutional neural networks (CNNs) have led to significant improvements in the field of computer vision, especially in the accuracy and speed of semantic segmentation tasks, which has greatly improved robot scene perception.
Tao Ku, Qirui Yang, Hao Zhang
doaj +1 more source
We propose a source separation architecture using dilated time-frequency DenseNet for background music identification of broadcast content. We apply source separation techniques to the mixed signals of music and speech. For the source separation purpose,
Woon-Haeng Heo, Hyemi Kim, Oh-Wook Kwon
doaj +1 more source
Road markings, including road lanes and symbolic road markings, can convey abundant guidance information to autonomous driving cars. However, recent works have paid less attention to the recognition of symbolic road markings compared with road lanes.
Junjie Wu, Wen Liu, Yoshihisa Maruyama
doaj +1 more source
Multi-modal Human-Computer Virtual Fusion Interaction In Mixed Reality
The receptive field of a CNN, which usually reflects its learning capacity, is limited by the size of the convolution kernel. At the same time, using pooling to enlarge the receptive field causes a loss of spatial information of ...
Shengying Jia
doaj +1 more source
Short-Term Traffic Flow Prediction Based on Improved Dilated Temporal-Spatio Graph Convolutional Network [PDF]
Traffic flow prediction for road networks plays a key role in intelligent transportation. Traffic flow exhibits not only high spatial correlation but also long-term temporal correlation and periodicity.
LUO Xianglong, XU Zhongcheng, SU Yongdong, HE Xibin, LIU Ruochen
doaj +1 more source

