Speech acoustics - Open Access .click


Applying deep matching networks to Chinese medical question answering: a study and a dataset

BMC Medical Informatics and Decision Making, 2019
Background: Medical and clinical question answering (QA) has recently attracted considerable attention from researchers. Despite remarkable advances in this field, development in the Chinese medical domain lags behind.
Junqing He, Mingming Fu, Manshu Tu

Improving Hybrid CTC/Attention Architecture with Time-Restricted Self-Attention CTC for End-to-End Speech Recognition

Applied Sciences, 2019
As demonstrated by the hybrid connectionist temporal classification (CTC)/attention architecture, joint training with a CTC objective is very effective at solving the misalignment problem in attention-based end-to-end automatic speech recognition ...
Long Wu, Ta Li, Li Wang, Yonghong Yan

Comprehensive Active Control of Booming Noise Inside a Vehicle Caused by the Engine and the Driveline

IEEE Access, 2022
This study presents comprehensive active cancellation of booming noise caused by the engine and the driveline inside a passenger car. In modern noise control systems for vehicles, booming noise caused by engine harmonics could be effectively suppressed ...
Seonghyeon Kim, M. Ercan Altinsoy

A Two-Stage Approach to Note-Level Transcription of a Specific Piano

Applied Sciences, 2017
This paper presents a two-stage transcription framework for a specific piano, which combines deep learning and spectrogram factorization techniques. In the first stage, two convolutional neural networks (CNNs) are adopted to recognize the notes of the ...
Qi Wang, Ruohua Zhou, Yonghong Yan

Chinese Dialogue Intention Classification Based on Multi-Model Ensemble

IEEE Access, 2019
In dialogue systems, understanding the user utterances is crucial for providing appropriate responses. A traditional dialogue act classification (DA) task is to classify each user reply into “ACCEPT, REJECT, PROPOSE, and others”.
Manshu Tu, Bing Wang, Xuemin Zhao

Acoustic effects of style of speech [PDF]

The Journal of the Acoustical Society of America, 1974
Recordings were made of nine subjects producing test words in seven different styles, varying from completely informal conversation to reading the test words in lists. The subjects were all educated male speakers of standard English who had lived in Southern California from an early age.
P. Ladefoged, I. Kameny, W. Brackenridge

Convex separable problems with linear and box constraints [PDF]

2014
In this work, we focus on separable convex optimization problems with linear and box constraints and compute the solution in closed-form as a function of some Lagrange multipliers that can be easily computed in a finite number of iterations.
D'Amico, Antonio A.; Palomar, Daniel P.; Sanguinetti, Luca +2 more

Relevancy between Objects Based on Common Sense for Semantic Segmentation

Applied Sciences, 2022
Research on image classification sparked the latest deep-learning boom. Many downstream tasks, including semantic segmentation, benefit from it. The state-of-the-art semantic segmentation models are all based on deep learning, and they sometimes make ...
Jun Zhou, Xing Bai, Qin Zhang

Acoustic segmentation of speech [PDF]

International Journal of Man-Machine Studies, 1970
A brief argument is presented for the need for automatic speech segmentation both to facilitate automatic speech recognition and for its theoretical linguistic importance. The problem of speech segmentation in the acoustic domain using a digital computer is examined in detail, that is, of determining an acoustic partition in time which has linguistic ...

Direct Acoustics-to-Word Models for English Conversational Speech Recognition

2017
Recent work on end-to-end automatic speech recognition (ASR) has shown that the connectionist temporal classification (CTC) loss can be used to convert acoustics to phone or character sequences.
Audhkhasi, Kartik +4 more
