Modeling speech imitation and ecological learning of auditory-motor maps. [PDF]
Classical models of speech consider an antero-posterior distinction between perceptive and productive functions. However, the selective alteration of neural activity in speech motor centers, via transcranial magnetic stimulation, was shown to affect ...
Badino, L +4 more
core +2 more sources
Common sound source localization algorithms focus on localizing all the active sources in the environment. While the source identities are generally unknown, retrieving the location of a speaker of interest requires extra effort. This paper addresses the
Ziteng Wang, Junfeng Li, Yonghong Yan
doaj +1 more source
Building competitive direct acoustics-to-word models for English conversational speech recognition
Direct acoustics-to-word (A2W) models in the end-to-end paradigm have received increasing attention compared to conventional sub-word based automatic speech recognition models using phones, characters, or context-dependent hidden Markov model states ...
Audhkhasi, Kartik +4 more
core +1 more source
Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality [PDF]
Digital waveguide physical modeling is often used as an efficient representation of acoustical resonators such as the human vocal tract. Building on the basic one-dimensional (1-D) Kelly-Lochbaum tract model, various speech synthesis techniques ...
Howard, D M, Mullen, J, Murphy, D T
core +1 more source
Polyphonic Piano Transcription with a Note-Based Music Language Model
This paper proposes a note-based music language model (MLM) for improving note-level polyphonic piano transcription. The MLM is based on the recurrent structure, which could model the temporal correlations between notes in music sequences. To combine the
Qi Wang, Ruohua Zhou, Yonghong Yan
doaj +1 more source
Modeling Speech Sound Radiation With Different Degrees of Realism for Articulatory Synthesis
Articulatory synthesis is based on modeling various physical phenomena of speech production, including sound radiation from the mouth. With regard to sound radiation, the most common approach is to approximate it in terms of a simple spherical source of ...
Peter Birkholz +5 more
doaj +1 more source
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
This paper proposes a forward attention method for the sequenceto- sequence acoustic modeling of speech synthesis. This method is motivated by the nature of the monotonic alignment from phone sequences to acoustic sequences. Only the alignment paths that
Dai, Li-Rong +2 more
core +1 more source
Vowel Production in Mandarin Accented English and American English: Kinematic and Acoustic Data from the Marquette University Mandarin Accented English Corpus [PDF]
Few electromagnetic articulography (EMA) datasets are publicly available, and none have focused systematically on non-native accented speech. We introduce a kinematic-acoustic database of speech from 40 (gender and dialect balanced) participants ...
Berry, Jeffrey J. +2 more
core +2 more sources
An Archaeoacoustics Analysis of Cistercian Architecture: The Case of the Beaulieu Abbey
The Cistercian order is of acoustic interest because previous research has hypothesized that Cistercian architectural structures were designed for longer reverberation times in order to reinforce Gregorian chants.
Sebastian Duran +2 more
doaj +1 more source
Online distributed waveform-synchronization for acoustic sensor networks with dynamic topology
Acoustic sensing by multiple devices connected in a wireless acoustic sensor network (WASN) creates new opportunities for multichannel signal processing. However, the autonomy of agents in such a network still necessitates the alignment of sensor signals
Aleksej Chinaev +2 more
doaj +1 more source

