Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop [PDF]
We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography.
Arthur, Philip +18 more
core +3 more sources
Long-term learning behavior in a recurrent neural network for sound recognition [PDF]
In this paper, the long-term learning properties of an artificial neural network model, designed for sound recognition and computational auditory scene analysis in general, are investigated. The model is designed to run for long periods of time (weeks to
Boes, Michiel +3 more
core +1 more source
A Comparison of Perceptually Motivated Loss Functions for Binary Mask Estimation in Speech Separation [PDF]
This work proposes and compares perceptually motivated loss functions for deep learning based binary mask estimation for speech separation. Previous loss functions have focused on maximising classification accuracy of mask estimation but we now propose ...
Milner, Ben; Websdale, Danny
core +1 more source
Learning Mid-Level Auditory Codes from Natural Sound Statistics [PDF]
Interaction with the world requires an organism to transform sensory signals into representations in which behaviorally meaningful properties of the environment are made explicit.
McDermott, Josh; Mlynarski, Wiktor
core +2 more sources
Application of Machine Learning for the Spatial Analysis of Binaural Room Impulse Responses [PDF]
Spatial impulse response analysis techniques are commonly used in the field of acoustics, as they help to characterise the interaction of sound with an enclosed environment.
Lovedee-Turner, Michael James +1 more
core +3 more sources
Native language identification (NLI) is the task of identifying the first language of a user based on their speech or written text in a second language. In this paper, we propose the use of spectrogram- and cochleagram-based features extracted from very ...
Farah Adeeba, Sarmad Hussain
doaj +1 more source
Photonic nonlinear transient computing with multiple-delay wavelength dynamics [PDF]
We report on the experimental demonstration of a hybrid optoelectronic neuromorphic computer based on complex nonlinear wavelength dynamics including multiple delayed feedbacks with randomly defined weights.
Chembo, Yanne Kouomou +4 more
core +3 more sources
Speech recognition through physical reservoir computing with neuromorphic nanowire networks [PDF]
The hardware implementation of the reservoir computing paradigm represents a key aspect for taking advantage of neuromorphic data processing. In this context, self-organised nanonetworks represent a versatile and scalable computational substrate for
Agliuzza, M +3 more
core +1 more source
An FPGA-Based Electronic Cochlea
A module generator which can produce an FPGA-based implementation of an electronic cochlea filter with arbitrary precision is presented. Although hardware implementations of electronic cochlea models have traditionally used analog VLSI as the ...
M. P. Leong +2 more
doaj +1 more source
Attention-driven auditory stream segregation using a SOM coupled with an excitatory-inhibitory ANN [PDF]
Auditory attention is an essential property of human hearing. It is responsible for the selection of information to be sent to working memory and as such to be perceived consciously, from the abundance of auditory information that is continuously ...
Boes, Michiel +3 more
core +2 more sources

