Results 41 to 50 of about 22,343 (153)

Segmentation of Speech and Humming in Vocal Input [PDF]

open access: yes, 2012
Non-verbal vocal interaction (NVVI) is an interaction method in which sounds other than speech produced by a human are used, such as humming. NVVI complements traditional speech recognition systems with continuous control.
Havlik, J., Polacek, O., Sporka, A. J.
core   +1 more source

Gate‐Align‐SED: Semi‐Supervised Sound Event Detection via Adaptive Feature Gating and Cross‐Task Alignment in Situation Awareness

open access: yesAdvanced Intelligent Systems, EarlyView.
Overview of the proposed Gate‐Align‐SED, including two stages of training: (1) Mean‐Teacher SSL Training; and (2) Enhancer Model Training. In complex real‐world environments such as disaster monitoring, effective sound event detection (SED) is often hindered by the presence of noise and limited labeled data.
Jieli Chen   +4 more
wiley   +1 more source

Artificial Intelligence in Voice Disorders: Current Landscape, Emerging Applications and Future Directions

open access: yesWorld Journal of Otorhinolaryngology - Head and Neck Surgery, EarlyView.
ABSTRACT Objective To provide a comprehensive review of the current landscape of artificial intelligence (AI) applications in voice disorder, with emphasis on emerging applications, limitations, and future directions for clinical integration. Methods Literature review.
Rachel B. Kutler, Anaïs Rameau
wiley   +1 more source

Comparison of VTOL UAV Battery Level for Propeller Faulty Classification Model

open access: yesJOIV: International Journal on Informatics Visualization
The degradation of batteries in UAVs may result in various problems, such as connectivity troubles, flight delays, and unexpected accidents. Flight safety and reliability are affected by propeller efficiency and performance.
Fareisya Zulaikha Mohd Sani   +4 more
doaj   +1 more source

Newborns' Language Discrimination May Not Reflect Sensitivity to Speech Rhythm: Evidence From Computational Modeling

open access: yesDevelopmental Science, Volume 29, Issue 4, July 2026.
ABSTRACT Human newborns are able to discriminate between certain languages but not others. This ability has long been attributed to sensitivity to rhythm—the temporal regularities in speech of different languages. Here, we demonstrate through a series of computational simulations that this discrimination behavior can be achieved using no temporal ...
Ruolan Leslie Famularo   +3 more
wiley   +1 more source

Pistachio Classification Based on Acoustic Systems and Machine Learning

open access: yesElektronika ir Elektrotechnika
An acoustic emission and machine learning based pistachio classification system has been developed. This system performs feature extraction using Mel frequency cepstral coefficients (MFCC) and classification using support vector machine (SVM). This study
Yavuz Türkay, Zekiye Seyma Tamay
doaj   +1 more source

Optimal Representation of Anuran Call Spectrum in Environmental Monitoring Systems Using Wireless Sensor Networks [PDF]

open access: yes, 2018
The analysis and classification of the sounds produced by certain animal species, notably anurans, have revealed these amphibians to be a potentially strong indicator of temperature fluctuations and therefore of the existence of climate change ...
Aguayo-González, Francisco (Coordinador)   +5 more
core  

DeepCough: A Deep Convolutional Neural Network in A Wearable Cough Detection System

open access: yes, 2015
In this paper, we present a system that employs a wearable acoustic sensor and a deep convolutional neural network for detecting coughs. We evaluate the performance of our system on 14 healthy volunteers and compare it to that of other cough detection ...
Amoh, Justice, Odame, Kofi
core   +1 more source

Inter‐Model Feature Fusion for Robust Low‐Resource Speech Recognition

open access: yesApplied AI Letters, Volume 7, Issue 2, June 2026.
Our Self‐Supervised Feature Fusion (SSF‐FT) method enhances low‐resource speech recognition by adaptively combining features from self‐supervised models trained with Contrastive, Predictive, and Reconstruction objectives. This attention‐weighted ensemble delivers robust performance, particularly in acoustically challenging conditions, extending current
Ussen Kimanuka   +2 more
wiley   +1 more source

Voice Analysis and Classification System Based on Perturbation Parameters and Cepstral Presentation in Psychoacoustic Scales

open access: yesДоклады Белорусского государственного университета информатики и радиоэлектроники, 2022
The paper describes an approach to design a system for analyzing and classification of a voice signal based on perturbation parameters and cepstral representation.
M. I. Vashkevich   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy