EchoNet++: A multilingual soccer match audio commentary dataset. [PDF]
Majeed F, Nazir M, Agus M, Schneider J.
europepmc +1 more source
"RaagaDhvani: A novel augmented multi-feature dataset: Advancing emotion recognition in Carnatic music with multimodal features and hybrid deep learning". [PDF]
Priyadarshini A, Divakarla U.
europepmc +1 more source
A unified multimodal learning framework for sentiment analysis and mental health indicators from YouTube videos. [PDF]
Satapathy P +7 more
europepmc +1 more source
AI-driven audio-to-video generation for dynamic content creation via stable diffusion and CNN-augmented transformers. [PDF]
Dharrao D +6 more
europepmc +1 more source
Scenario-based functional modularization framework for consumer electronics using MFD and AHP: A case study on audio products. [PDF]
Wu Z, Zhang J.
europepmc +1 more source
Privacy-preserving cyberthreat detection in decentralized social media with federated cross-modal graph transformers. [PDF]
Premkumar D, Nachimuthu SK.
europepmc +1 more source
FPGA implementation and voice encryption application of a new hyperchaotic system with high complexity. [PDF]
Benkouider K +7 more
europepmc +1 more source
Audio Deepfake Detection via a Fuzzy Dual-Path Time-Frequency Attention Network. [PDF]
Li J +6 more
europepmc +1 more source
Large Language Model Adaptation Strategies in Speech-Based Cognitive Screening: Systematic Evaluation. [PDF]
Taherinezhad F +8 more
europepmc +1 more source

