Openai whisper - Open Access .click

Results 31 to 40 of about 420 (159)

Chef Dalle: Transforming Cooking with Multi-Model Multimodal AI

Computers
In an era where dietary habits significantly impact health, technological interventions can offer personalized and accessible food choices. This paper introduces Chef Dalle, a recipe recommendation system that leverages multi-model and multimodal human ...
Brendan Hannon +3 more
doaj +1 more source

SlideSAVR: Enabling Live Analysis during Data Presentations via Multimodal Sketching and Voice Input

Computer Graphics Forum, EarlyView.
Abstract Interpersonal communication in data science can yield sought‐after insights, but presentation environments are often not conducive for live analysis, forcing the process to move offline. Through a formative survey with 16 participants, we identified both technical (e.g., complexity of tools) and psychological (e.g., pressure of programming ...
C. Han +5 more
wiley +1 more source

Scalable Multilingual Retrieval of Lecture Moments Through Whisper-Based Transcripts and Vector Search

IEEE Access
Educational video platforms host vast collections of long, speech-centric lectures, yet most search interfaces remain metadata-driven and return results at the video level, making it difficult for learners—especially non-English speakers—to
Lazim Afraz +3 more
doaj +1 more source

The Effects of Stroke on Oral Function and Oral Health: A Qualitative Study

Gerodontology, EarlyView.
ABSTRACT Introduction Stroke is the leading cause of adult disability in Aotearoa New Zealand, often resulting in a range of physical and cognitive impairments. The impacts of stroke on oral function and oral health are not well understood from the survivors' perspective, yet the latter are crucial for overall wellbeing. This qualitative study explored
Esther Cheong +4 more
wiley +1 more source

AI-VoiceTherapy: An Automated Platform for Voice Rehabilitation Using Artificial Intelligence

International Journal of Computer Engineering and Data Science
AI-VoiceTherapy is a mobile platform that leverages artificial intelligence to democratize access to speech therapy. The system uses OpenAI's Whisper model to automatically detect and analyze speech disorders from voice recordings, including stuttering,
Nisrine Lachguer, Ourda Azizi, Soumaya El Mamoune +2 more
doaj +4 more sources

Acceptability and Fidelity of a Cognitive Rehabilitation Intervention During and After Intensive Care: A Feasibility Evaluation

Nursing in Critical Care, Volume 31, Issue 4, July 2026.
ABSTRACT Background Cognitive impairment is common after critical illness, yet structured cognitive rehabilitation is rarely integrated into routine intensive care practice. ICU CogHab is a stakeholder‐ and theory‐informed intervention comprising two components: Mindfulness and Brain Training, developed to support recovery from intensive care to 6 ...
Katrine Astrup +4 more
wiley +1 more source

Enhanced Named Entity Recognition in Power Grid Operations: A Span-Based Approach for Chinese Dispatch Communications

IEEE Access
Named Entity Recognition (NER) in power grid dispatch operations is critical for operational safety and system reliability. This domain faces unique challenges including domain-specific terminology, nested entity structures, and Chinese language ...
Yantong Zhang +7 more
doaj +1 more source

Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods

CoRR
Speech Emotion Recognition (SER) research has faced limitations due to the lack of standard and sufficiently large datasets. Recent studies have leveraged pre-trained models to extract features for downstream tasks such as SER. This work explores the capabilities of Whisper, a pre-trained ASR system, in speech emotion recognition by proposing two ...
Ali Shendabadi +3 more
openaire +2 more sources

Preparing Midwives for the Digital Future: A Qualitative Study on Academic Midwives' Perspectives on Artificial Intelligence in Education

Journal of Evaluation in Clinical Practice, Volume 32, Issue 4, June 2026.
ABSTRACT Aim The aim of this study was to explore the perspectives of midwifery academics regarding the integration of Artificial Intelligence (AI) into midwifery education. Background AI is increasingly transforming healthcare education by supporting simulation‐based training, individualized learning, and clinical decision‐making.
Neşe Çelik +4 more
wiley +1 more source

Conditional Text Generation for AI‐Powered Interviews: A T5‐Based System With GPT‐2 Comparison

Engineering Reports, Volume 8, Issue 4, April 2026.
This work introduces an AI‐based interview system that uses T5 to generate context‐specific interview questions and responses. By comparing it with GPT‐2, the study shows T5's ability to produce more coherent and relevant dialogue, supporting advances in automated interview and conversational AI applications.
Kritika Acharya, Rashna K.C., Sudip Rana
wiley +1 more source

artificial intelligence
speech recognition
fos: computer and information sciences

machine learning cs.lg
personalized rehabilitation
speech disorders

mobile health
speech therapy
audio and speech processing eess.as