Chef Dalle: Transforming Cooking with Multi-Model Multimodal AI
In an era where dietary habits significantly impact health, technological interventions can offer personalized and accessible food choices. This paper introduces Chef Dalle, a recipe recommendation system that leverages multi-model and multimodal human ...
Brendan Hannon +3 more
doaj +1 more source
SlideSAVR: Enabling Live Analysis during Data Presentations via Multimodal Sketching and Voice Input
Abstract: Interpersonal communication in data science can yield sought‐after insights, but presentation environments are often not conducive for live analysis, forcing the process to move offline. Through a formative survey with 16 participants, we identified both technical (e.g., complexity of tools) and psychological (e.g., pressure of programming ...
C. Han +5 more
wiley +1 more source
Educational video platforms host vast collections of long, speech-centric lectures, yet most search interfaces remain metadata-driven and return results at the video level, making it difficult for learners—especially non-English speakers—to
Lazim Afraz +3 more
doaj +1 more source
The Effects of Stroke on Oral Function and Oral Health: A Qualitative Study
ABSTRACT Introduction: Stroke is the leading cause of adult disability in Aotearoa New Zealand, often resulting in a range of physical and cognitive impairments. The impacts of stroke on oral function and oral health are not well understood from the survivors' perspective, yet the latter are crucial for overall wellbeing. This qualitative study explored
Esther Cheong +4 more
wiley +1 more source
AI-VoiceTherapy: An Automated Platform for Voice Rehabilitation Using Artificial Intelligence
AI-VoiceTherapy is a mobile platform that leverages artificial intelligence to democratize access to speech therapy. The system uses OpenAI's Whisper model to automatically detect and analyze speech disorders from voice recordings, including stuttering,
Nisrine Lachguer +2 more
doaj +4 more sources
ABSTRACT Background: Cognitive impairment is common after critical illness, yet structured cognitive rehabilitation is rarely integrated into routine intensive care practice. ICU CogHab is a stakeholder‐ and theory‐informed intervention comprising two components: Mindfulness and Brain Training, developed to support recovery from intensive care to 6 ...
Katrine Astrup +4 more
wiley +1 more source
Named Entity Recognition (NER) in power grid dispatch operations is critical for operational safety and system reliability. This domain faces unique challenges including domain-specific terminology, nested entity structures, and Chinese language ...
Yantong Zhang +7 more
doaj +1 more source
Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods
Speech Emotion Recognition (SER) research has faced limitations due to the lack of standard and sufficiently large datasets. Recent studies have leveraged pre-trained models to extract features for downstream tasks such as SER. This work explores the capabilities of Whisper, a pre-trained ASR system, in speech emotion recognition by proposing two ...
Ali Shendabadi +3 more
openaire +2 more sources
ABSTRACT Aim: The aim of this study was to explore the perspectives of midwifery academics regarding the integration of Artificial Intelligence (AI) into midwifery education. Background: AI is increasingly transforming healthcare education by supporting simulation‐based training, individualized learning, and clinical decision‐making.
Neşe Çelik +4 more
wiley +1 more source
Conditional Text Generation for AI‐Powered Interviews: A T5‐Based System With GPT‐2 Comparison
This work introduces an AI‐based interview system that uses T5 to generate context‐specific interview questions and responses. By comparing it with GPT‐2, the study shows T5's ability to produce more coherent and relevant dialogue, supporting advances in automated interview and conversational AI applications.
Kritika Acharya, Rashna K.C., Sudip Rana
wiley +1 more source

