
Choice of Mel Filter Bank in Computing MFCC of a Resampled Speech

open access: yes | arXiv, 2014
Mel Frequency Cepstral Coefficients (MFCCs) are the most popularly used speech features in most speech and speaker recognition applications. In this paper, we study the effect of resampling a speech signal on these speech features. We first derive a relationship between the MFCC parameters of the resampled speech and the MFCC parameters of the ...
arxiv  

Repurposing With Purpose: Treatment of Bachmann–Bupp Syndrome With Eflornithine and Implications for Other Polyaminopathies

open access: yes | American Journal of Medical Genetics Part C: Seminars in Medical Genetics, EarlyView.
Abstract: Rare diseases affect approximately 1 in 10 people worldwide, yet fewer than 5% of all rare diseases currently have an approved treatment option. This is due to many challenges unique to rare diseases, including small, diverse patient populations, the cost of drug development that is not proportionate to the number of patients who
Caleb P. Bupp   +7 more
wiley

Brown fat detection by infrared thermography—An invaluable research methodology with noteworthy uncertainties confirmed by a mathematical proof

open access: yes | Endocrinology, Diabetes & Metabolism, Volume 6, Issue 1, January 2023
Infrared detection by thermal camera obeys Stefan's law of radiative heat transfer. The associated uncertainty in detection is estimated by the propagation-of-error formula. Abstract: Brown adipose tissue (BAT) represents a pivotal scientific renaissance as a strategy against obesity and diabetes since its rediscovery in adults over a decade ago.
Melvin K. S. Leow
wiley

Reducing over-smoothness in speech synthesis using Generative Adversarial Networks

open access: yes | arXiv, 2018
Speech synthesis is widely used in many practical applications. In recent years, speech synthesis technology has developed rapidly. However, one of the reasons why synthetic speech is unnatural is that it often has over-smoothness. In order to improve the naturalness of synthetic speech, we first extract the mel-spectrogram of speech and convert it ...
arxiv  

Enhancing Sound Texture in CNN-Based Acoustic Scene Classification

open access: yes | arXiv, 2019
Acoustic scene classification is the task of identifying the scene in which an audio signal was recorded. Convolutional neural network (CNN) models are widely adopted, with proven success in acoustic scene classification. However, there is little insight into how an audio scene is perceived by a CNN, as has been demonstrated in image recognition ...
arxiv  

VP-MEL: Visual Prompts Guided Multimodal Entity Linking

open access: yes | arXiv
Multimodal entity linking (MEL), a task aimed at linking mentions within multimodal contexts to their corresponding entities in a knowledge base (KB), has attracted much attention due to its wide applications in recent years. However, existing MEL methods often rely on mention words as retrieval cues, which limits their ability to effectively utilize ...
arxiv  

Adversarially Trained End-to-end Korean Singing Voice Synthesis System

open access: yes | arXiv, 2019
In this paper, we propose an end-to-end Korean singing voice synthesis system from lyrics and a symbolic melody using the following three novel approaches: 1) phonetic enhancement masking, 2) local conditioning of text and pitch to the super-resolution network, and 3) conditional adversarial training. The proposed system consists of two main modules; a
arxiv  
