Choice of Mel Filter Bank in Computing MFCC of a Resampled Speech [PDF]
Mel Frequency Cepstral Coefficients (MFCCs) are the most popularly used speech features in most speech and speaker recognition applications. In this paper, we study the effect of resampling a speech signal on these speech features. We first derive a relationship between the MFCC param- eters of the resampled speech and the MFCC parameters of the ...
arxiv
ABSTRACT Rare diseases impact approximately 1 in 10 people worldwide, and yet, less than 5% of all rare diseases currently have an approved treatment option available. This is due to many challenges unique to rare diseases, including small, diverse patient populations, the cost of drug development that is not proportionate to the number of patients who
Caleb P. Bupp+7 more
wiley +1 more source
Infrared detection by thermal camera obeys Stefan's law of radiative heat transfer. This associated uncertainty in detection is estimated by the formula of propagation of error. Abstract Brown adipose tissue (BAT) represents a pivotal scientific renaissance worthy as a strategy for obesity and diabetes since its re‐discovery in adults over a decade ago.
Melvin K. S. Leow
wiley +1 more source
Reducing over-smoothness in speech synthesis using Generative Adversarial Networks [PDF]
Speech synthesis is widely used in many practical applications. In recent years, speech synthesis technology has developed rapidly. However, one of the reasons why synthetic speech is unnatural is that it often has over-smoothness. In order to improve the naturalness of synthetic speech, we first extract the mel-spectrogram of speech and convert it ...
arxiv
The age-associated increase in IFN-gamma synthesis by mouse CD8+ T cells correlates with shifts in the frequencies of cell subsets defined by membrane CD44, CD45RB, 3G11, and MEL-14 expression. [PDF]
David Ernst+4 more
openalex +1 more source
Enhancing Sound Texture in CNN-Based Acoustic Scene Classification [PDF]
Acoustic scene classification is the task of identifying the scene from which the audio signal is recorded. Convolutional neural network (CNN) models are widely adopted with proven successes in acoustic scene classification. However, there is little insight on how an audio scene is perceived in CNN, as what have been demonstrated in image recognition ...
arxiv
Rapid capping in alpha-spectrin-deficient MEL cells from mice afflicted with hereditary hemolytic anemia. [PDF]
Stephen C. Dahl+4 more
openalex +1 more source
VP-MEL: Visual Prompts Guided Multimodal Entity Linking [PDF]
Multimodal entity linking (MEL), a task aimed at linking mentions within multimodal contexts to their corresponding entities in a knowledge base (KB), has attracted much attention due to its wide applications in recent years. However, existing MEL methods often rely on mention words as retrieval cues, which limits their ability to effectively utilize ...
arxiv
Adversarially Trained End-to-end Korean Singing Voice Synthesis System [PDF]
In this paper, we propose an end-to-end Korean singing voice synthesis system from lyrics and a symbolic melody using the following three novel approaches: 1) phonetic enhancement masking, 2) local conditioning of text and pitch to the super-resolution network, and 3) conditional adversarial training. The proposed system consists of two main modules; a
arxiv
Insetos capturados com armadilha Malaise na Ilha do Mel, Baía de Paranaguá, Paraná, Brasil: I. composição de ordens [PDF]
Renato Roxo Coutinho Dutra+1 more
openalex +1 more source