Results 251 to 260 of about 4,915,608 (331)
Some of the next articles are maybe not open access.
GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions
AAAI Conference on Artificial IntelligenceComposing music for video is essential yet challenging, leading to a growing interest in automating music generation for video applications. Existing approaches often struggle to achieve robust music-video correspondence and generative diversity ...
Heda Zuo +7 more
semanticscholar +1 more source
Benchmarking Music Generation Models and Metrics via Human Preference Studies
IEEE International Conference on Acoustics, Speech, and Signal ProcessingRecent advancements have brought generated music closer to human-created compositions, yet evaluating these models remains challenging. While human preference is the gold standard for assessing quality, translating these subjective judgments into ...
Florian Grötschla +3 more
semanticscholar +1 more source
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation
arXiv.orgAutoregressive (AR) models have demonstrated impressive capabilities in generating high-fidelity music. However, the conventional next-token prediction paradigm in AR models does not align with the human creative process in music composition, potentially
Max W. Y. Lam +16 more
semanticscholar +1 more source
MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling
IEEE International Conference on Acoustics, Speech, and Signal ProcessingWhile most music generation models generate a mixture of stems (in mono or stereo), we propose to train a multi-stem generative model with 3 stems (bass, drums and other) that learn the musical dependencies between them.
Simon Rouard +3 more
semanticscholar +1 more source
Electronics
Text-to-music generation integrates natural language processing and music generation, enabling artificial intelligence (AI) to compose music from textual descriptions.
Yujia Zhao +7 more
semanticscholar +1 more source
Text-to-music generation integrates natural language processing and music generation, enabling artificial intelligence (AI) to compose music from textual descriptions.
Yujia Zhao +7 more
semanticscholar +1 more source
Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models
North American Chapter of the Association for Computational LinguisticsThe advent of Music-Language Models has greatly enhanced the automatic music generation capability of AI systems, but they are also limited in their coverage of the musical genres and cultures of the world. We present a study of the datasets and research
Atharva Mehta +5 more
semanticscholar +1 more source
Hybrid Learning Module-Based Transformer for Multitrack Music Generation With Music Theory
IEEE Transactions on Computational Social SystemsIn recent years, multitrack music generation has garnered significant attention in both academic and industrial spheres for its versatile utilization of various instruments in collaborative settings.
Y. Tie +5 more
semanticscholar +1 more source
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
International Conference on Machine LearningWe propose Diffusion Inference-Time T-Optimization (DITTO), a general-purpose frame-work for controlling pre-trained text-to-music diffusion models at inference-time via optimizing initial noise latents.
Zachary Novack +3 more
semanticscholar +1 more source
ClaviNet: Generate Music With Different Musical Styles
IEEE MultiMedia, 2021Classically, the style of the generated music by deep learning models is usually governed by the training dataset. In this article, we improved this by proposing the continuous style embedding ${z}_{s}$ z s to the general formulation of variational autoencoder (VAE) to allow users to be able to condition on the style of the generated music.
Yu-Quan Lim +2 more
openaire +1 more source
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion
International Conference on Machine LearningWe study the problem of symbolic music generation (e.g., generating piano rolls), with a technical focus on non-differentiable rule guidance. Musical rules are often expressed in symbolic form on note characteristics, such as note density or chord ...
Yujia Huang +8 more
semanticscholar +1 more source

