Neural Notes - Paper replays by the AMAAI Lab podcast artwork

PODCAST · technology

Neural Notes - Paper replays by the AMAAI Lab

Dive into the latest research papers from the Audio, Music & AI Lab (AMAAI) at Singapore University of Technology and Design. Every episode turns a fresh AMAAI publication into an engaging, understandable conversation. Multimodal generative AI, symbolic music, automatic mastering, and beyond. Hosted by AI, powered by humans.

  1. 2

    Inside the text2midi Architecture

    This episode of Neural Notes explores text2midi, the breakthrough end-to-end model that converts textual descriptions directly into symbolic MIDI music files,. We reveal how this system utilizes Large Language Models (LLMs) to give users unprecedented control, allowing them to generate compositions simply by typing prompts that specify elements like chords, keys, and tempo,. Discover how text2midi streamlines the music creation process, generating compositions with superior long-term structure, and making AI-guided composition accessible to expert composers and everyday users alike. Original paper: Bhandari, K., Roy, A., Wang, K., Puri, G., Colton, S., & Herremans, D. (2025, April). Text2midi: Generating symbolic music from captions. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, No. 22, pp. 23478-23486).Read the paper here.

  2. 1

    Why Your AI Music Lacks Soul: Aligning Computational Goals with Human Taste.

    This episode of Neural Notes discusses a new AAAI paper by Dorien Herremans and Abhinaba Roy which tackles the persistent challenge in generative music AI: why systems, despite achieving high technical fidelity, often fail to produce music that is aesthetically pleasing and emotionally resonant to human listeners. Traditional training methods optimize for likelihood, successfully capturing surface-level patterns but failing to grasp the deeper qualities that drive human musical appreciation. We explore how researchers are bridging this fundamental gap between computational optimization and human preference through systematic alignment techniques. This includes detailed discussions of large-scale preference learning (e.g., MusicRL), Direct Preference Optimization (DPO) integrated into modern diffusion architectures (e.g., DiffRhythm+), and inference-time optimization strategies (e.g., Text2midi-InferAlign), all focused on shifting the generative modeling objective from statistical fidelity to human-centered quality optimization.Paper discussed: Aligning Generative Music AI with Human Preferences: Methods and Challenges by Dorien Herremans, Abhinaba Roy. Accepted for presentation in the senior member track of AAAI 2026, Singapore. Read the paper here.

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

Dive into the latest research papers from the Audio, Music & AI Lab (AMAAI) at Singapore University of Technology and Design. Every episode turns a fresh AMAAI publication into an engaging, understandable conversation. Multimodal generative AI, symbolic music, automatic mastering, and beyond. Hosted by AI, powered by humans.

HOSTED BY

Dorien Herremans

CATEGORIES

Frequently Asked Questions

How many episodes does Neural Notes - Paper replays by the AMAAI Lab have?

Neural Notes - Paper replays by the AMAAI Lab currently has 2 episodes available on PodParley. New episodes are automatically indexed when they're published to the podcast feed.

What is Neural Notes - Paper replays by the AMAAI Lab about?

Dive into the latest research papers from the Audio, Music & AI Lab (AMAAI) at Singapore University of Technology and Design. Every episode turns a fresh AMAAI publication into an engaging, understandable conversation. Multimodal generative AI, symbolic music, automatic mastering, and beyond....

How often does Neural Notes - Paper replays by the AMAAI Lab release new episodes?

Neural Notes - Paper replays by the AMAAI Lab has 2 episodes. Check the episode list to see recent publication dates and frequency.

Where can I listen to Neural Notes - Paper replays by the AMAAI Lab?

You can listen to Neural Notes - Paper replays by the AMAAI Lab on PodParley by clicking any episode. We provide an embedded audio player for direct listening, and you can also subscribe via your preferred podcast app using the RSS feed.

Who hosts Neural Notes - Paper replays by the AMAAI Lab?

Neural Notes - Paper replays by the AMAAI Lab is created and hosted by Dorien Herremans.
URL copied to clipboard!