Feature/mac mps support by TheApeMachine · Pull Request #23 · HeartMuLa/heartlib

TheApeMachine · 2026-01-19T13:16:42Z

Adds Mac MPS/Metal support
Refactors hard-coded CUDA implementation to detect platform
Beginning of custom Metal kernels to improve performance on mac

Currently takes about 11 minutes on Macbook Pro M4-Max 128GB unified memory (96 GB assignable as VRAM).

Manual inspection of output (listening to .mp3) confirms it is working.

…e selection. Update argument handling in `run_music_generation.py` and improve `HeartMuLaGenPipeline` class for better input processing and model execution.

…odec model. Update `run_lyrics_transcription.py` to dynamically select device based on availability, and modify `HeartCodec` to determine device from input tensor or model parameters. Improve `HeartMuLaGenPipeline` to support autocast on MPS for better performance.

…mize audio token padding. Introduce a context manager for autocast that gracefully handles unsupported cases, and preallocate buffers for audio tokens to enhance performance during generation.

…ce on MPS. Update `pyproject.toml` to include the optimizer package directory. Enhance `HeartMuLaGenPipeline` to optionally enable Metal optimizations during model execution, improving performance for Llama blocks.

…w Metal kernels and Python wrappers. Update `pyproject.toml` to remove the optimizer package directory. Enhance runtime detection for Metal support and build tools availability.

iamwavecut · 2026-01-20T11:44:57Z

I tested it on an MBP 16" M2 Max 64GB: the default prompt took 24 minutes, with 33 GB of RAM allocated.

TheApeMachine · 2026-01-21T01:36:28Z

@iamwavecut Damn... Well, let's start by saying "cool it works" :p But this is of course not super great to have to wait that long for a song.
There is still a lot that can be done, most notably writing more, and better custom metal kernels to fuse operations.

Ah, so that reminds me, did you have everything in place to support the custom metal kernels that are there? (you need xcode-tools, which I am sure you have, but also two additional libraries, I would have to look up) And set:

HEARTLIB_ENABLE_MPS_METAL=1
HEARTLIB_MPS_METAL_VERBOSE=1

I do see a couple of things right now, let me take a stab at making it faster...

tonywestonuk · 2026-01-24T21:59:21Z

Generated on a 'modest' MacBook Air m4 32gb. Memory pressure went to red for a few seconds, but.... it did it in about an hour, Im guessing a fair amount of thermal throttling.

But, wow. I didn't think it would work....but it did. Thanks for your efforts getting this going.

output.mp3

TheApeMachine · 2026-02-06T23:22:56Z

@tonywestonuk check you activity monitor, my guess is swapping to disk.

TheApeMachine added 5 commits January 19, 2026 13:04

Refactor music generation pipeline to support dynamic device and dtyp…

c2fc45c

…e selection. Update argument handling in `run_music_generation.py` and improve `HeartMuLaGenPipeline` class for better input processing and model execution.

Refactor HeartMuLaGenPipeline to improve autocast handling and opti…

082f715

…mize audio token padding. Introduce a context manager for autocast that gracefully handles unsupported cases, and preallocate buffers for audio tokens to enhance performance during generation.

Implement Metal support for RMSNorm and RoPE operations, including ne…

b56ec87

…w Metal kernels and Python wrappers. Update `pyproject.toml` to remove the optimizer package directory. Enhance runtime detection for Metal support and build tools availability.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/mac mps support#23

Feature/mac mps support#23
TheApeMachine wants to merge 5 commits intoHeartMuLa:mainfrom
TheApeMachine:feature/mac-mps-support

TheApeMachine commented Jan 19, 2026

Uh oh!

iamwavecut commented Jan 20, 2026

Uh oh!

TheApeMachine commented Jan 21, 2026

Uh oh!

tonywestonuk commented Jan 24, 2026

Uh oh!

TheApeMachine commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

TheApeMachine commented Jan 19, 2026

Uh oh!

iamwavecut commented Jan 20, 2026

Uh oh!

TheApeMachine commented Jan 21, 2026

Uh oh!

tonywestonuk commented Jan 24, 2026

Uh oh!

TheApeMachine commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants