Super slow on Mac MPS

Hi, a follow up on #15: I compared cpu vs mps and compile vs no compile on halfcheetah for 100k steps using SAC. It shows that mps is significantly slower than cpu, and `aot_eager` backend makes compile slower and much more so for cpu, tho the default `inductor` backend makes compile quite a bit faster for cpu but doesn't work for mps. 

![Screenshot 2024-11-12 at 9 19 03 AM](https://github.com/user-attachments/assets/0951bcdb-5e96-441d-af99-43cf2e9173e3)

Code change is the following:
```
if args.compile:
        mode = None  # "reduce-overhead" if not args.cudagraphs else None
        backend = "aot_eager" if device == torch.device("mps") else "inductor"
        update_main = torch.compile(update_main, mode=mode, backend=backend)
        update_pol = torch.compile(update_pol, mode=mode, backend=backend)
        policy = torch.compile(policy, mode=mode, backend=backend)
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Super slow on Mac MPS #16

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Super slow on Mac MPS #16

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions