
Adding a simple optimizer registry. #876

Open
balancap wants to merge 1 commit into main from adding-optimizer-registry

Conversation

balancap (Contributor)

At the moment, `_create_optimizer` hardcodes the list of available optimizers.

Adding a simple registry allows users to fully customize their optimizer choice and configuration (when used in combination with a custom `build_optimizers`).
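
For context, a minimal sketch of what such a registry could look like; the names `_OPTIMIZER_REGISTRY` and `register_optimizer` are illustrative here, not necessarily the PR's actual API:

from typing import Dict, Type

from torch.optim import Adam, AdamW, Optimizer

# Hypothetical module-level registry mapping config names to optimizer classes.
_OPTIMIZER_REGISTRY: Dict[str, Type[Optimizer]] = {}


def register_optimizer(name: str, optimizer_cls: Type[Optimizer]) -> None:
    """Register an optimizer class under a config-friendly name."""
    if name in _OPTIMIZER_REGISTRY:
        raise ValueError(f"Optimizer '{name}' is already registered.")
    _OPTIMIZER_REGISTRY[name] = optimizer_cls


# Built-in entries; users can register their own before building optimizers.
register_optimizer("Adam", Adam)
register_optimizer("AdamW", AdamW)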

facebook-github-bot added the CLA Signed label on Feb 21, 2025
balancap force-pushed the adding-optimizer-registry branch from 1c47613 to 8aae457 on February 21, 2025
balancap (Contributor, Author) commented Feb 21, 2025

I am introducing the minimal change needed to support custom optimizers in the training loop. But there is probably a case for extending this work to avoid hardcoding

optimizer_kwargs = {
    "lr": lr,
    "betas": (0.9, 0.95),
    "weight_decay": 0.1,
    "fused": fused,
    "foreach": not fused,
}

in the current `build_optimizers` implementation.

Additionally, it could also be useful to have a registry for `LRSchedulerLambda` functions. It's not strictly necessary, since users can customize `build_lr_schedulers`, but it could simplify things a bit for them.
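
As an illustration of the first point, a registry entry could also carry a default-kwargs factory so that `build_optimizers` no longer hardcodes these values. This is only a sketch, and the `OptimizerEntry` shape is hypothetical:

from typing import Any, Callable, Dict, NamedTuple, Type

from torch.optim import AdamW, Optimizer


class OptimizerEntry(NamedTuple):
    optimizer_cls: Type[Optimizer]
    # Factory building the kwargs dict from config values instead of hardcoding it.
    default_kwargs: Callable[[float, bool], Dict[str, Any]]


_OPTIMIZER_REGISTRY: Dict[str, OptimizerEntry] = {
    "AdamW": OptimizerEntry(
        AdamW,
        lambda lr, fused: {
            "lr": lr,
            "betas": (0.9, 0.95),
            "weight_decay": 0.1,
            "fused": fused,
            "foreach": not fused,
        },
    ),
}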

fegin (Contributor) commented Feb 21, 2025

If users would like to extend to more cases beyond the current optimizers, they should just use TrainSpec and replace the optimizer implementation with their own. IMHO, that's the granularity TorchTitan provides for users to customize.
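
For readers unfamiliar with that mechanism, the idea is roughly the following sketch; the import path and the `build_optimizers_fn` field name are assumptions and may not match the current `TrainSpec` definition:

from dataclasses import replace

# NOTE: module path and TrainSpec field names may differ across TorchTitan versions.
from torchtitan.train_spec import get_train_spec, register_train_spec


def build_my_optimizers(model_parts, job_config):
    # User-defined builder returning a custom optimizers container.
    ...


# Start from an existing spec and swap in the custom optimizer builder.
llama3_spec = get_train_spec("llama3")
register_train_spec(
    replace(llama3_spec, name="llama3_custom_opt", build_optimizers_fn=build_my_optimizers)
)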

balancap (Contributor, Author)

Agreed, that's what we want to do: have our own TrainSpec. But we would also like to be able to reuse the `OptimizersContainer` class, which is generic and not tied to Adam optimizers.

One simpler alternative, keeping TorchTitan as lean as it is, would be to pass `optimizer_cls` to `OptimizersContainer` instead of the optimizer name, i.e.:

from typing import Any, Dict, List, Type

import torch.nn as nn
from torch.optim import Optimizer

class OptimizersContainer(Optimizer):
    """Container holding one optimizer instance per model part."""

    optimizers: List[Optimizer]
    model_parts: List[nn.Module]

    def __init__(
        self, model_parts: List[nn.Module], optimizer_cls: Type[Optimizer], optimizer_kwargs: Dict[str, Any]
    ) -> None:
        ...
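
With that signature, a caller could instantiate the container with any `torch.optim` class, for example (usage sketch; `model_parts` is assumed to be the trainer's list of model chunks):

from torch.optim import AdamW

optimizers = OptimizersContainer(
    model_parts=model_parts,
    optimizer_cls=AdamW,
    optimizer_kwargs={"lr": 3e-4, "betas": (0.9, 0.95), "weight_decay": 0.1},
)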

How does that sound to you, @fegin @tianyu-l?

Independent of my particular need, I believe it is also a better design: a container should be agnostic to the element type (like `std::vector`, ...), and `OptimizersContainer` could then be more strongly typed as `Generic[T]`, reflecting that the class expects all optimizers to be of the same type.
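
To illustrate the typing point, a minimal sketch of a generically typed container; it drops the `Optimizer` base class for brevity and only shows the construction logic:

from typing import Any, Dict, Generic, List, Type, TypeVar

import torch.nn as nn
from torch.optim import Optimizer

TOptimizer = TypeVar("TOptimizer", bound=Optimizer)


class OptimizersContainer(Generic[TOptimizer]):
    """Holds optimizers of a single concrete type, one per model part."""

    def __init__(
        self,
        model_parts: List[nn.Module],
        optimizer_cls: Type[TOptimizer],
        optimizer_kwargs: Dict[str, Any],
    ) -> None:
        self.model_parts = model_parts
        self.optimizers: List[TOptimizer] = [
            optimizer_cls(m.parameters(), **optimizer_kwargs) for m in model_parts
        ]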

fegin (Contributor) commented Feb 21, 2025

Yes, using `cls` is a good idea, just like for the model. I didn't change it because it was originally coded that way and I didn't want to change too much in one PR. I vote for changing from name to cls.
