**Training script** #23

@arampacha

  • add bf16 support
  • check that training with bf16 weights works correctly
  • add resuming from a checkpoint
  • add wandb tracking
  • complete the Adafactor option
  • figure out how best to use the profiler to optimize the training loop
  • add gradient accumulation
  • support iterable datasets and a max_steps argument
  • add a prefetch generator for the dataloader
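
For the last three items, here is a rough sketch of how a prefetch generator, a `max_steps` cutoff over an iterable (possibly endless) dataset, and gradient accumulation could fit together. It uses only the Python standard library, since the issue does not name a framework. The names `prefetch`, `train`, `buffer_size`, and `accum_steps` are made up for illustration, and the "gradient" is a stand-in (the batch mean), not a real backward pass.

```python
import queue
import threading

_SENTINEL = object()  # marks the end of the underlying iterable


def prefetch(iterable, buffer_size=2):
    """Yield items from `iterable`, loading up to `buffer_size` ahead in a background thread."""
    q = queue.Queue(maxsize=buffer_size)
    error = []

    def producer():
        try:
            for item in iterable:
                q.put(item)  # blocks while the buffer is full
        except Exception as exc:
            error.append(exc)  # re-raised in the consumer below
        finally:
            q.put(_SENTINEL)

    # Daemon thread: if the consumer stops early, the blocked producer
    # does not keep the process alive.
    threading.Thread(target=producer, daemon=True).start()

    while True:
        item = q.get()
        if item is _SENTINEL:
            if error:
                raise error[0]
            return
        yield item


def train(batches, max_steps, accum_steps):
    """Toy loop: accumulate `accum_steps` micro-batches per update, stop after `max_steps` updates."""
    updates = []
    accum = 0.0
    micro = 0
    for batch in prefetch(batches):
        accum += sum(batch) / len(batch)  # stand-in for a per-batch gradient
        micro += 1
        if micro % accum_steps == 0:
            updates.append(accum / accum_steps)  # averaged "gradient" applied as one update
            accum = 0.0
            if len(updates) >= max_steps:
                break  # works for iterable datasets with no known length
    return updates


def stream():
    """Hypothetical endless dataset yielding two-element batches."""
    i = 0
    while True:
        yield [i, i + 1]
        i += 1


print(train(stream(), max_steps=3, accum_steps=2))  # [1.0, 3.0, 5.0]
```

Counting optimizer updates rather than micro-batches for `max_steps` is one possible convention; the actual script may define a step differently.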
