Skip to content

Support chat template in prepare #262

@sohamparikh

Description

@sohamparikh

🎯 Goal (What & Why)

Support chat template during dataset preparation to make it easier for SFT, DPO and other instruction finetuning methods. This takes away from the user the onus of preparing datasets and being careful about constructing loss masking spans, eos and bos tokens. e.g., OpenRLHF directly works with chat templates.

🚀 Execution Plan

📌 Acceptance Criteria (Must-Haves for Completion)

  • The feature must be functional and tested.
  • The implementation must be documented in practical terms.
  • No refactors unless directly necessary for feature completion.

🛠️ Project Management

  • Assign the project to the Fast-LLM project.
  • Set the Estimate field (in days) in the GitHub project.
  • Use the Size field to categorize the PR size (Small/Medium/Large).
  • Assign an owner when opening the issue.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions