[WIP] Add finetuning samples #36723

mohammadsheraj · 2025-11-24T07:11:56Z

Packages impacted by this PR

Issues associated with this PR

Describe the problem that is addressed by this PR

What are the possible designs available to address the problem? If there are more than one possible design, why was the one in this PR chosen?

Are there test cases added in this PR? (If not, why?)

Provide a list of related PRs (if any)

Command used to generate this PR:***(Applicable only to SDK release request PRs)*

Checklists

Added impacted package name to the issue description
Does this PR needs any fixes in the SDK Generator?** (If so, create an Issue in the Autorest/typescript repository and link it here)
Added a changelog (if necessary)

Copilot

Pull request overview

This PR adds finetuning samples for the @azure/ai-projects SDK, introducing sample data files for various finetuning methods (supervised fine-tuning, reward fine-tuning, and direct preference optimization) and a TypeScript sample demonstrating their use.

Key changes:

Added TypeScript sample for finetuning operations
Added JSONL training/validation datasets for SFT, RFT, and DPO methods
Includes training data for multiple finetuning approaches

Reviewed changes

Copilot reviewed 6 out of 7 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
`sample_finetuning_supervised_job.ts`	TypeScript sample demonstrating finetuning workflow (currently shows file operations only)
`sft_training_set.jsonl`	Supervised fine-tuning training dataset with 10 examples
`sft_validation_set.jsonl`	Supervised fine-tuning validation dataset with 10 examples
`rft_training_set.jsonl`	Reward fine-tuning training dataset with 100 arithmetic problem examples
`rft_validation_set.jsonl`	Reward fine-tuning validation dataset with 50 arithmetic problem examples
`dpo_training_set.jsonl`	Direct preference optimization training dataset with 20 examples
`dpo_validation_set.jsonl`	Direct preference optimization validation dataset with 2 examples

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_supervised_job.ts

Copilot · 2025-11-24T07:21:58Z

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_supervised_job.ts

+
+const projectEndpoint = process.env["AZURE_AI_PROJECT_ENDPOINT"] || "<project endpoint string>";
+const __dirname = path.dirname(fileURLToPath(import.meta.url));
+const filePath = path.join(__dirname, "data", "training_set.jsonl");


The variable filePath references "data", "training_set.jsonl" but the actual data file in this PR is named sft_training_set.jsonl (for supervised fine-tuning). The path should be corrected to path.join(__dirname, "data", "sft_training_set.jsonl") to match the actual file structure.

Suggested change

const filePath = path.join(__dirname, "data", "training_set.jsonl");

const filePath = path.join(__dirname, "data", "sft_training_set.jsonl");

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_reinforcement_job.ts

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_supervised_job.ts

Add finetuning samples

fef8f2b

Copilot AI review requested due to automatic review settings November 24, 2025 07:11

mohammadsheraj requested review from bobogogo1990, dargilco, ganeshyb and glharper as code owners November 24, 2025 07:11

Copilot started reviewing on behalf of mohammadsheraj November 24, 2025 07:12 View session

Copilot finished reviewing on behalf of mohammadsheraj November 24, 2025 07:13

Copilot AI reviewed Nov 24, 2025

View reviewed changes

mohammadsheraj added 5 commits November 25, 2025 10:11

Add SFT tasks

362f447

Add more functionality

f65f617

fix

dec299d

Add all finetuning jobs.

8b31845

Update the match

d7a6a4d

mohammadsheraj changed the title ~~Add finetuning samples~~ [WIP] Add finetuning samples Nov 25, 2025

mohammadsheraj added 6 commits November 25, 2025 23:21

fix code

9dd1d9d

add listEvents

0e587ff

fix

673e8f7

Add checkpoints

5facf7e

complete code

bcecd46

update dependencies

1a80811

mohammadsheraj requested review from a team and jeremymeng as code owners November 26, 2025 06:30

Improve code

2bc8c5f

bobogogo1990 reviewed Nov 26, 2025

View reviewed changes

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_reinforcement_job.ts Outdated Show resolved Hide resolved

bobogogo1990 reviewed Nov 26, 2025

View reviewed changes

sdk/ai/ai-projects/samples-dev/finetuning/sample_finetuning_supervised_job.ts Outdated Show resolved Hide resolved

mohammadsheraj added 4 commits November 27, 2025 13:05

Add all JS FT samples

2e04862

Remove wait for event

4fc81c6

Remove any

a7369c6

Removed any

cf5afb9

mohammadsheraj added 3 commits November 27, 2025 13:54

Removed all any

52d8cb9

Add full change

3beab99

Fix error

0143349

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Add finetuning samples #36723

[WIP] Add finetuning samples #36723

Uh oh!

mohammadsheraj commented Nov 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Nov 24, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	const filePath = path.join(__dirname, "data", "training_set.jsonl");
	const filePath = path.join(__dirname, "data", "sft_training_set.jsonl");

[WIP] Add finetuning samples #36723

Are you sure you want to change the base?

[WIP] Add finetuning samples #36723

Uh oh!

Conversation

mohammadsheraj commented Nov 24, 2025

Packages impacted by this PR

Issues associated with this PR

Describe the problem that is addressed by this PR

What are the possible designs available to address the problem? If there are more than one possible design, why was the one in this PR chosen?

Are there test cases added in this PR? (If not, why?)

Provide a list of related PRs (if any)

Command used to generate this PR:**(Applicable only to SDK release request PRs)

Checklists

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Command used to generate this PR:***(Applicable only to SDK release request PRs)*