Skip to content

Conversation

@jayesh-tanna
Copy link
Member

@jayesh-tanna jayesh-tanna commented Oct 15, 2025

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

  • The pull request does not introduce [breaking changes]
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.

Copilot AI review requested due to automatic review settings October 15, 2025 15:30
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds sample code demonstrating how to use fine-tuning jobs with the Azure AI Projects SDK. The samples cover three fine-tuning methods: supervised learning, reinforcement learning, and Direct Preference Optimization (DPO). Both synchronous and asynchronous implementations are provided for each method.

Key Changes:

  • Added six new Python sample files demonstrating fine-tuning job operations
  • Included sample training and validation datasets in JSONL format
  • Updated CHANGELOG to document the new samples

Reviewed Changes

Copilot reviewed 12 out of 13 changed files in this pull request and generated 7 comments.

Show a summary per file
File Description
sample_finetuning_supervised_job_async.py Async implementation for supervised fine-tuning with job creation, retrieval, listing, and cancellation
sample_finetuning_supervised_job.py Synchronous implementation for supervised fine-tuning operations
sample_finetuning_reinforcement_job_async.py Async implementation for reinforcement fine-tuning with pause/resume and checkpoint listing
sample_finetuning_reinforcement_job.py Synchronous implementation for reinforcement fine-tuning operations
sample_finetuning_dpo_job_async.py Async implementation for DPO fine-tuning job creation and retrieval
sample_finetuning_dpo_job.py Synchronous implementation for DPO fine-tuning operations
validation_set.jsonl Sample validation data with conversational prompts and responses
training_set.jsonl Sample training data with conversational prompts and responses
dpo_validation_set.jsonl DPO-specific validation data with preferred and non-preferred outputs
dpo_training_set.jsonl DPO-specific training data with preferred and non-preferred outputs
countdown_valid_50.jsonl Validation dataset for arithmetic problem-solving tasks
CHANGELOG.md Documents the addition of fine-tuning samples

@jayesh-tanna jayesh-tanna changed the title Adding samples for fine tuning job Adding samples for files and fine tuning job Oct 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant