Setting up the Experimental Pipeline

Create a new resource group

Follow instructions on creating a new Resource Group in Azure.

Further instructions on managing the Access Control (IAM) can be found on Assign Azure roles using the Azure portal .

Create a new workspace

Follow instructions on creating a new ML Workspace in Azure.

Install Azure CLI (v2)

Follow instructions to setup Azure CLI.

Install Azure ML extension in VS Code

Follow instructions to set up Visual Studio Code desktop with the Azure Machine Learning extension.

Setting up requirements

The provided shell script creates the necessary prerequisites, a Registry, a sample Environment, and a Compute Cluster.

export RESOURCE_GROUP=<your-resource-group-name>
export WORKSPACE_NAME=<your-workspace-name>
bash setup/setup.sh

The created Registry is named after your workspace, <your-workspace-name>-registry, the Environment is named sample-environment, and the Compute Cluster cpu-cluster.

Note that this script just creates a starter Environment and you would need to customize the sample-conda.yml as per your project needs. I recommend creating one Environment for development and a separate one for Production purposes. You would find them under mlops and deploy directories.

Similarly, the Compute Cluster must be adapted based on your hardware requirements. For this, modifications to the cpu-cluster.yml file may be necessary.

Why create a Registry?

Registry facilitates sharing components, environments, models, and data assets with collaborators within your organization. You can also see this as a way to track your staging or production assets. Additionally, you can control who has access to them!

The excerpt below from Azure Documentation further expands on the utility.

There are two scenarios where you'd want to use the same set of models, components and environments in multiple workspaces:

Cross-workspace MLOps: You're training a model in a dev workspace and need to deploy it to test and prod workspaces. In this case you, want to have end-to-end lineage between endpoints to which the model is deployed in test or prod workspaces and the training job, metrics, code, data and environment that was used to train the model in the dev workspace.
Share and reuse models and pipelines across different teams: Sharing and reuse improve collaboration and productivity. In this scenario, you may want to publish a trained model and the associated components and environments used to train it to a central catalog. From there, colleagues from other teams can search and reuse the assets you shared in their own experiments.

Additional documentation can be found here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Setup_README.md

Setup_README.md

Setting up the Experimental Pipeline

Create a new resource group

Create a new workspace

Install Azure CLI (v2)

Install Azure ML extension in VS Code

Setting up requirements

Why create a Registry?

Files

Setup_README.md

Latest commit

History

Setup_README.md

File metadata and controls

Setting up the Experimental Pipeline

Create a new resource group

Create a new workspace

Install Azure CLI (v2)

Install Azure ML extension in VS Code

Setting up requirements

Why create a Registry?