Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
mnist-pvc.yaml		mnist-pvc.yaml

README.md

Create a InferenceService for on-prem cluster

The guide shows how to train model and create InferenceService for the trained model for on-prem cluster.

Prerequisites

Refer to the document to create Persistent Volume (PV) and Persistent Volume Claim (PVC), the PVC will be used to store model.

Training model

Follow the mnist example guide to train a mnist model and store it to PVC. The InferenceService is deployed in the notebook example by Kubeflow Fairing that uses kfserving SDK. If you want to apply the InferenceService via kubectl by using the YAML format as below, no need to run the deployment step in the notebook. In this example, the relative path of model will be ./export/ on the PVC.

Create the InferenceService

Update the ${PVC_NAME} to the created PVC name in the mnist-pvc.yaml and apply:

kubectl apply -f mnist-pvc.yaml

Expected Output

$ inferenceservice.serving.kubeflow.org/mnist-sample configured

Check the InferenceService

$ kubectl get inferenceservice
NAME           URL                                                               READY     DEFAULT TRAFFIC   CANARY TRAFFIC   AGE
mnist-sample   http://mnist-sample.kubeflow.example.com/v1/models/mnist-sample   True      100                                1m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pvc

pvc

README.md

Create a InferenceService for on-prem cluster

Prerequisites

Training model

Create the InferenceService

Check the InferenceService

Files

pvc

Directory actions

More options

Directory actions

More options

Latest commit

History

pvc

Folders and files

parent directory

README.md

Create a InferenceService for on-prem cluster

Prerequisites

Training model

Create the InferenceService

Check the InferenceService