Skip to content

Commit a0c0e8d

Browse files
authored
remove flex gpu workloads (#2689)
* remove flex gpu workloads * update docs * remove flex references * remove unnecessary file
1 parent 9fbb0fa commit a0c0e8d

File tree

341 files changed

+3
-62356
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

341 files changed

+3
-62356
lines changed

README.md

+3-18
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ and a list of models that are supported on Windows, see the
4444

4545
Instructions available to run on [Sapphire Rapids](https://www.intel.com/content/www/us/en/newsroom/opinion/updates-next-gen-data-center-platform-sapphire-rapids.html#gs.blowcx).
4646

47-
For best performance on Intel® Data Center GPU Flex and Max Series, please check the [list of supported workloads](#intel-data-center-gpu-workloads). It provides instructions to run inference and training using [Intel(R) Extension for PyTorch](https://github.com/intel/intel-extension-for-pytorch) or [Intel(R) Extension for TensorFlow](https://github.com/intel/intel-extension-for-tensorflow).
47+
For best performance on Intel® Data Center GPU Max Series, please check the [list of supported workloads](#intel-data-center-gpu-workloads). It provides instructions to run inference and training using [Intel(R) Extension for PyTorch](https://github.com/intel/intel-extension-for-pytorch) or [Intel(R) Extension for TensorFlow](https://github.com/intel/intel-extension-for-tensorflow).
4848

4949
### Image Recognition
5050

@@ -128,36 +128,21 @@ For best performance on Intel® Data Center GPU Flex and Max Series, please chec
128128
## Intel® Data Center GPU Workloads
129129
| Model | Framework | Mode | GPU Type | Model Documentation |
130130
| ----------------------------------| ---------- | ----------| -------- | ------------------- |
131-
| [ResNet 50v1.5](https://github.com/tensorflow/models/tree/v2.11.0/official/legacy/image_classification/resnet) | TensorFlow | Inference | Flex Series | [Float32 TF32 Float16 BFloat16 Int8](/models_v2/tensorflow/resnet50v1_5/inference/gpu/README.md) |
132131
| [ResNet 50 v1.5](https://github.com/tensorflow/models/tree/v2.11.0/official/legacy/image_classification/resnet) | TensorFlow | Training | Max Series | [BFloat16 FP32](/models_v2/tensorflow/resnet50v1_5/training/gpu/README.md) |
133-
| [ResNet 50 v1.5](https://arxiv.org/pdf/1512.03385.pdf) | PyTorch | Inference | Flex Series, Max Series, Arc Series |[Int8 FP32 FP16 TF32](/models_v2/pytorch/resnet50v1_5/inference/gpu/README.md) |
132+
| [ResNet 50 v1.5](https://arxiv.org/pdf/1512.03385.pdf) | PyTorch | Inference | Max Series, Arc Series |[Int8 FP32 FP16 TF32](/models_v2/pytorch/resnet50v1_5/inference/gpu/README.md) |
134133
| [ResNet 50 v1.5](https://arxiv.org/pdf/1512.03385.pdf) | PyTorch | Training | Max Series, Arc Series |[BFloat16 TF32 FP32](/models_v2/pytorch/resnet50v1_5/training/gpu/README.md) |
135-
| [DistilBERT](https://arxiv.org/pdf/1910.01108.pdf) | PyTorch | Inference | Flex Series, Max Series | [FP32 FP16 BF16 TF32](/models_v2/pytorch/distilbert/inference/gpu/README.md) |
136-
| [DLRM v1](https://arxiv.org/pdf/1906.00091.pdf) | PyTorch | Inference | Flex Series | [FP16 FP32](/models_v2/pytorch/dlrm/inference/gpu/README.md) |
134+
| [DistilBERT](https://arxiv.org/pdf/1910.01108.pdf) | PyTorch | Inference | Max Series | [FP32 FP16 BF16 TF32](/models_v2/pytorch/distilbert/inference/gpu/README.md) |
137135
| [SSD-MobileNet*](https://arxiv.org/pdf/1704.04861.pdf)| PyTorch | Inference | Arc Series| [INT8 FP16 FP32](/models_v2/pytorch/ssd-mobilenetv1/inference/gpu/README.md) |
138-
| [EfficientNet](https://arxiv.org/pdf/1905.11946.pdf) | PyTorch | Inference | Flex Series | [FP16 BF16 FP32](/models_v2/pytorch/efficientnet/inference/gpu/README.md) |
139-
| [EfficientNet](https://arxiv.org/pdf/1905.11946.pdf) | TensorFlow | Inference | Flex Series | [FP16](/models_v2/tensorflow/efficientnet/inference/gpu/README.md) |
140-
| [FBNet](https://arxiv.org/pdf/1812.03443.pdff) | PyTorch | Inference | Flex Series | [FP16 BF16 FP32](/models_v2/pytorch/fbnet/inference/gpu/README.md) |
141-
| [Wide Deep Large Dataset](https://arxiv.org/pdf/2112.10752.pdf) | TensorFlow | Inference | Flex Series | [FP16](/models_v2/tensorflow/wide_deep_large_ds/inference/gpu/README.md) |
142-
| [YOLO V5](https://arxiv.org/pdf/2108.11539.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/yolov5/inference/gpu/README.md) |
143136
| [BERT large](https://arxiv.org/pdf/1810.04805.pdf) | PyTorch | Inference | Max Series, Arc Series | [BFloat16 FP32 FP16](/models_v2/pytorch/bert_large/inference/gpu/README.md) |
144137
| [BERT large](https://arxiv.org/pdf/1810.04805.pdf) | PyTorch | Training | Max Series, Arc Series | [BFloat16 FP32 TF32](/models_v2/pytorch/bert_large/training/gpu/README.md) |
145138
| [BERT large](https://arxiv.org/pdf/1810.04805.pdf) | TensorFlow | Training | Max Series | [BFloat16 TF32 FP32](/models_v2/tensorflow/bert_large/training/gpu/README.md) |
146139
| [DLRM v2](https://arxiv.org/abs/1906.00091) | PyTorch | Inference | Max Series | [FP32 BF16](/models_v2/pytorch/torchrec_dlrm/inference/gpu/README.md)
147140
| [DLRM v2](https://arxiv.org/abs/1906.00091) | PyTorch | Training | Max Series | [FP32 TF32 BF16](/models_v2/pytorch/torchrec_dlrm/training/gpu/README.md)
148141
| [3D-Unet](https://arxiv.org/pdf/1606.06650.pdf) | PyTorch | Inference | Max Series | [FP16 INT8 FP32](/models_v2/pytorch/3d_unet/inference/gpu/README.md) |
149142
| [3D-Unet](https://arxiv.org/pdf/1606.06650.pdf) | TensorFlow | Training | Max Series | [BFloat16 FP32](/models_v2/tensorflow/3d_unet/training/gpu/README.md) |
150-
| [Stable Diffusion](https://arxiv.org/pdf/2112.10752.pdf) | PyTorch | Inference | Flex Series, Max Series, Arc Series | [FP16 FP32](/models_v2/pytorch/stable_diffusion/inference/gpu/README.md) |
151-
| [Stable Diffusion](https://arxiv.org/pdf/2112.10752.pdf) | TensorFlow | Inference | Flex Series | [FP16 FP32](/models_v2/tensorflow/stable_diffusion/inference/gpu/README.md) |
152-
| [Mask R-CNN](https://arxiv.org/pdf/1703.06870.pdf) | TensorFlow | Inference | Flex Series | [FP32 Float16](/models_v2/tensorflow/maskrcnn/inference/gpu/README.md) |
153143
| [Mask R-CNN](https://arxiv.org/pdf/1703.06870.pdf) | TensorFlow | Training | Max Series | [FP32 BFloat16](/models_v2/tensorflow/maskrcnn/training/gpu/README.md) |
154-
| [Swin Transformer](https://arxiv.org/pdf/2103.14030.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/swin-transformer/inference/gpu/README.md) |
155-
| [FastPitch](https://arxiv.org/pdf/1703.06870.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/fastpitch/inference/gpu/README.md) |
156-
| [UNet++](https://arxiv.org/pdf/1807.10165.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/unetpp/inference/gpu/README.md) |
157144
| [RNN-T](https://arxiv.org/abs/1211.3711) | PyTorch | Inference | Max Series | [FP16 BF16 FP32](/models_v2/pytorch/rnnt/inference/gpu/README.md) |
158145
| [RNN-T](https://arxiv.org/abs/1211.3711) | PyTorch | Training | Max Series | [FP32 BF16 TF32](/models_v2/pytorch/rnnt/training/gpu/README.md) |
159-
| [IFRNet](https://arxiv.org/pdf/2205.14620.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/IFRNet/inference/gpu/README.md) |
160-
| [RIFE](https://arxiv.org/pdf/2011.06294.pdf) | PyTorch | Inference | Flex Series | [FP16](/models_v2/pytorch/RIFE/inference/gpu/README.md) |
161146

162147
## How to Contribute
163148
If you would like to add a new benchmarking script, please use [this guide](/CONTRIBUTING.md).

docker/pytorch/3d_unet/inference/gpu/pytorch-flex-series-3dunet-inference.Dockerfile

-43
This file was deleted.

docker/pytorch/3d_unet/inference/gpu/tests.yaml

-25
This file was deleted.

docker/pytorch/dlrm/inference/gpu/pytorch-flex-series-dlrm-v1-inference.Dockerfile

-43
This file was deleted.

docker/pytorch/dlrm/inference/gpu/tests.yaml

-19
This file was deleted.

docker/pytorch/docker-compose.yml

-107
Original file line numberDiff line numberDiff line change
@@ -299,55 +299,13 @@ services:
299299
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-gpu-resnet50v1-5-training
300300
cap_drop:
301301
- NET_RAW
302-
efficientnet-inference-gpu:
303-
build:
304-
dockerfile: docker/pytorch/efficientnet/inference/gpu/pytorch-flex-series-efficientnet-inference.Dockerfile
305-
extends: stable_diffusion-inference-gpu
306-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-flex-gpu-efficientnet-inference
307-
cap_drop:
308-
- NET_RAW
309-
yolov5-inference-gpu:
310-
build:
311-
dockerfile: docker/pytorch/yolov5/inference/gpu/pytorch-flex-series-yolov5-inference.Dockerfile
312-
extends: stable_diffusion-inference-gpu
313-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-object-detection-pytorch-flex-gpu-yolov5-inference
314-
cap_drop:
315-
- NET_RAW
316-
dlrm-inference-gpu:
317-
build:
318-
dockerfile: docker/pytorch/dlrm/inference/gpu/pytorch-flex-series-dlrm-v1-inference.Dockerfile
319-
extends: stable_diffusion-inference-gpu
320-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-recommendation-pytorch-flex-gpu-dlrm-v1-inference
321-
cap_drop:
322-
- NET_RAW
323302
distilbert-inference-gpu:
324303
build:
325304
dockerfile: docker/pytorch/distilbert/inference/gpu/pytorch-gpu-distilbert-inference.Dockerfile
326305
extends: stable_diffusion-inference-gpu
327306
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-language-modeling-pytorch-gpu-distilbert-inference
328307
cap_drop:
329308
- NET_RAW
330-
unetpp-inference-gpu:
331-
build:
332-
dockerfile: docker/pytorch/unetpp/inference/gpu/pytorch-flex-series-unetpp-inference.Dockerfile
333-
extends: stable_diffusion-inference-gpu
334-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-segmentation-pytorch-flex-gpu-unetpp-inference
335-
cap_drop:
336-
- NET_RAW
337-
fastpitch-inference-gpu:
338-
build:
339-
dockerfile: docker/pytorch/fastpitch/inference/gpu/pytorch-flex-series-fast-pitch-inference.Dockerfile
340-
extends: stable_diffusion-inference-gpu
341-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-speech-generation-pytorch-flex-gpu-fast-pitch-inference
342-
cap_drop:
343-
- NET_RAW
344-
swin-transformer-inference-gpu:
345-
build:
346-
dockerfile: docker/pytorch/swin-transformer/inference/gpu/pytorch-flex-series-swin-transformer-inference.Dockerfile
347-
extends: stable_diffusion-inference-gpu
348-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-flex-gpu-swin-transformer-inference
349-
cap_drop:
350-
- NET_RAW
351309
bert_large-inference-gpu:
352310
build:
353311
dockerfile: docker/pytorch/bert_large/inference/gpu/pytorch-max-series-bert-large-inference.Dockerfile
@@ -376,68 +334,3 @@ services:
376334
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-speech-recognition-pytorch-max-gpu-rnnt-training
377335
cap_drop:
378336
- NET_RAW
379-
fbnet-inference-gpu:
380-
build:
381-
dockerfile: docker/pytorch/fbnet/inference/gpu/pytorch-flex-series-fbnet-inference.Dockerfile
382-
extends: stable_diffusion-inference-gpu
383-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-flex-gpu-fbnet-inference
384-
cap_drop:
385-
- NET_RAW
386-
ifrnet-inference-gpu:
387-
build:
388-
dockerfile: docker/pytorch/ifrnet/inference/gpu/pytorch-flex-series-ifrnet-inference.Dockerfile
389-
extends: stable_diffusion-inference-gpu
390-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-interpolation-pytorch-flex-gpu-ifrnet-inference
391-
cap_drop:
392-
- NET_RAW
393-
rife-inference-gpu:
394-
build:
395-
dockerfile: docker/pytorch/rife/inference/gpu/pytorch-flex-series-rife-inference.Dockerfile
396-
extends: stable_diffusion-inference-gpu
397-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-interpolation-pytorch-flex-gpu-rife-inference
398-
cap_drop:
399-
- NET_RAW
400-
efficientnet-cuda-inference:
401-
build:
402-
context: ../../
403-
args:
404-
http_proxy: ${http_proxy}
405-
https_proxy: ${https_proxy}
406-
no_proxy: ""
407-
NO_PROXY: ""
408-
CUDA_BASE_IMAGE: ${CUDA_BASE_IMAGE:-nvcr.io/nvidia/pytorch}
409-
CUDA_BASE_TAG: ${CUDA_BASE_TAG:-24.01-py3}
410-
dockerfile: docker/pytorch/efficientnet/inference/gpu/pytorch-cuda-series-efficientnet-inference.Dockerfile
411-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-cuda-gpu-efficientnet-inference
412-
pull_policy: always
413-
cap_drop:
414-
- NET_RAW
415-
yolov5-cuda-inference:
416-
build:
417-
dockerfile: docker/pytorch/yolov5/inference/gpu/pytorch-cuda-series-yolov5-inference.Dockerfile
418-
extends: efficientnet-cuda-inference
419-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-object-detection-pytorch-cuda-gpu-yolov5-inference
420-
pull_policy: always
421-
cap_drop:
422-
- NET_RAW
423-
fbnet-cuda-inference:
424-
build:
425-
dockerfile: docker/pytorch/fbnet/inference/gpu/pytorch-cuda-series-fbnet-inference.Dockerfile
426-
extends: efficientnet-cuda-inference
427-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-recognition-pytorch-cuda-gpu-fbnet-inference
428-
cap_drop:
429-
- NET_RAW
430-
ifrnet-cuda-inference:
431-
build:
432-
dockerfile: docker/pytorch/ifrnet/inference/gpu/pytorch-cuda-series-ifrnet-inference.Dockerfile
433-
extends: efficientnet-cuda-inference
434-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-interpolation-pytorch-cuda-gpu-ifrnet-inference
435-
cap_drop:
436-
- NET_RAW
437-
rife-cuda-inference:
438-
build:
439-
dockerfile: docker/pytorch/rife/inference/gpu/pytorch-cuda-series-rife-inference.Dockerfile
440-
extends: stable_diffusion-inference-gpu
441-
image: ${REGISTRY}/aiops/mlops-ci:b-${GITHUB_RUN_NUMBER:-0}-image-interpolation-pytorch-cuda-gpu-rife-inference
442-
cap_drop:
443-
- NET_RAW

docker/pytorch/efficientnet/inference/gpu/pytorch-cuda-series-efficientnet-inference.Dockerfile

-40
This file was deleted.

docker/pytorch/efficientnet/inference/gpu/pytorch-flex-series-efficientnet-inference.Dockerfile

-41
This file was deleted.

0 commit comments

Comments
 (0)