19 commits
892348d
Update onnxruntime main 8fe48049c089ec50f6f915ca97c37969b7194a96 (#4441)
github-actions[bot] Nov 17, 2025
78f96b1
Add right pad masking mode (#4392)
TedThemistokleous Nov 17, 2025
0d81fd4
[AIMIGRAPHX-273] Fix Jenkins Parallel Stage Displaying Incorrect Time…
eddieliao Nov 18, 2025
d80331a
Refactor GroupQueryAttention (#4396)
turneram Nov 19, 2025
66b6e36
Flash decoding round 1; AIMIGRAPHX-242 (#4393)
bdevorem Nov 19, 2025
8254ea9
Show attributes in onnx trace (#4445)
pfultz2 Nov 19, 2025
ce3870f
change opset version for resnet50 example (#4446)
aarushjain29 Nov 19, 2025
71d23dd
[AIMIGRAPHX-326] Fix "reduce_sum: axes: value out of range" error in …
pfultz2 Nov 19, 2025
72ed531
disable matching for dynamic shapes (#4362)
shivadbhavsar Nov 20, 2025
074e4d8
Add initial unit tests for jit gpu kernels (#4278)
pfultz2 Nov 20, 2025
4444596
[Ex CI] Enable PipelineRepoRef parameter (#4455)
amd-hsivasun Nov 20, 2025
1235768
clamping the scale (#4410)
aarushjain29 Nov 20, 2025
bdbf814
quieten the verbose console output of compiler warnings and errors, i…
lakhinderwalia Nov 20, 2025
f9effd4
Use local variable in place of DOCKER_OPTS (#4454)
eddieliao Nov 21, 2025
52785bb
Bump rocm-docs-core from 1.29.0 to 1.30.0 in /docs/sphinx (#4449)
dependabot[bot] Nov 21, 2025
8a1e49f
Algo min element if tests (#4447)
pfultz2 Nov 21, 2025
d1a1b43
Docs: Change images to .png to resolve lint warnings (#4457)
harkgill-amd Nov 22, 2025
80fa637
Improve layout propagation in poinwise fusion when using broadcasted …
pfultz2 Nov 22, 2025
26f7e37
Update onnxruntime main 4665804592d1ae341094b93c0717b1c1c6656559 (#4458)
github-actions[bot] Nov 22, 2025
14 changes: 10 additions & 4 deletions .azuredevops/rocm-ci.yml
@@ -1,13 +1,19 @@
+variables:
+- group: common
+- template: /.azuredevops/variables-global.yml@pipelines_repo
+
+parameters:
+- name: pipelinesRepoRef
+  type: string
+  default: refs/heads/develop
+
 resources:
   repositories:
   - repository: pipelines_repo
     type: github
     endpoint: ROCm
     name: ROCm/ROCm
-
-variables:
-- group: common
-- template: /.azuredevops/variables-global.yml@pipelines_repo
+    ref: ${{ parameters.pipelinesRepoRef }}

 trigger:
   batch: true
2 changes: 2 additions & 0 deletions CMakeLists.txt
@@ -24,7 +24,7 @@
cmake_minimum_required(VERSION 3.15 FATAL_ERROR)

if("${CMAKE_SOURCE_DIR}" STREQUAL "${CMAKE_BINARY_DIR}")
message(FATAL_ERROR "The binary and source directroy cannot be the same")

Check warning on line 27 in CMakeLists.txt (GitHub Actions / misspell):
./CMakeLists.txt:27:47: "directroy" is a misspelling of "directory"
endif()

# Setup valid strings for build type
@@ -300,6 +300,8 @@
# Disable because of too many FPs
arithOperationsOnVoidPointer
definePrefix:*test/include/test.hpp
+definePrefix:*src/targets/gpu/kernels/include/migraphx/kernels/test.hpp
+UseNamedLogicOperator:*src/targets/gpu/kernels/include/migraphx/kernels/debug.hpp
ctuOneDefinitionRuleViolation:*test/*
useSmartPointer:*src/api/api.cpp
useSmartPointer:*make_shared_array.hpp
572 changes: 390 additions & 182 deletions Jenkinsfile

Large diffs are not rendered by default.

File renamed without changes
File renamed without changes
6 changes: 3 additions & 3 deletions docs/dev/tools.rst
@@ -55,11 +55,11 @@ Provides additional debug information related to data. Use for debugging purpose

Example output:

-.. image:: ../data/roctx1.jpg
+.. image:: ../data/roctx1.png

-Hotspot kerel timing information:
+Hotspot kernel timing information:

-.. image:: ../data/roctx2.jpg
+.. image:: ../data/roctx2.png

The output provides ``SUM``, ``MIN``, ``MAX`` and ``COUNT`` information for each kernel executed for a given model. It also
provides the average total time. The following three files are provided for reference:
8 changes: 8 additions & 0 deletions docs/reference/MIGraphX-dev-env-vars.rst
@@ -160,6 +160,14 @@ Model performance tunable variables change the compilation behavior of a model.

| Default: Split-k performance configurations are turned off.

+* - | ``MIGRAPHX_FLASH_DECODING_NUM_SPLITS``
+| Turns on flash decoding for attention fusion and sets the number of splits along the key-value sequence dimension.
+
+- | ``0``: Flash decoding is turned off (i.e., number of splits is 0).
+| ``N`` (where N > 1): Enables flash decoding with N splits along the key-value sequence dimension. For example, ``2`` enables flash decoding with 2 splits, ``4`` with 4 splits, etc.
+
+| Default: flash decoding is turned off.
+
* - | ``MIGRAPHX_DISABLE_FP16_INSTANCENORM_CONVERT``
| When set, FP16 is not converted to FP32 in the ``InstanceNormalization`` ONNX operator.

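The split-and-recombine idea behind `MIGRAPHX_FLASH_DECODING_NUM_SPLITS` can be illustrated with a minimal pure-Python sketch. It is not MIGraphX's implementation: the function names (`attention`, `partial_attention`, `flash_decode`) are hypothetical, it handles a single query vector, and treating any split count below 2 as "off" is an assumption (the documentation in this diff only defines `0` and `N > 1`). Each split of the key-value sequence yields a local maximum score, a sum of exponentials, and an unnormalized output; the partial results are then rescaled to a common maximum and merged.

```python
import math


def _scores(q, keys):
    # Scaled dot-product scores of one query vector against every key.
    scale = 1.0 / math.sqrt(len(q))
    return [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]


def partial_attention(q, keys, values):
    """Attention over one split.

    Returns (max score, sum of exponentials, unnormalized output vector).
    """
    scores = _scores(q, keys)
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    dim = len(values[0])
    out = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return m, sum(weights), out


def attention(q, keys, values):
    """Reference softmax attention over the whole key-value sequence."""
    _, total, out = partial_attention(q, keys, values)
    return [x / total for x in out]


def flash_decode(q, keys, values, num_splits):
    """Split the key-value sequence, attend per split, then merge the partials.

    Assumption: num_splits < 2 means splitting is off, so the reference
    path is used.
    """
    if num_splits < 2:
        return attention(q, keys, values)
    n = len(keys)
    chunk = math.ceil(n / num_splits)
    partials = [
        partial_attention(q, keys[i:i + chunk], values[i:i + chunk])
        for i in range(0, n, chunk)
    ]
    # Rescale every partial result to the global maximum before summing.
    m_all = max(m for m, _, _ in partials)
    dim = len(values[0])
    total = 0.0
    out = [0.0] * dim
    for m, l, o in partials:
        factor = math.exp(m - m_all)
        total += factor * l
        for d in range(dim):
            out[d] += factor * o[d]
    return [x / total for x in out]
```

Because the merge only rescales each partial result by `exp(m_split - m_global)`, the output should agree with unsplit attention up to floating-point rounding for any number of splits.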
2 changes: 1 addition & 1 deletion docs/sphinx/requirements.in
@@ -1,2 +1,2 @@
-rocm-docs-core==1.29.0
+rocm-docs-core==1.30.0
sphinx-collapse
2 changes: 1 addition & 1 deletion docs/sphinx/requirements.txt
@@ -211,7 +211,7 @@ requests==2.32.4
# via
# pygithub
# sphinx
-rocm-docs-core==1.29.0
+rocm-docs-core==1.30.0
# via -r requirements.in
rpds-py==0.22.3
# via
4 changes: 2 additions & 2 deletions examples/onnxruntime/resnet50/resnet50.py
@@ -1,7 +1,7 @@
#####################################################################################
# The MIT License (MIT)
#
-# Copyright (c) 2015-2024 Advanced Micro Devices, Inc. All rights reserved.
+# Copyright (c) 2015-2025 Advanced Micro Devices, Inc. All rights reserved.
#
# Permission is hereby granted, free of charge, to any person obtaining a copy
# of this software and associated documentation files (the "Software"), to deal
@@ -219,7 +219,7 @@ def main():
"resnet50.onnx", # where to save the model (can be a file or file-like object)
export_params=
True, # store the trained parameter weights inside the model file
-opset_version=12, # the ONNX version to export the model to
+opset_version=18, # the ONNX version to export the model to
do_constant_folding=
True, # whether to execute constant folding for optimization
input_names=['input'], # the model's input names
3 changes: 2 additions & 1 deletion src/CMakeLists.txt
@@ -190,6 +190,7 @@ register_migraphx_ops(
ceil
clip
concat
+concat_past_present
contiguous
convert
convolution
@@ -212,8 +213,8 @@ register_migraphx_ops(
gather
gathernd
get_tuple_elem
+gqa_rotary_embedding
greater
-group_query_attention
group
gru
identity