-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update CHANGELOG.md for 6.4 #3847
Open
causten
wants to merge
9
commits into
release/rocm-rel-6.4
Choose a base branch
from
changelog_updates
base: release/rocm-rel-6.4
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+61
−1
Open
Changes from all commits
Commits
Show all changes
9 commits
Select commit
Hold shift + click to select a range
5096dd2
Update CHANGELOG.md for 6.4
causten d4cd2d7
Update CHANGELOG.md
causten d07e9fd
Update CHANGELOG.md
causten 82158a0
Update CHANGELOG.md
causten c17ef65
Update CHANGELOG.md
causten 3215985
Update CHANGELOG.md
causten f0403d2
Update CHANGELOG.md
causten d610be1
updates after doc review
causten 9325bd9
additional review updates
causten File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -3,6 +3,66 @@ | |||||
Full documentation for MIGraphX is available at | ||||||
[https://rocmdocs.amd.com/projects/AMDMIGraphX/en/latest/](https://rocmdocs.amd.com/projects/AMDMIGraphX/en/latest/). | ||||||
|
||||||
## MIGraphX 2.12 for ROCm 6.4.0 | ||||||
|
||||||
### Added | ||||||
|
||||||
* Support for gfx1200 and gfx1201 | ||||||
* hipBLASLt support for contiguous transpose GEMM fusion and GEMM pointwise fusions for improved performance | ||||||
* Support for hardware specific FP8 datatypes (FP8 OCP and FP8 FNUZ) | ||||||
* Add support for the BF16 datatype | ||||||
* ONNX Operator Support for `com.microsoft.MultiHeadAttention`, `com.microsoft.NhwcConv`, and `com.microsoft.MatMulIntgerFloat` | ||||||
* migraphx-driver can now produce outfor for use with Netron | ||||||
* migraphx-driver now includes a `time` parameter (similar to `perf`) that is more accurate for very fast kernels | ||||||
* An end-to-end Stable Diffusion 3 example with option to disable T5 encoder on VRAM-limited GPUs has been added | ||||||
* Added support to track broadcast axes in `shape_transform_descriptor` | ||||||
* Added support for unsigned types with `rocMLIR` | ||||||
* Added a script to convert mxr files to ONNX models | ||||||
* Added the `MIGRAPHX_SET_GEMM_PROVIDER` environment variable to choose between rocBLAS and hipBLASLt. Set `MIGRAPHX_SET_GEMM_PROVIDER` to `rocblas` to use rocBLAS, or to `hipblaslt` to use hipBLASLt. | ||||||
|
||||||
|
||||||
### Changed | ||||||
|
||||||
* With the exception of gfx90a, switched to using hipBLASLt instead of rocBLAS | ||||||
* Included the min/max/median of the `perf` run as part of the summary report | ||||||
* Enable non-packed inputs for `rocMLIR` | ||||||
* Always output a packed type for q/dq after determining non-packed tensors were inefficient | ||||||
* Even if using NHWC, MIGraphX will always convert group convolutions to NCHW for best performance | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||
* Renamed the `layout_nhwc` to `layout_convolution` and ensured that either the weights are the same layout as the inputs or set the input and weights to NHWC | ||||||
* Minimum version of Cmake is now 3.27 | ||||||
|
||||||
|
||||||
### Removed | ||||||
|
||||||
* Removed `fp8e5m2fnuz` rocBLAS support | ||||||
* `__AMDGCN_WAVEFRONT_SIZE` has been deprecated. | ||||||
* Removed a warning that printed to stdout when using FP8 types | ||||||
* Remove zero point parameter for dequantizelinear when its zero | ||||||
|
||||||
|
||||||
### Optimized | ||||||
|
||||||
* Prefill buffers when MLIR produces a multioutput buffer | ||||||
causten marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
* Improved the resize operator performance which should improve overall performance of models that use it | ||||||
* Allow the `reduce` operator to be split across an axis to improve fusion performance. The `MIGRAPHX_SPLIT_REDUCE_SIZE` environment variable has been added to allow the minimum size of the reduction to be adjusted for a possible model specific performance improvement | ||||||
* Added `MIGRAPHX_DISABLE_PASSES` environment variable for debugging | ||||||
* Added `MIGRAPHX_MLIR_DUMP` environment variable to be set to a folder where individual final rocMLIR modules can be saved for investigation | ||||||
* Improved the C++ API to allow onnxruntime access to fp8 quantization | ||||||
|
||||||
|
||||||
|
||||||
### Resolved Issues | ||||||
causten marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
* Fixed multistream execution with larger models (#3757) | ||||||
* Peephole LSTM Error (#3768) | ||||||
* Fixed BertSquad example that could include a broken tokenizers package (#3556) | ||||||
* Fixed Attention fusion ito not error with a shape mismatch when a trailing pointwise contains a literal (#3758) | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Is this a typo or is it something that needs to be in quotes/preformatted text? |
||||||
* Fixed instruction::replace() logic to handle more complex cases (#3574) | ||||||
* MatMulNBits could fail with a shape error (#3698) | ||||||
* Fixed a bug were some models could fail to compile with an error `flatten: Shapes are not in standard layout` (#3579) | ||||||
|
||||||
|
||||||
|
||||||
## MIGraphX 2.11 for ROCm 6.3.0 | ||||||
|
||||||
### Added | ||||||
|
@@ -18,7 +78,7 @@ Full documentation for MIGraphX is available at | |||||
* Split-K as an optional performance improvement | ||||||
* Scripts to validate ONNX models from the ONNX Model Zoo | ||||||
* GPU Pooling Kernel | ||||||
* --mlir flag to the migraphx-driver program to offload entire module to mlir | ||||||
* --mlir flag to the migraphx-driver program to offload entire module to rocMLIR | ||||||
* Fusing split-reduce with MLIR | ||||||
* Multiple outputs for the MLIR + Pointwise fusions | ||||||
* Pointwise fusions with MLIR across reshape operations | ||||||
|
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because below everything starts with "added", this either needs to start with "added" or nothing needs to start with "added". It's easier to add "added" to these three lines