-
Notifications
You must be signed in to change notification settings - Fork 769
[SYCL][Matrix] Add support for tf32 type using the unified interface #8151
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
LGTM. Intel backend also does not support tf32 for Accumulator right? btw I think this patch will also allow you to load joint_matrix from |
} | ||
for (int i = 0; i < get_wi_data(sg, sub_b).length(); i++) { | ||
get_wi_data(sg, sub_b)[i] = | ||
round_to_tf32(static_cast<float>(get_wi_data(sg, sub_b)[i])); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this static_cast<float>
required?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No it is not needed I will remove it
// function will work on truncated floats. | ||
for (int i = 0; i < get_wi_data(sg, sub_a).length(); i++) { | ||
get_wi_data(sg, sub_a)[i] = | ||
round_to_tf32(get_wi_data(sg, sub_a)[i]); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add round_to_tf32 that takes a matrix as an input and returns a matrix as an output. This gives a flexibility to the implementation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The patch is LGTM
Please hold the merge for a bit to clarify SPIR-V instruction name (Convert Vs Round)
Replaced with the following: |
No description provided.