[PROTON-DEV] Add realtime metric #7334
base: proton-dev
I don't understand how the i64 return value fits here as we should truncate it to a smaller bitwidth.
Since 9367791, the default clock size has been 64 bits, with some of the higher bits used for other purposes. See triton/third_party/proton/dialect/lib/ProtonGPUToLLVM/Utility.cpp, lines 96 to 122 in 9faa7cd.
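To make the "higher bits used for other purposes" point concrete, here is a minimal sketch of packing a tag into the top of a 64-bit clock word and masking it back out. The 48/16 split and the names `kClockBits`, `packClock`, `clockValue`, and `clockTag` are hypothetical and chosen only for illustration; the actual layout is whatever the referenced lines in Utility.cpp define.

```cpp
#include <cstdint>

// Hypothetical layout (NOT taken from Utility.cpp): the low 48 bits hold the
// clock reading and the high 16 bits hold an unrelated tag.
constexpr unsigned kClockBits = 48;
constexpr std::uint64_t kClockMask = (std::uint64_t{1} << kClockBits) - 1;

// Combine a clock reading and a tag into one 64-bit word.
// Clock bits above kClockBits are dropped by the mask.
constexpr std::uint64_t packClock(std::uint64_t clock, std::uint16_t tag) {
  return (static_cast<std::uint64_t>(tag) << kClockBits) | (clock & kClockMask);
}

// Recover the clock reading by masking off the tag bits.
constexpr std::uint64_t clockValue(std::uint64_t word) {
  return word & kClockMask;
}

// Recover the tag stored in the high bits.
constexpr std::uint16_t clockTag(std::uint64_t word) {
  return static_cast<std::uint16_t>(word >> kClockBits);
}
```

Any consumer that treats the word as a plain i64 timestamp would need a mask step like `clockValue` first, which is why the return width matters in this thread.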
I suppose
I still don't get how "cycle" is different from "time" in this case.
Since we use a global timer to align CTAs across SMs, we only need one global timestamp per CTA to capture the starting time. The right way to do this is to add this 64-bit global timestamp to the metadata section of each CTA (we store
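The suggestion above can be sketched on the host side as follows. This is an illustration under stated assumptions, not Proton's trace format: the `CtaTrace` struct, its fields, and `toGlobalNs` are hypothetical, and the conversion assumes a fixed SM clock frequency supplied by the caller.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical per-CTA record: one global timestamp in the metadata, plus
// per-event readings of the local cycle counter.
struct CtaTrace {
  std::uint64_t globalStartNs;             // global timer, sampled once at CTA start
  std::uint64_t startCycle;                // local cycle counter at the same point
  std::vector<std::uint64_t> eventCycles;  // local cycle counter at each event
};

// Place each event on a global nanosecond timeline.
// Assumes a constant clock of `clockMHz`, so cycles * 1000 / MHz gives ns.
std::vector<std::uint64_t> toGlobalNs(const CtaTrace &cta,
                                      std::uint64_t clockMHz) {
  std::vector<std::uint64_t> out;
  out.reserve(cta.eventCycles.size());
  for (std::uint64_t cycle : cta.eventCycles) {
    const std::uint64_t deltaCycles = cycle - cta.startCycle;
    out.push_back(cta.globalStartNs + deltaCycles * 1000 / clockMHz);
  }
  return out;
}
```

The fixed-frequency assumption is the weak point of this scheme: if the SM clock changes during the kernel, the cycle-to-nanosecond conversion drifts, whereas the CTA start times themselves remain aligned.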
In addition to the existing CYCLE metric, add a new metric, REALTIME, which is a global clock synchronized across SMs. It is guaranteed to count in nanoseconds, regardless of the GPU clock frequency. This is useful for constructing a global timeline for
triton/third_party/proton/common/include/TraceDataIO/TraceWriter.h
Lines 15 to 17 in 9faa7cd