-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clean up for JOSS paper #19
Changes from 15 commits
131b03a
003557c
6ccdfc9
32e7929
8de863c
2f3a8d2
8d26b00
e1fd558
c69b64d
f1e6089
09cc77d
947fd6a
3b414bb
d7d1308
b226590
5e5484e
ac8f2ef
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,19 +1,18 @@ | ||
# Change log | ||
|
||
|
||
<!-- Template for documenting changes | ||
## jaxdecomp 0.0.1 | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Are we really releasing a working code with several new additions as I would personally prefer something like There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I don't have enough open source experience to know the implication of a minor vs patch version |
||
|
||
* Changes | ||
* Change 1 | ||
* Breaking Changes | ||
* Change 1 | ||
* Deprecations | ||
* Some things that are getting deprecated | ||
* Bugs | ||
* Bug 1 | ||
--> | ||
* New version compatible with JAX 0.4.30 | ||
* jaxDecomp now works in a multi-host environment | ||
* Added custom partitioning for FFTs | ||
* Added custom partitioning for halo exchange | ||
* Added custom partitioning for slice_pad and slice_unpad | ||
* Add example for multi-host FFTs in `examples/jaxdecomp_lpt.py` | ||
Comment on lines
+11
to
+12
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Careful with consistency Added or Add but not both. |
||
|
||
|
||
## jaxdecomp 0.0.1 | ||
## jaxdecomp 0.0.1rc2 | ||
* Changes | ||
* Added utility to run autotuning | ||
|
||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,7 +4,9 @@ cmake_minimum_required(VERSION 3.19...3.25) | |
set(CMAKE_CXX_STANDARD 17) | ||
set(CMAKE_CUDA_STANDARD 17) | ||
# Latest JAX v0.4.26 no longer supports cuda 11.8 | ||
set(NVHPC_CUDA_VERSION 12.2) | ||
# By default, build for CUDA 12.2, users can override this with -DNVHPC_CUDA_VERSION=11.8 | ||
set(NVHPC_CUDA_VERSION 12.2 CACHE STRING "CUDA version to build for" ) | ||
EiffL marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
# Build debug | ||
# set(CMAKE_BUILD_TYPE Debug) | ||
add_subdirectory(third_party/cuDecomp) | ||
|
@@ -15,8 +17,8 @@ option(CUDECOMP_BUILD_FORTRAN "Build Fortran bindings" OFF) | |
option(CUDECOMP_ENABLE_NVSHMEM "Enable NVSHMEM" OFF) | ||
option(CUDECOMP_BUILD_EXTRAS "Build benchmark, examples, and tests" OFF) | ||
|
||
set(CUDECOMP_CUDA_CC_LIST "70;80" CACHE STRING "List of CUDA compute capabilities to build cuDecomp for.") | ||
|
||
# 70: Volta, 80: Ampere, 89: RTX 4060 | ||
set(CUDECOMP_CUDA_CC_LIST "70;80;89" CACHE STRING "List of CUDA compute capabilities to build cuDecomp for.") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Should we already prepare for H100 ? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. You are right. |
||
|
||
# Add pybind11 and cuDecomp subdirectories | ||
add_subdirectory(pybind11) | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -22,10 +22,13 @@ class haloDescriptor_t { | |
~haloDescriptor_t() = default; | ||
|
||
bool operator==(const haloDescriptor_t& other) const { | ||
return (double_precision == other.double_precision && halo_extents == other.halo_extents && | ||
halo_periods == other.halo_periods && axis == other.axis && config.gdims[0] == other.config.gdims[0] && | ||
config.gdims[1] == other.config.gdims[1] && config.gdims[2] == other.config.gdims[2] && | ||
config.pdims[0] == other.config.pdims[0] && config.pdims[1] == other.config.pdims[1]); | ||
return (double_precision == other.double_precision && halo_extents[0] == other.halo_extents[0] && | ||
halo_extents[1] == other.halo_extents[1] && halo_extents[2] == other.halo_extents[2] && | ||
halo_periods[0] == other.halo_periods[0] && halo_periods[1] == other.halo_periods[1] && | ||
halo_periods[2] == other.halo_periods[2] && axis == other.axis && | ||
config.gdims[0] == other.config.gdims[0] && config.gdims[1] == other.config.gdims[1] && | ||
config.gdims[2] == other.config.gdims[2] && config.pdims[0] == other.config.pdims[0] && | ||
config.pdims[1] == other.config.pdims[1]); | ||
Comment on lines
+25
to
+31
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Could you put one test per line for readability ? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The formatter does the magic here |
||
} | ||
}; | ||
|
||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No need to get rid of the template, it can guide a future maintainer in the process.