Releases: aws/aws-parallelcluster-cookbook

AWS ParallelCluster v3.11.1

21 Oct 16:53

We're excited to announce the release of AWS ParallelCluster Cookbook 3.11.1

This is associated with AWS ParallelCluster v3.11.1

CHANGES

  • Pyxis is now disabled by default, so it must be manually enabled as documented in the product documentation.
  • Upgrade libjwt to version 1.17.0.

BUG FIXES

  • Fix an issue in the way we configure the Pyxis Slurm plugin in ParallelCluster that can lead to job submission failures.
    aws/aws-parallelcluster#6459

AWS ParallelCluster v3.11.0

25 Sep 20:43
b00a0c6

We're excited to announce the release of AWS ParallelCluster Cookbook 3.11.0

This is associated with AWS ParallelCluster v3.11.0

ENHANCEMENTS

  • Allow custom actions on login nodes.
  • Allow DCV connection on login nodes.
  • Add new attribute efs_access_point_ids to specify optional EFS access points for the mounts.
  • Install Enroot and Pyxis in official ParallelCluster AMIs.

CHANGES

  • Upgrade Slurm to 23.11.10 (from 23.11.7).
  • Upgrade Pmix to 5.0.3 (from 5.0.2).
  • Upgrade EFA installer to 1.34.0.
    • Efa-driver: efa-2.10.0-1
    • Efa-config: efa-config-1.17-1
    • Efa-profile: efa-profile-1.7-1
    • Libfabric-aws: libfabric-aws-1.22.0-1
    • Rdma-core: rdma-core-52.0-1
    • Open MPI: openmpi40-aws-4.1.6-3 and openmpi50-aws-5.0.3-11
  • Upgrade NVIDIA driver to version 550.90.07 (from 535.183.01).
  • Upgrade CUDA Toolkit to version 12.4.1 (from 12.2.2).
  • Upgrade Python to 3.9.20 (from 3.9.19).
  • Upgrade Intel MPI Library to 2021.13.1.769 (from 2021.12.1.8).

BUG FIXES

  • Fix EFA kmod installation with RHEL 8.10 or newer.

AWS ParallelCluster v3.10.1

08 Jul 20:04

We're excited to announce the release of AWS ParallelCluster Cookbook 3.10.1

This is associated with AWS ParallelCluster v3.10.1

CHANGES

  • There were no changes for this version.

AWS ParallelCluster v3.10.0

27 Jun 21:42

We're excited to announce the release of AWS ParallelCluster Cookbook 3.10.0

This is associated with AWS ParallelCluster v3.10.0

ENHANCEMENTS

  • Add support for external Slurmdbd.
  • Allow build-image to be run in an isolated network.
  • Add support for Amazon Linux 2023.

CHANGES

  • Upgrade Cinc Client to version 18.4.12 (from 18.2.7).
  • Upgrade munge to version 0.5.16 (from 0.5.15).
  • Upgrade Pmix to 5.0.2 (from 4.2.9).
  • Upgrade third-party cookbook dependencies:
    • apt-7.5.22 (from apt-7.5.14)
    • openssh-2.11.12 (from openssh-2.11.3)
  • Remove third-party cookbook: selinux-6.1.12.
  • Upgrade EFA installer to 1.32.0.
    • Efa-driver: efa-2.8.0-1
    • Efa-config: efa-config-1.16-1
    • Efa-profile: efa-profile-1.7-1
    • Libfabric-aws: libfabric-aws-1.21.0-1
    • Rdma-core: rdma-core-50.0-1
    • Open MPI: openmpi40-aws-4.1.6-3 and openmpi50-aws-5.0.2-12
  • Upgrade NVIDIA driver to version 535.183.01 (from 535.154.05).
  • Upgrade Python to 3.9.19 (from 3.9.17).
  • Upgrade Intel MPI Library to 2021.12.1.8 (from 2021.9.0.43482).

BUG FIXES

  • Fix an issue that prevented cluster updates from including EFS filesystems with encryption in transit.
  • Fix an issue that prevented slurmctld and slurmdbd services from restarting on head node reboot when
    EFS is used for shared internal data.
  • On Ubuntu systems, remove the default logrotate configuration for cloud-init log files, which clashed with the
    configuration coming from ParallelCluster.
  • Remove /etc/profile.d/pcluster.sh so that it is not executed at every user login and
    cfn_bootstrap_virtualenv is not added to the PATH environment variable.
  • Fix image build failure with RHEL 8.10 or newer.

AWS ParallelCluster v3.9.3

19 Jun 12:19

We're excited to announce the release of AWS ParallelCluster Cookbook 3.9.3

This is associated with AWS ParallelCluster v3.9.3

ENHANCEMENTS

  • Add support for FSx Lustre as a shared storage type in us-iso-east-1.

BUG FIXES

  • Remove cloud_dns from the SlurmctldParameters in the Slurm config to avoid Slurm fanout issues.
    This is also not required since we set the IP addresses on instance launch.
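As a rough illustration of what this fix amounts to at the configuration level, the sketch below strips one option from a SlurmctldParameters line in slurm.conf-style text. It is not the cookbook's implementation; the helper name drop_slurmctld_parameter and the sample configuration are assumptions made up for this example.

```python
def drop_slurmctld_parameter(conf_text: str, name: str) -> str:
    """Return conf_text with `name` removed from any SlurmctldParameters line.

    Hypothetical helper for illustration only; it handles simple
    `Key=value` lines and ignores comments and line continuations.
    """
    out = []
    for line in conf_text.splitlines():
        key, sep, value = line.partition("=")
        if sep and key.strip().lower() == "slurmctldparameters":
            kept = [p for p in value.split(",") if p.strip() != name]
            if not kept:
                continue  # nothing left, so drop the whole line
            line = f"{key}={','.join(kept)}"
        out.append(line)
    return "\n".join(out)


# Illustrative input, not the configuration shipped by ParallelCluster.
sample = "SlurmctldParameters=idle_on_node_suspend,cloud_dns\nSuspendTime=600"
print(drop_slurmctld_parameter(sample, "cloud_dns"))
```

Other options on the same line are preserved, and the line is dropped entirely only when no options remain.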

AWS ParallelCluster v3.9.2

28 May 19:20
8eb077b

We're excited to announce the release of AWS ParallelCluster Cookbook 3.9.2

This is associated with AWS ParallelCluster v3.9.2

CHANGES

  • Upgrade Slurm to 23.11.7 (from 23.11.4).

AWS ParallelCluster v3.9.1

11 Apr 10:42

We're excited to announce the release of AWS ParallelCluster Cookbook 3.9.1

This is associated with AWS ParallelCluster v3.9.1

BUG FIXES

  • Remove recursive deletion of shared storage mountdir when unmounting filesystems as part of update-cluster operation.

AWS ParallelCluster v3.9.0

12 Mar 01:28
9506af6

We're excited to announce the release of AWS ParallelCluster Cookbook 3.9.0

This is associated with AWS ParallelCluster v3.9.0

ENHANCEMENTS

  • Allow updating external shared storage of type Efs, FsxLustre, FsxOntap, FsxOpenZfs and FileCache
    without replacing the compute and login fleets.
  • Add support for RHEL 9.
  • Add support for Rocky Linux 9 as a CustomAmi created through the build-image process. No public official ParallelCluster Rocky Linux 9 AMI is available at this time.
  • Add the configuration parameter DeploymentSettings/DefaultUserHome to allow users to move the default user's home directory to /local/home instead of /home (default).
    • SSH connections will be closed and rejected while the user's home directory is being moved during the bootstrapping process.
  • Add the ability to choose between open source and closed source NVIDIA drivers when building an AMI, through the ['cluster']['nvidia']['kernel_open'] cookbook node attribute.
  • Add configuration parameter DeploymentSettings/DisableSudoAccessForDefaultUser to disable sudo access of default user in supported OSes.
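For the ['cluster']['nvidia']['kernel_open'] node attribute mentioned above, the following sketch builds the nested attribute as a JSON string. Only the attribute path comes from these notes; the function name is hypothetical, and encoding the value as the strings "true" and "false" is an assumption. How the resulting JSON is handed to the build-image process is not covered here, so check the product documentation for the supported mechanism.

```python
import json


def nvidia_kernel_attributes(use_open_module: bool) -> str:
    """Build a JSON document that sets cluster.nvidia.kernel_open.

    Hypothetical helper; the string-valued "true"/"false" encoding is an
    assumption, not something confirmed by the release notes.
    """
    attrs = {
        "cluster": {
            "nvidia": {"kernel_open": "true" if use_open_module else "false"}
        }
    }
    return json.dumps(attrs)


print(nvidia_kernel_attributes(False))
```

Keeping the attribute in a small builder like this avoids hand-editing nested JSON when switching between the two driver flavors.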

CHANGES

  • Upgrade Slurm to 23.11.4 (from 23.02.7).
    • Upgrade Pmix to 4.2.9 (from 4.2.6).
  • Upgrade NVIDIA driver to version 535.154.05.
  • Upgrade EFA installer to 1.30.0.
    • Efa-driver: efa-2.6.0-1
    • Efa-config: efa-config-1.15-1
    • Efa-profile: efa-profile-1.6-1
    • Libfabric-aws: libfabric-aws-1.19.0
    • Rdma-core: rdma-core-46.0-1
    • Open MPI: openmpi40-aws-4.1.6-2 and openmpi50-aws-5.0.0-11
  • Upgrade NICE DCV to version 2023.1-16388.
    • server: 2023.1.16388-1
    • xdcv: 2023.1.565-1
    • gl: 2023.1.1047-1
    • web_viewer: 2023.1.16388-1
  • Upgrade ARM PL to version 23.10.
  • Upgrade third-party cookbook dependencies:
    • nfs-5.1.2 (from nfs-5.0.0)

BUG FIXES

  • Fix an issue that caused jobs to fail when submitted as an Active Directory user from login nodes.
    The issue was caused by an incomplete configuration of the integration with the external Active Directory on the head node.
  • Fix an issue that caused login nodes to fail to bootstrap when the head node took longer than expected to write keys.

AWS ParallelCluster v3.8.0

19 Dec 17:40

We're excited to announce the release of AWS ParallelCluster Cookbook 3.8.0

This is associated with AWS ParallelCluster v3.8.0

ENHANCEMENTS

  • Add support for EC2 Capacity Blocks for ML.
  • Add support for Rocky Linux 8.
  • Add support for Scheduling/SlurmSettings/Database/DatabaseName parameter to render StorageLoc
    in the slurmdbd configuration generated by ParallelCluster.
  • Add the option to use EFS storage instead of NFS exports from the head node root volume
    for intra-cluster shared file system resources: ParallelCluster, Intel, Slurm, and /home data.
  • Allow for mounting home as an EFS or FSx external shared storage via the SharedStorage section of the config file.

CHANGES

  • Upgrade Slurm to 23.02.7 (from 23.02.6).
  • Upgrade NVIDIA driver to version 535.129.03.
  • Upgrade CUDA Toolkit to version 12.2.2.
  • Use Open Source NVIDIA GPU drivers (OpenRM) as NVIDIA kernel module for Linux instead of NVIDIA closed source module.
  • Do not wait for static nodes in maintenance to signal CFN that the head node initialization is complete.
  • Upgrade EFA installer to 1.29.1.
    • Efa-driver: efa-2.6.0-1
    • Efa-config: efa-config-1.15-1
    • Efa-profile: efa-profile-1.5-1
    • Libfabric-aws: libfabric-aws-1.19.0-1
    • Rdma-core: rdma-core-46.0-1
    • Open MPI: openmpi40-aws-4.1.6-1
  • Upgrade GDRCopy to version 2.4 in all supported OSes, except for CentOS 7, where version 2.3.1 is used.
  • Upgrade aws-cfn-bootstrap to version 2.0-28.
  • Upgrade Python to 3.9.17.

BUG FIXES

  • Fix inconsistent scaling configuration after cluster update rollback when modifying the list of instance types declared in the Compute Resources.
  • Fix SSH key generation for users when switching users without root privilege in clusters integrated with an external LDAP server through cluster configuration files.
  • Fix disabling Slurm power save mode when setting ScaledownIdletime = -1.
  • Fix hard-coded path to Slurm installation dir in update_slurm_database_password.sh script for Slurm Accounting.

AWS ParallelCluster v3.7.2

13 Oct 19:37

We're excited to announce the release of AWS ParallelCluster Cookbook 3.7.2

This is associated with AWS ParallelCluster v3.7.2

CHANGES

  • Upgrade Slurm to 23.02.6.