Skip to content

v0.2

Compare
Choose a tag to compare
@aoyulong aoyulong released this 30 Nov 09:27
· 346 commits to main since this release
  • Provide the actually used training scheme for Aquila2-70B-Expr, including the parallel strategies, optimizations and hyper-parameter settings.
  • Support heterogeneous training on chips of different generations with the same architecture or compatible architectures, including NVIDIA GPUs and Iluvatar CoreX chips.
  • Support training on chinese domestic hardwares, including Iluvatar CoreX and Baidu KUNLUN chips.