support deepseek on xpu #10060

yulangz · 2025-03-10T09:16:56Z

Before submitting

Lint code. If there are lint issues, please format the code first.

# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py

Add test cases into tests folder. If there are codecov issues, please add tests cases first.

PR types

New features

PR changes

Models | APIs

Description

Support deepseek on Kunlun XPU inference.
Support dynamic and static graphics.
Now, only MOE-ONLYINT8 quantization is supported (only MOE weights are quantized to int8, other weights are not quantized).

paddle-bot · 2025-03-10T09:17:01Z

Thanks for your contribution!

codecov · 2025-03-10T09:51:58Z

Codecov Report

Attention: Patch coverage is 0% with 171 lines in your changes missing coverage. Please review.

Project coverage is 50.15%. Comparing base (1db27cd) to head (5a9fc9c).
Report is 46 commits behind head on develop.

Files with missing lines	Patch %	Lines
...erimental/transformers/fused_transformer_layers.py	0.00%	101 Missing ⚠️
.../experimental/transformers/deepseek_v2/modeling.py	0.00%	68 Missing ⚠️
paddlenlp/experimental/transformers/proposers.py	0.00%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop   #10060      +/-   ##
===========================================
- Coverage    50.39%   50.15%   -0.24%     
===========================================
  Files          756      757       +1     
  Lines       121658   122553     +895     
===========================================
+ Hits         61304    61469     +165     
- Misses       60354    61084     +730

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

hong19860320 · 2025-03-10T11:44:24Z

csrc/xpu/src/cmake_build.sh

 # export XDNN_PATH=Paddle/build/third_party/xpu/src/extern_xpu/xdnn-ubuntu_x86_64/ # <path_to_xdnn>
 # export XRE_PATH=Paddle/build/third_party/xpu/src/extern_xpu/xre-ubuntu_x86_64/  # <path_to_xre>
 # export CLANG_PATH=xtdk-ubuntu_1604_x86_64 # <path_to_xtdk>
 # export HOST_SYSROOT=/opt/compiler/gcc-8.2/bin/gcc # <path_to_gcc>

+export XDNN_PATH=/opt/output/work_dir/paddle-deepseek/xpu_libs/xhpc/xdnn


为啥用一些和模型类型相关的hardcode路径？

hong19860320 · 2025-03-10T11:48:48Z

csrc/xpu/src/get_padding_offset_v2.cc

+                                                             const std::vector<int64_t>& seq_len_shape,
+                                                             const std::vector<int64_t>& draft_tokens_shape,
+                                                             const std::vector<int64_t>& seq_lens_encoder_shape) {
+    // std::cout << "wht --- GetPaddingOffsetV2InferShape" << std::endl;


是不是可以去掉这些已经注释的调试代码？下同。

hong19860320 · 2025-03-10T11:49:30Z

csrc/xpu/src/mla_block_multihead_attention_xpu.cc

+                                              prefix_lens_vp, // prefix_lens_vp
+                                              encoder_kv_lods_vp); // encoder_kv_lods_vp
+
+    // encoder 关键信息打印


去掉已经注释了的调试代码

support deepseek on xpu

b4a587e

hong19860320 reviewed Mar 10, 2025

View reviewed changes

yulangz added 2 commits March 11, 2025 01:24

remove useless comment

305552f

fix

5a9fc9c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support deepseek on xpu #10060

support deepseek on xpu #10060

yulangz commented Mar 10, 2025

paddle-bot bot commented Mar 10, 2025

codecov bot commented Mar 10, 2025 •

edited

Loading

hong19860320 Mar 10, 2025

yulangz Mar 11, 2025

hong19860320 Mar 10, 2025

yulangz Mar 11, 2025

hong19860320 Mar 10, 2025

yulangz Mar 11, 2025

support deepseek on xpu #10060

Are you sure you want to change the base?

support deepseek on xpu #10060

Conversation

yulangz commented Mar 10, 2025

Before submitting

PR types

PR changes

Description

paddle-bot bot commented Mar 10, 2025

codecov bot commented Mar 10, 2025 • edited Loading

Codecov Report

hong19860320 Mar 10, 2025

Choose a reason for hiding this comment

yulangz Mar 11, 2025

Choose a reason for hiding this comment

hong19860320 Mar 10, 2025

Choose a reason for hiding this comment

yulangz Mar 11, 2025

Choose a reason for hiding this comment

hong19860320 Mar 10, 2025

Choose a reason for hiding this comment

yulangz Mar 11, 2025

Choose a reason for hiding this comment

codecov bot commented Mar 10, 2025 •

edited

Loading