Skip to content

Conversation

@paddle-bot
Copy link

paddle-bot bot commented Nov 14, 2025

你的PR提交成功,感谢你对开源项目的贡献!
请检查PR提交格式和内容是否完备,具体请参考示例模版
Your PR has been submitted. Thanks for your contribution!
Please check its format and content. For this, you can refer to Template and Demo.

@luotao1 luotao1 self-assigned this Nov 14, 2025
@luotao1
Copy link
Collaborator

luotao1 commented Nov 14, 2025

@chang-wenbin

const int kernel_size, const int kernel_stride, const int topk,
paddle::Tensor& output, const bool is_prefill
) {
// 1. 根据 is_prefill 选择不同的计算策略

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这部分建议新增attentionbackend,按照目前FastDeploy的管理方式管理PD的attention

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

好的

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已更新,麻烦review下

这部分建议新增attentionbackend,按照目前FastDeploy的管理方式管理PD的attention

@luotao1 luotao1 merged commit daa357a into PaddlePaddle:master Nov 17, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants