add rule for _C_ops.matmul by greenhandF · Pull Request #640 · PFCCLab/PaddleAPITest

greenhandF · 2026-06-09T02:25:40Z

add mapping and rule for paddle.C_ops.matmul

cangtianhuang

LGTM

cangtianhuang

修改后要统一复测一下原有配置，在pr描述里贴上测试结果，要看精度没有大问题～

cangtianhuang · 2026-06-11T06:05:53Z

                    or (
                        isinstance(paddle_item, paddle.Tensor)
-                        and not (paddle_item._is_initialized() or paddle_item.numel() == 0)
+                        and (not paddle_item._is_initialized() or paddle_item.numel() == 0)


这样会影响合法的 0size tensor 正常判断（0size tensor 的 numel 为 0，但是有数据类型与形状），是否能改成：

Suggested change

and (not paddle_item._is_initialized() or paddle_item.numel() == 0)

and not paddle_item._is_initialized()

cangtianhuang · 2026-06-11T06:08:59Z

                        or (
                            isinstance(paddle_item, paddle.Tensor)
-                            and not (paddle_item._is_initialized() or paddle_item.numel() == 0)
+                            and (not paddle_item._is_initialized() or paddle_item.numel() == 0)


cangtianhuang · 2026-06-11T06:25:55Z

+            do1_e  = torch.zeros_like(o1)
+            o2_s_e = torch.zeros_like(do2_s)


可能 empty_like 更贴切～但是都行

cangtianhuang · 2026-06-11T06:59:21Z

+            pg_out[s:e] = (do2_c * o2_val).sum(dim=-1)
+            # o2_s 写入需在 do1 之前完成时使用 do2_2d 切片；这里 o2_s 与 do1
+            # 来自不同 buffer（即使 inplace 也是 do2_s vs o1），互不影响。
+            o2_s_out[s:e] = (o2_val * prob_c).to(do2_dtype)
+            do1_out[s:e, :H] = x0g.to(o1_dtype)
+            do1_out[s:e, H:] = x1g.to(o1_dtype)


这处切片赋值会对 fp32 的叶子节点进行原地写入，torch 会报错，考虑加 torch.no_grad() 包裹

可以复现一下：
paddle._C_ops._run_custom_op("fused_swiglu_probs_bwd", Tensor([2, 4],"float32"), Tensor([2, 2],"float32"), Tensor([2, 1],"float32"), True, )

greenhandF · 2026-06-12T03:41:55Z

所有paddle_only能够通过的test精度比较都没有问题，共3个算子共23个case

cangtianhuang

LGTM

root added 3 commits June 9, 2026 10:24

add rule for _C_ops.matmul

d9184a3

add rele for _C_ops.swiglu_grad

ec3b5e2

remove useless code

1554750

cangtianhuang previously approved these changes Jun 9, 2026

View reviewed changes

add rules for custom op fused_swiglu_probs_bwd

f464f8b

greenhandF dismissed cangtianhuang’s stale review via f464f8b June 10, 2026 11:27

cangtianhuang requested changes Jun 11, 2026

View reviewed changes

cangtianhuang self-assigned this Jun 12, 2026

fix small bugs

74f18f9

cangtianhuang approved these changes Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add rule for _C_ops.matmul#640

add rule for _C_ops.matmul#640
greenhandF wants to merge 5 commits into
PFCCLab:mainfrom
greenhandF:fanbohao

greenhandF commented Jun 9, 2026

Uh oh!

cangtianhuang left a comment

Uh oh!

cangtianhuang left a comment

Uh oh!

cangtianhuang Jun 11, 2026

Uh oh!

cangtianhuang Jun 11, 2026

Uh oh!

cangtianhuang Jun 11, 2026

Uh oh!

cangtianhuang Jun 11, 2026

Uh oh!

greenhandF commented Jun 12, 2026

Uh oh!

cangtianhuang left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	and (not paddle_item._is_initialized() or paddle_item.numel() == 0)
	and not paddle_item._is_initialized()

		do1_e = torch.zeros_like(o1)
		o2_s_e = torch.zeros_like(do2_s)

Conversation

greenhandF commented Jun 9, 2026

Uh oh!

cangtianhuang left a comment

Choose a reason for hiding this comment

Uh oh!

cangtianhuang left a comment

Choose a reason for hiding this comment

Uh oh!

cangtianhuang Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

cangtianhuang Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

cangtianhuang Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

cangtianhuang Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

greenhandF commented Jun 12, 2026

Uh oh!

cangtianhuang left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants