Commit cbbb3dc
optimized: add BFloat16 and Half support to opt_log_softmax_out
opt_log_softmax_out only handled Float; BFloat16 and Half fell through to
ET_KERNEL_CHECK(false), leaving output unchanged. The underlying
log_softmax_kernel<IN_T, OUT_T> is fully generic and the ATen vectorized
functions it delegates to already support BFloat16 and Half.
- Extend log_softmax_wrapper with an if constexpr branch for BFloat16/Half
that calls log_softmax_kernel<T, T>
- Add BFloat16 and Half dispatch cases in opt_log_softmax_out
1 parent 16ba018 commit cbbb3dc
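The pattern described in the commit message can be sketched roughly as follows. This is a self-contained illustration, not the code from the commit: `BF16`, `log_softmax_kernel_sketch`, and `log_softmax_wrapper_sketch` are hypothetical stand-ins, the kernel is a plain scalar loop rather than the ATen vectorized path, and the unsupported-dtype branch returns `false` where the real code would go through ET_KERNEL_CHECK. It requires C++17 for `if constexpr`.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>
#include <type_traits>

// Hypothetical stand-in for a bfloat16 type (NOT the ExecuTorch type).
struct BF16 {
  uint16_t bits;
  BF16() : bits(0) {}
  explicit BF16(float f) {
    uint32_t u;
    std::memcpy(&u, &f, sizeof(u));
    bits = static_cast<uint16_t>(u >> 16);  // truncation, for brevity
  }
  explicit operator float() const {
    uint32_t u = static_cast<uint32_t>(bits) << 16;
    float f;
    std::memcpy(&f, &u, sizeof(f));
    return f;
  }
};

// Generic scalar kernel: reads IN_T, accumulates in float, writes OUT_T.
template <typename IN_T, typename OUT_T>
void log_softmax_kernel_sketch(const IN_T* in, OUT_T* out, int n) {
  float max_v = static_cast<float>(in[0]);
  for (int i = 1; i < n; ++i) {
    max_v = std::max(max_v, static_cast<float>(in[i]));
  }
  float sum = 0.0f;
  for (int i = 0; i < n; ++i) {
    sum += std::exp(static_cast<float>(in[i]) - max_v);
  }
  const float log_sum = std::log(sum);
  for (int i = 0; i < n; ++i) {
    out[i] = static_cast<OUT_T>(static_cast<float>(in[i]) - max_v - log_sum);
  }
}

// Wrapper: the supported dtypes are selected at compile time.
// Returns false (and leaves `out` untouched) for any other type.
template <typename T>
bool log_softmax_wrapper_sketch(const T* in, T* out, int n) {
  if constexpr (std::is_same_v<T, float> || std::is_same_v<T, BF16>) {
    log_softmax_kernel_sketch<T, T>(in, out, n);
    return true;
  } else {
    return false;  // unsupported dtype
  }
}
```

Because the kernel is templated on both input and output types, extending support in this sketch only means widening the `if constexpr` condition. The `double` case shows the "falls through, output unchanged" behavior that the commit message describes for the pre-change BFloat16 and Half paths.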
1 file changed
Lines changed: 32 additions & 13 deletions
@@ -98,20 +98,27 @@ (13 lines removed, 20 added; diff content not preserved)
@@ -148,6 +155,18 @@ (12 lines added; diff content not preserved)