Commit d8705a2
committed
Use
When compiled for CUDA, `std::rint` would incorrectly round values halfway between two integers toward 0, rather than to the nearest even number as required per array API.
`sycl::rint` avoids such issues by not relying on the current rounding modesycl::rint rather than std::rint to resolve failed tests on Nvidia hardware1 parent 1de00cb commit d8705a2
File tree
1 file changed
+1
-1
lines changed- dpctl/tensor/libtensor/include/kernels/elementwise_functions
1 file changed
+1
-1
lines changedLines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
81 | 81 | | |
82 | 82 | | |
83 | 83 | | |
84 | | - | |
| 84 | + | |
85 | 85 | | |
86 | 86 | | |
87 | 87 | | |
| |||
0 commit comments