Fixes dpctl.tensor.round on CUDA devices
#1700
Merged
dpctl.tensor.round on CUDA devices
#1700