I would like to understand how to declare half-precision (fp16) buffers and pointers in SYCL, specifically in the following two ways:
- Via the buffer class.
- Using the malloc_device() function.
Also, suppose I have an existing fp32 matrix/array on the host side. How can I copy its contents to fp16 memory on the GPU side?