How to profile a SYCL application

Hi, once I have finished writing a SYCL application, I don’t know how to profile and tune it. Could you recommend a guide or tool for this? Thanks.

Hi @alanzhai219,
GPU profilers generally need to talk closely to the device drivers and be able to control low-level events/metrics, so you should use the profiling tools specific to the device vendor. That would be VTune for Intel GPUs, Nsight Systems / Nsight Compute for NVIDIA GPUs, and rocprof for AMD GPUs.

You can see our guides for profiling SYCL on NVIDIA and AMD targets here:

In addition, Intel has documentation on using VTune here: Get Started with Intel® VTune™ Profiler
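
As a rough starting point, the tools mentioned above are usually launched from the command line by wrapping your application binary. The small helper below only prints a typical launch line for each vendor; it does not run anything. The function name `profiler_cmd` and the binary name `./my_sycl_app` are placeholders made up for this sketch, and the exact options may differ between tool versions, so please check each tool's documentation before relying on them.

```shell
#!/bin/sh
# Print (not run) a typical profiler launch line for a given GPU vendor.
# "profiler_cmd" and "./my_sycl_app" are hypothetical names for this sketch.
profiler_cmd() {
    vendor="$1"
    app="$2"
    case "$vendor" in
        intel)  echo "vtune -collect gpu-hotspots -- $app" ;;  # Intel VTune Profiler
        nvidia) echo "nsys profile $app" ;;                    # NVIDIA Nsight Systems
        amd)    echo "rocprof --stats $app" ;;                 # AMD rocprof
        *)      echo "unknown vendor: $vendor" >&2; return 1 ;;
    esac
}

# Show the launch line for each vendor.
profiler_cmd intel ./my_sycl_app
profiler_cmd nvidia ./my_sycl_app
profiler_cmd amd ./my_sycl_app
```

On NVIDIA, Nsight Systems gives a timeline of the whole application, while Nsight Compute (`ncu ./my_sycl_app`) collects detailed metrics per kernel, so it is common to start with the former and then drill into individual kernels with the latter.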