I’m trying to build the samples from the sdk (git checkout with tag v1.1.4) with ComputeCpp-CE-1.1.4 when enabling the ptx64 backend for NVidia hardware. None of the sample code will successfully compile:
> cmake ../ -DComputeCpp_DIR=~/work/sycl/ComputeCpp-CE-1.1.4-CentOS-x86_64 -DCOMPUTECPP_BITCODE=ptx64
> make
parallel-for.cpp:(.text._ZN2cl4sycl7program25create_program_for_kernelI15assign_elementsEES1_NS0_7contextE[_ZN2cl4sycl7program25create_program_for_kernelI15assign_elementsEES1_NS0_7contextE]+0x29d): undefined reference to `cl::sycl::program::create_program_for_kernel_impl(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, unsigned char const*, int, char const* const*, std::shared_ptr<cl::sycl::detail::context>, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)'
CMakeFiles/parallel-for.dir/parallel-for.cpp.o: In function `cl::sycl::kernel cl::sycl::program::get_kernel<assign_elements>() const':
parallel-for.cpp:(.text._ZNK2cl4sycl7program10get_kernelI15assign_elementsEENS0_6kernelEv[_ZNK2cl4sycl7program10get_kernelI15assign_elementsEENS0_6kernelEv]+0x59): undefined reference to `cl::sycl::program::get_kernel(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) const'
This is on a centos7 system with an NVidia GTX 1080Ti. I have the OpenCL libraries installed (both via the cuda sdk and from the ocl-icd-devel )
clinfo finds the hardware and reports it.
computecpp_info finds the hardware, but says (as expected) that it doesn’t support SPIR
Am I doing something wrong?
thanks, Charles.