Hi,
The hardware you are using is a Nvidia GPU so you would need to compile with the ptx64 flag in order for your code to be executed on the GPU. This was the same in 1.2.0 too so are you sure you were trying to execute your code on the same GPU, or perhaps you were just executing on the host device (CPU?).
There are some mising built-ins for our ptx implementation so some mathematical functions may fail at the moment.
Can you double check that you were actually offloading to the same GPU?