In my setup, computecpp-sdk/samples/matrix-multiply works fine for matSize <= 4096. As soon as I choose matSize >= 8192, the program causes the screen to go blank for a few seconds and then throws the exception “[ComputeCpp:RT0201] Error while waiting for an event”.
I am running ComputeCpp v2.1.0 and CUDA v11.0 on a GeForce GTX TITAN X with ptx64 enabled.
The behavior is not entirely reproducible. In some cases, the program appears to work correctly for matSize == 8192, or it fails in slightly different ways. In most cases, however, it behaves as described above.