This should help slightly with CPU/GPU parallelism when vsync is on. I also return the time spent on vsync, which can help users figure out how much CPU wall-time was used on non-vsync stuff