A 4-core GPU with 32 threads per core, loosely based on modern NVIDIA and AMD architectures. It features a single-issue warp scheduler and handles warp divergence.
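For readers unfamiliar with warp divergence, here is a minimal Python sketch of the general idea. It is not this project's hardware implementation. It assumes one 32-thread warp and a simple active-mask scheme in which the two sides of a branch are issued one after the other; the names `WARP_SIZE` and `run_divergent_branch` are hypothetical and exist only for this illustration.

```python
WARP_SIZE = 32  # assumed warp width, matching the 32 threads per core above


def run_divergent_branch(values, predicate, then_op, else_op):
    """Run an if/else over one warp using active masks (illustrative only).

    Returns the per-lane results and the number of serialized passes
    that were needed (1 if all lanes agree, 2 if the warp diverges).
    """
    assert len(values) == WARP_SIZE

    # Per-lane branch outcome: True means the lane takes the "then" path.
    taken = [bool(predicate(v)) for v in values]
    then_mask = taken
    else_mask = [not t for t in taken]

    result = list(values)
    passes = 0
    # A single-issue scheduler issues one path at a time; lanes whose mask
    # bit is False sit idle during that pass.
    for mask, op in ((then_mask, then_op), (else_mask, else_op)):
        if not any(mask):
            continue  # no lane takes this path, so it is skipped entirely
        passes += 1
        for lane in range(WARP_SIZE):
            if mask[lane]:
                result[lane] = op(result[lane])
    # After both paths, all lanes are active again (reconvergence).
    return result, passes


# Even lanes double their value, odd lanes add 100: the warp diverges.
out, passes = run_divergent_branch(
    list(range(WARP_SIZE)),
    lambda v: v % 2 == 0,
    lambda v: v * 2,
    lambda v: v + 100,
)
print(out[:4], passes)  # [0, 101, 4, 103] 2
```

When every lane takes the same side of the branch, only one pass is issued, which is why divergent code costs more cycles than uniform code in this model.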
To see the final results, run the simulation first. Then, in the top bar, set the simulation run time to 1500us and press the "play" icon labeled "(T)".
Output waveform

