• 5 min read
How I Won the Mandelbrot Speed Competition: Squeezing Every FLOP from the GPU
A deep dive into the optimization techniques that helped me achieve the fastest Mandelbrot set computation in my distributed computing class—leveraging FMA operations, problem symmetry, and memory coalescing to dominate the leaderboard.
#GPU Computing
#CUDA
#Julia
Read More