In this post I will do my best to explain all my attempts at optimizing the above algorithm. All of them were unsuccessful, which shows that the two guys at NVIDIA (Louis Bavoil and Kevin Myers) did a pretty good job. However, I managed to get a few extra percent of utilization in the CROP unit (from 32%-33% to 41.4%…
Category: Architecture
Why did I decide to use it? Not because it is very popular, but because it actually solved a problem (not the most difficult one, but still a problem)! Always look for a solution to your problem, not for whichever architecture design is currently the most widely used. The problem was that through the…