GPU architecture

GPU GPU or graphics processing unit, are type of processors that are used to perform large floating point instructions. It gained intial popularity for it’s use in video processing or gaming and more recently for training large language models. But why are GPUs so fast and are they always faster than CPUs? The answer lies in the architecture. A GPU is designed to perform single instruction over multiple data acronymed SIMD. A CPU usually contains Control Unit, Arithmatic Logic Unit, registers etc. More recently CPU have multiple cores, each cpu works like a mini cpu with it’s own CU and ALU. If a CPU has 4 cores, it can operate 4 instruction truly in parallel. GPU are different, they can 1000s of threads in parallel. ...

TPU? Google's answer to Nvidia

Tensor processing Units (TPU)