- Floating Point Unit (FPU): FPU to ALU, Division to Multiplication, rsqrt
- Millions of Instructions Per Second (MIPS): Pre-processing
- Processor vectorization: Reduce branch instructions
- Memory bandwidth: Cache alignement, parallelization
A Fast, Compact Approximation of the Exponential Function