AMD interview question

How would you optimize matrix multiplication?