Cray T3E (cont)
T3E has an atomic fetch-and-increment that works on a specialized register set
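
As a rough illustration of what fetch-and-increment is good for, the sketch below hands out work indices from a shared counter. It is not T3E code: it assumes C11 `<stdatomic.h>` as a stand-in for the hardware operation, and the names `work_counter`, `fetch_and_increment`, and `claim_and_sum` are hypothetical.

```c
#include <assert.h>
#include <stdatomic.h>

/* Hypothetical shared counter. On the T3E the operation is done in
   hardware; here an ordinary C11 atomic stands in for it. */
typedef struct {
    atomic_long next;
} work_counter;

static void work_counter_init(work_counter *c) {
    atomic_init(&c->next, 0);
}

/* Atomically return the current value and advance the counter by one.
   No two callers can ever receive the same value. */
static long fetch_and_increment(work_counter *c) {
    return atomic_fetch_add(&c->next, 1);
}

/* Claim item indices until the counter passes n. Stores the sum of the
   claimed items in *sum_out and returns how many items were claimed. */
static long claim_and_sum(work_counter *c, const long *items, long n,
                          long *sum_out) {
    assert(n >= 0);
    long claimed = 0;
    long sum = 0;
    for (;;) {
        long i = fetch_and_increment(c);
        if (i >= n) {
            break;          /* all work has been handed out */
        }
        sum += items[i];
        claimed++;
    }
    *sum_out = sum;
    return claimed;
}
```

If several threads or processors call `claim_and_sum` on the same counter, each index is processed exactly once without a lock, which is the usual reason to want this primitive.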
Latency hiding via a prefetch queue
- Schedule loads before you actually need the data
- Prefetching happens in parallel with normal instruction execution
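
The idea of issuing loads early can be sketched on a commodity CPU as follows. This is only an analogy: it assumes the GCC/Clang extension `__builtin_prefetch` rather than the T3E prefetch queue, and `PREFETCH_DISTANCE` and `sum_with_prefetch` are hypothetical names with an untuned distance.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical tuning value: how many elements ahead to prefetch. */
#define PREFETCH_DISTANCE 8

/* Sum an array, requesting a[i + PREFETCH_DISTANCE] while a[i] is
   being added, so the memory fetch overlaps with computation. */
static double sum_with_prefetch(const double *a, size_t n) {
    assert(a != NULL || n == 0);
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
        if (i + PREFETCH_DISTANCE < n) {
            /* Arguments: address, 0 = read access, 1 = low temporal locality.
               The prefetch is a hint and never changes the result. */
            __builtin_prefetch(&a[i + PREFETCH_DISTANCE], 0, 1);
        }
        sum += a[i];
    }
    return sum;
}
```

The prefetch only affects timing, not results, so the function returns the same sum with or without it. The best distance depends on memory latency and on how much work each iteration does.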
The T3E can also be used as a message-passing machine