laws, formulas, and mental models worth keeping close.
the serial fraction sets a hard ceiling on parallel speedup.
items in a system = arrival rate times time spent inside.
attainable performance is the minimum of compute peak and memory bandwidth times arithmetic intensity.
extends Amdahl with a coherency penalty that explains why more nodes can mean less throughput.