Field Guide · hardware

Also known as: CPU cache

Cache memory is a small, very fast memory close to the CPU that keeps recently and frequently used data and instructions on hand, hiding the latency of slower main memory.1

Overview

Caches are usually built from fast SRAM on the processor die and arranged in levels: a tiny, fastest L1 per core, a larger L2, and a big shared L3. When the CPU needs data, a cache hit serves it immediately; a cache miss forces a slow trip to RAM over the system bus. Caches work because programs show locality of reference — they tend to reuse the same data and access nearby addresses — so a small store captures most accesses.

Where it fits

Cache exists because main memory cannot keep up with clock speed: without it, a fast CPU would stall waiting on RAM. It sits between the registers and main memory in the broader memory hierarchy. For throughput-heavy code like GopherTrunk’s streaming DSP, keeping the hot working set — filter taps, buffers — inside cache is what lets the processor sustain its rate instead of stalling on memory.

Sources

  1. CPU cache — Wikipedia, on cache levels, hits and misses, and locality. 

See also