Neural processing unit (NPU)

Also known as: NPU

A neural processing unit (NPU) is an on-device accelerator built to run neural-network inference efficiently — typically integrated into a phone, laptop, or edge device’s main chip so AI features work without the cloud.¹

Overview

An NPU is a class of AI accelerator optimized for inference at the edge: running an already-trained model with low latency and minimal power, rather than the large-scale training that data-center TPUs and GPUs handle. NPUs are usually one block inside a larger system-on-a-chip, sitting alongside the CPU and GPU and handling tasks such as camera processing, voice recognition, and background-blur. Their headline spec is TOPS (trillions of operations per second) at a given power budget.

Where it fits

The NPU is what makes edge AI practical: instead of streaming data to a server, a device runs the model locally. For a distributed scanner like GopherTrunk, an SBC-class NPU could in principle do on-the-edge classification of decoded audio or signals near the antenna, keeping bandwidth and latency low — though GopherTrunk’s core DSP is conventional hardware acceleration territory, not neural-network work.

Sources

AI accelerator — Wikipedia, on NPUs and related on-device machine-learning hardware. ↩

Overview

Where it fits

Sources

See also

Join the GopherTrunk community