1. This isn't inference only, it has the full capabilities of a normal GPU, just small and low power (and therefore much slower than normal GPUs).
2. TPUv1 is a matrix multiply ASIC that requires a host CPU to do anything. This thing is a SoC that includes both a CPU and a GPU. The CPU is pretty fast for what it is - much faster than e.g. raspberry pi, see https://www.phoronix.com/scan.php?page=article&item=nvidia-j....
3. not sure how you know whether this is more expensive than a TPUv1, since the TPUv1 was never sold or available outside of google.
A much better comparison would be between this and the Edge TPU development board.