All the news you're looking for from Tensilica, Inc. Find out how you can use Tensilica's customizable, extensible processors to speed your SoC design. See Tensilica for DSPs and all the processing you need to do in the dataplane (dataplane processors, or DPUs).
Try the ConnX Vectra LX DSP Engine. A basic Tensilica Xtensa LX processor might take 155,389 cycles for a 256-point radix-4 FFT. But add Vectra LX, and that cycle count drops to 994, roughly 156 times fewer. Get performance, just where you need it.