Because we are using proper grammatically transcribed text and alternatives to match phrases, rather than just trying to pick out sounds, you need the right kind of processing power to get the right balance between latency, streams and the physical size of the system. Because of IV’s pioneering work with GPU technology, we can push hundreds of streams through the same physical box, or allow you to auto-scale to infinity in AWS or Azure.