The launch of Amazon Elastic Inference lets customers add GPU acceleration to any EC2 instance for faster inference at 75 percent savings. Typically, the average utilization of GPUs during inference ...