This is a deployment example of an intent-entity extractor built with Rasa.
The deployment uses Cortex, an open source tool for deploying AI APIs on AWS through customizable infrastructure, including Kubernetes clusters with CPUs and GPUs.