I have a custom Rasa chatbot in Spanish that uses the spaCy model with the EmbeddingIntentClassifier, and a KerasPolicy with an LSTM. My problem is that the model uses all of the GPU's memory during inference. I looked for solutions for restricting memory growth and found the "Use a GPU | TensorFlow Core" guide, but I don't know how to implement these solutions in Rasa.
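For reference, the kind of fix I have in mind (a hedged sketch, not something I've confirmed works with Rasa) is TensorFlow's on-demand allocation. TensorFlow reads the `TF_FORCE_GPU_ALLOW_GROWTH` environment variable at import time, so setting it before Rasa loads TensorFlow should make it allocate GPU memory incrementally instead of grabbing it all up front:

```python
import os

# Sketch: must run before TensorFlow is imported anywhere in the process.
# Equivalent to `export TF_FORCE_GPU_ALLOW_GROWTH=true` before `rasa run`.
os.environ["TF_FORCE_GPU_ALLOW_GROWTH"] = "true"
```

But since Rasa imports TensorFlow itself, I'm not sure where (or whether) I can hook this in, or if there is a supported configuration option instead.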
I appreciate your help.