Load times for NLU Model Interpreter

I have been doing some performance tests for model loading of DIETClassifiers without pre-trained word vectors. Looks like there is a lot of smart caching / loading going on. Does anybody know why loading the first model is so much slower than the subsequent ones? Also, is there a way to speed up the load time for the very first model?

@koaning would be awesome if you could point me in the right direction.

from pathlib import Path
import time
from rasa.nlu.model import Interpreter

# Time how long each trained NLU model takes to load
load_times = []
for model_path in Path("./models_10").glob("*"):
    start = time.time()
    nlu_interpreter = Interpreter.load(str(model_path))
    load_times.append(time.time() - start)
print(load_times)
>>>>>>>>>
[4.209419012069702, 1.2521369457244873, 1.2520649433135986, 1.256606101989746, 1.4406280517578125, 1.234318733215332, 1.238109827041626, 1.2923908233642578, 1.4882800579071045]

I think it’s TensorFlow.

When you run rasa from the command line there’s also an uncomfortable waiting time. That’s because TensorFlow needs to be imported first. You can check locally: the TensorFlow payload on disk in your virtualenv is typically about 700MB.
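You can verify this yourself by timing the import in isolation. Below is a small sketch (the `timed_import` helper is hypothetical, not part of Rasa or TensorFlow); it exploits the fact that Python caches modules in `sys.modules`, so only the very first import pays the startup cost, which matches the pattern in the load times above:

```python
import importlib
import time

def timed_import(module_name):
    """Return how long importing a module takes. Only the first
    import does real work; repeats hit sys.modules and are near-instant."""
    start = time.perf_counter()
    importlib.import_module(module_name)
    return time.perf_counter() - start

# Stdlib module used here for illustration; in the forum case you
# would time "tensorflow", whose first import is what makes the
# first Interpreter.load so slow.
first = timed_import("statistics")
second = timed_import("statistics")
print(f"first: {first:.6f}s, cached: {second:.6f}s")
```

So one workaround is to trigger `import tensorflow` (or load a throwaway model) once at process startup, so the one-time cost is paid before you need fast loads.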
