What's the recommended approach to reduce model loading time in rasa 3 using python

I am new for Rasa,

I have trained a RASA NLU model with the Default rasa config.yml file. with 3 intent and per intent 3 example and total 2 entities.

I have use below commend for train model

config = 'confing.yml'
training_files = './data/'
domain = 'domain.yml'
output = './models/
    import rasa
    rasa.train(domain, config, [training_files], output, fixed_model_name='model_name')

When i have load a RASA NLU model with the following code

    from rasa.core.agent import Agent
    Agent.load(model_path='Model_path')

It’s take 1.5 to 2 minute for load model in variable.

Is there anyway that this can be mitigated ? or I am doing something wrong ? Because I want to serve these models on demand, which requires a faster load time.

Any help would be highly appreciated. Thanks in advance! :slightly_smiling_face:

1 Like

Hello there,

I did not find any solution yet. Would be great to have someone’s opinion on that.

Thank you!

Any updates on this? I’m experiencing the same issue.