Interpreter object takes 2 minutes to load, and loads every time the Azure Function is called

I really need some help on this. The command

interpreter = Interpreter.load(model_directory1)

is taking too long (about 2 minutes) to load the model — it builds the model from scratch, downloading the BERT vocab and other configuration. This happens every time the Azure Function is called. Is there a way to save this interpreter object and just consume it in an Azure Function?

Following are the files and code I am using. I am using a pre-trained model and named its directory "nlu_new". I am able to get predictions.

requirements.txt:

rasa[transformers]

import logging
import json

import azure.functions as func
from rasa.nlu.model import Interpreter

model_directory1 = "nlu_new"  # directory of the extracted pre-trained model

def main(req: func.HttpRequest) -> func.HttpResponse:
    logging.info('Python HTTP trigger function processed a request.')
    message = req.get_json()

    # This loads the model from scratch on every invocation -- the slow part
    interpreter = Interpreter.load(model_directory1)

    result = interpreter.parse(message, only_output_properties=False)

    return func.HttpResponse(json.dumps(result), mimetype="application/json")

Model trained on this pipeline config:

language: en

pipeline:
- name: HFTransformersNLP
  model_weights: "bert-base-uncased"
  model_name: "bert"
- name: LanguageModelTokenizer            # splits the sentence into tokens
- name: LanguageModelFeaturizer

- name: DIETClassifier

You should be running Rasa in a separate, constantly-running container and call its REST or socket channel from your Azure Function.
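With that setup, the Azure Function only makes an HTTP call. A minimal sketch of calling Rasa's `/model/parse` endpoint (available when the container runs `rasa run --enable-api`); the host name `rasa-server` and port `5005` are assumptions to adapt to your deployment:

```python
import json
import urllib.request

RASA_URL = "http://rasa-server:5005/model/parse"  # assumed host/port

def build_parse_request(text: str, url: str = RASA_URL) -> urllib.request.Request:
    """Build the POST request for Rasa's /model/parse endpoint."""
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )

def parse_message(text: str) -> dict:
    """Send the text to the always-running Rasa container and return the parse result."""
    with urllib.request.urlopen(build_parse_request(text), timeout=10) as resp:
        return json.loads(resp.read())
```

The model stays loaded in the Rasa container, so each function invocation pays only the cost of one HTTP round trip.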

I tried using lru_cache, and on a dedicated machine it works fine, replying within a second. On Azure, however, the process is recycled every 5 to 10 minutes. I guess that is due to the plan we have for Azure Functions; changing to a plan with dedicated Azure resources, or an external cache, should solve the problem. Thanks!
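For reference, the lru_cache approach looks like this. A counter stands in for the expensive `Interpreter.load` call so the caching behaviour is visible; in the real function the body would return `Interpreter.load(model_dir)`:

```python
from functools import lru_cache

load_count = 0  # counts how many times the expensive load actually runs

@lru_cache(maxsize=1)
def get_interpreter(model_dir: str):
    """Load the model once per process; repeat calls return the cached object."""
    global load_count
    load_count += 1
    return f"interpreter-for-{model_dir}"  # placeholder for the Interpreter object

get_interpreter("nlu_new")
get_interpreter("nlu_new")  # cached: the loader body does not run again
```

Note the cache lives in the worker process, so it disappears whenever Azure recycles the process — which is exactly the 5-to-10-minute behaviour described above.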

It looks like you are loading the model for every request and then running inference. You can load the model outside your API endpoint, i.e. at the global (module) level, so that it is loaded once when the worker starts and kept in memory for subsequent requests.