Local LLM with text-generation-webui steps

Hello everyone, I was trying to deploy a local LLM with Rasa Pro and finally found a solution. Here are the details in case anyone needs them:

I have installed text-generation-webui → link

Then:

  1. I started the server with ./start_linux.sh
  2. Loaded the model through the “Model” tab
  3. In the “Session” tab I selected openai, api and listen, and pressed “Apply flags” (a quick check that the API is up is sketched below)
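
To confirm the flags took effect before touching Rasa, here is a minimal check (my own sketch, not part of the original post; it assumes the default text-generation-webui API port 5000, the same one used in endpoints.yml below). The OpenAI-compatible extension normally exposes a /v1/models route:

import requests

# Ask the local OpenAI-compatible API which models it is serving.
resp = requests.get("http://127.0.0.1:5000/v1/models", timeout=10)
resp.raise_for_status()
print(resp.json())  # the model loaded in the "Model" tab should be listed

If this prints a model list, the server side is ready.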

In Rasa’s endpoints.yml:

nlg:
  type: rephrase
  rephrase_all: true
  llm:
    model: 'model_gemma_27b_it'
    model_name: 'model_gemma_27b_it'
    type: "openai"
    openai_api_key: "NULL"
    openai_api_base: "http://127.0.0.1:5000/v1"
    request_timeout: 800

If you get the error

AttributeError: module 'openai' has no attribute 'error'

you have to install this version:

  pip install openai==0.28.1
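
As a rough sanity check (my addition, assuming the setup above), the snippet below uses the pinned openai==0.28.1 client to make the same kind of chat-completion call that Rasa’s rephraser will make, with the model name and base URL taken from the endpoints.yml above:

import openai

openai.api_key = "NULL"                       # the local server ignores the key
openai.api_base = "http://127.0.0.1:5000/v1"  # text-generation-webui's OpenAI API

response = openai.ChatCompletion.create(
    model="model_gemma_27b_it",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=50,
)
print(response["choices"][0]["message"]["content"])

If this returns text, the version pin and the local endpoint are working together.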

Hey, what changes should I be making to my config for this to work? Also, I don’t have access to an OpenAI API key; how can I bypass the OpenAI API key requirement? It keeps popping up. Any help will be appreciated. Thanks

Try using a Hugging Face model (Mixtral would be fine).

I want to use local models. I was trying Ollama, but it takes a lot of time to generate a reply, so I am stuck.

That’s the only sticking point with HF. Try and see if you can use vLLM.

Hello, if the model takes too much time to generate, I believe the problem is that your system is struggling to load the model. Maybe you should try to improve it by reducing the number of characters generated or adjusting other configuration variables.

More than taking time, it is predicting the wrong flows! Is there a good demo we can follow that uses local LLMs instead of OpenAI?