@koaning, adding BERT-based models works just fine. I've tried it with the following config:
```yml
language: si

pipeline:
  - name: "HFTransformersNLP"
    model_name: "roberta"
    model_weights: "keshan/SinhalaBERTo"
    cache_dir: "hf_lm_weights/bert_si"
  - name: "LanguageModelTokenizer"
  - name: "LanguageModelFeaturizer"
  - name: "LexicalSyntacticFeaturizer"
  - name: "CountVectorsFeaturizer"
  - name: "CountVectorsFeaturizer"
    analyzer: "char_wb"
    min_ngram: 1
    max_ngram: 4
  - name: "CountVectorsFeaturizer"
    analyzer: "char"
    min_ngram: 3
    max_ngram: 5
  - name: "DIETClassifier"
    entity_recognition: true
    epochs: 300
  - name: "EntitySynonymMapper"
  - name: "ResponseSelector"
    epochs: 300
    retrieval_intent: faq

policies:
  - name: RulePolicy
```
My question is: is it possible to attach an xlm-roberta-base model in the same way? If I want to add it to the pipeline via `LanguageModelFeaturizer`, how do I specify `model_name` and `model_weights`? That's where I'm stuck, because I couldn't find those parameters in the documentation for xlm-roberta-based models.
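For reference, this is roughly what I would guess the config should look like. The `model_name: "xlm_roberta"` value and the weights string below are only my assumption based on how the `roberta` entry works above; I haven't found them confirmed anywhere in the docs, which is exactly what I'm asking about:

```yml
# Sketch only -- I'm not sure "xlm_roberta" is an accepted model_name.
pipeline:
  - name: "LanguageModelFeaturizer"
    model_name: "xlm_roberta"          # assumption, may not be supported
    model_weights: "xlm-roberta-base"  # Hugging Face checkpoint I'd like to use
    cache_dir: "hf_lm_weights/xlmr_si"
```

If that guess is wrong, what would the correct values be?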