Are entity-only training examples still supported? If so, how are they formatted?

plwrenn · December 11, 2019, 6:52pm

The docs for the training data format mentions both the intent and entity are optional data, text being the only required data. I’m just not sure how examples that include the text and entities, but no intent should actually look. In older docs I see mention of a entity_examples, but that seems to have been removed from the more modern docs.

I’m interested in this because I would like to build a lot of training examples for the CRFEntityExtractor extractor without introducing a large imbalance in the intent training data.

dakshvar22 · December 12, 2019, 10:43am

Hi @plwrenn You can use the json data format to accomplish this. Here is an example that should work -

{
  "rasa_nlu_data": {
    ...
    "common_examples": [
      {
        "text": "hey",
        "intent": "greet", 
        "entities": []
      }, 
      {
        "text": "I am looking for asian fusion food",
        "entities": [
          {
            "start": 17,
            "end": 29,
            "value": "asian fusion",
            "entity": "cuisine"
          }
        ]
      },
      ....

Let me know if you have any further queries.

Topic		Replies	Views
NLU trainign data entity format Rasa Open Source	2	367	October 14, 2020
Does each of the sentence must have the entity to train? Rasa Open Source	1	507	September 4, 2018
How to add intent and entites from rasa x api Rasa Open Source	4	1660	December 25, 2020
Adding new training data (new intent to existing model) Rasa Open Source	3	1320	March 9, 2022
Entity recognition CRF without intent classification Rasa Open Source	2	754	June 13, 2019

Are entity-only training examples still supported? If so, how are they formatted?

Related topics