RASA NLU Training Data - Entity Position

gopeekrishnan · February 7, 2019, 1:27pm

Hi Team,

While training RASA NLU, why do we need start and end positions of the entities in training data when we are already specifying the entity value. { “text”: “What’s the weather in Berlin at the moment?”, “intent”: “inform”, “entities”: [ { “start”: 22, “end”: 28, “value”: “Berlin”, “entity”: “location” }

As in the example below, we are telling the model that value is Berlin, why do we need start and end here. Is it not redundant.

Juste · March 7, 2019, 12:26pm

Hello @gopeekrishnan. Sorry for a late response on this. The start and end positions are important because this is how the model knows which characters to extract and use to train the model. The value of the entity can be different from the one in the original message. For example, if you want to train you NLU model to know that New York City and NYC is the same, your training data would look like the example below:

  "text": "I moved to New York City",
  "intent": "inform_relocation",
  "entities": [{"value": "nyc",
                "start": 11,
                "end": 24,
                "entity": "city",
               }]
},
{
  "text": "I got a new flat in NYC.",
  "intent": "inform_relocation",
  "entities": [{"value": "nyc",
                "start": 20,
                "end": 23,
                "entity": "city",
               }]
}]

Topic		Replies	Views
RASA NLU Training Data - Entity Position is not correct Rasa Open Source	1	517	February 13, 2019
Entity extraction not rightly working Rasa Open Source	6	1599	October 10, 2019
Entity [Deprecated] Rasa X Community Edition	1	335	May 28, 2019
NLU training data issue Rasa Open Source	2	455	October 16, 2020
Does each of the sentence must have the entity to train? Rasa Open Source	1	510	September 4, 2018

RASA NLU Training Data - Entity Position

Related topics