Hi,
I think I have a problem with the generated RegexEntityExtractor_errors.json. The report is produced by running rasa test nlu.
Analysing this report, I noticed that the entities are only extracted by CRFEntityExtractor. Below I have pasted an example from the generated file. However, when I use rasa shell nlu, both extractors extract the entity correctly.
RegexEntityExtractor_errors.json:
{
"text": "Compare my tariff with tariff_type_2",
"entities": [
{
"start": 23,
"end": 36,
"value": "tariff_type_2",
"entity": "tariff_type"
}
],
"predicted_entities": [
{
"entity": "tariff_type",
"start": 23,
"end": 36,
"confidence_entity": 0.9832901695523858,
"value": "tariff_type_2",
"extractor": "CRFEntityExtractor"
}
]
},
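To check whether this holds for the whole report and not just this one entry, something like the following could tally the predicted entities per extractor. This is only a sketch: it assumes the report is a JSON list of entries shaped like the excerpt above, the helper name count_by_extractor is made up for illustration, and the inline sample stands in for the real file.

```python
import json
from collections import Counter


def count_by_extractor(records):
    """Tally predicted entities per extractor across report entries.

    Hypothetical helper; assumes each entry has a "predicted_entities" list
    whose items carry an "extractor" key, as in the excerpt above.
    """
    counts = Counter()
    for record in records:
        for entity in record.get("predicted_entities", []):
            counts[entity.get("extractor", "unknown")] += 1
    return counts


# Inline sample copied from the report excerpt. For the real report, load the
# generated file with json.load() instead (assumption: it parses as a list).
sample = json.loads("""
[
  {
    "text": "Compare my tariff with tariff_type_2",
    "entities": [
      {"start": 23, "end": 36, "value": "tariff_type_2", "entity": "tariff_type"}
    ],
    "predicted_entities": [
      {
        "entity": "tariff_type",
        "start": 23,
        "end": 36,
        "confidence_entity": 0.9832901695523858,
        "value": "tariff_type_2",
        "extractor": "CRFEntityExtractor"
      }
    ]
  }
]
""")

counts = count_by_extractor(sample)
print(dict(counts))
```

If RegexEntityExtractor never shows up in the tally for the full file, that would confirm the report only contains CRFEntityExtractor predictions.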
Output of rasa shell nlu:
{
"text": "Compare my tariff with tariff_type_2",
"intent": {
"name": "tariff_comparison",
"confidence": 0.984128006208084
},
"entities": [
{
"entity": "tariff_type",
"start": 23,
"end": 36,
"value": "tariff_type_2",
"extractor": "RegexEntityExtractor"
},
{
"entity": "tariff_type",
"start": 23,
"end": 36,
"confidence_entity": 0.9947785145759475,
"value": "tariff_type_2",
"extractor": "CRFEntityExtractor"
}
],
Can anyone explain why this is happening?
One more thing: during cross-validation (5 folds) I got the following warning:
UserWarning: No lookup tables or regexes defined in the training data that have a name equal to any entity in the training data. In order for this component to work you need to define valid lookup tables or regexes in the training data.
I have 50 examples, 19 of which don't contain any entity from the lookup table.
Thank you.