I’ve got rasa_nlu integrated into my Python app. I’m passing it a glob of characters, ‘${webAddress}’ in this example, and I’d like to get that glob of characters back as an entity. For some reason, ner_crf is adding spaces to the entity value, even though they are not in the response text. How do I make it stop doing that?
I’m using the spacy_sklearn pipeline for training. I have included several very similar examples in my training data (substituting some other string for webAddress), and it does recognize the entity. Just…just…stop it with the spaces!
$ curl -XPOST localhost:5000/parse -d '{"q":"navigate to ${webAddress}"}'
{
"intent": {
"name": "navigate",
"confidence": 0.1911308126544064
},
"entities": [
{
"start": 12,
"end": 25,
"value": "$ { webaddress }",
"entity": "url",
"confidence": 0.5229620578330448,
"extractor": "ner_crf"
}
],
"text": "navigate to ${webAddress}",
"project": "default",
"model": "model_20190409-153615"
}
I just ran into another example. The text Double-click-me, which I want to get back as Double-click, is instead returned as double - click - me.
I guess a more direct way of asking the question is, “How do I stop entity values from being reformatted?”
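For what it’s worth, the start and end offsets in the response above do line up with the original text, so one possible workaround is to ignore "value" and slice "text" myself. This is only a sketch: restore_entity_text is a helper name I made up (it is not part of rasa_nlu), and it assumes the offsets are always character positions into "text", which I have only checked for this one response.

```python
def restore_entity_text(parsed):
    """Return copies of the entities whose "value" is the exact substring
    of the original text that the start/end offsets point at.

    Hypothetical helper, not a rasa_nlu API. Assumes "start" and "end"
    are character offsets into parsed["text"].
    """
    text = parsed["text"]
    restored = []
    for entity in parsed.get("entities", []):
        fixed = dict(entity)  # copy so the original response is left untouched
        fixed["value"] = text[entity["start"]:entity["end"]]
        restored.append(fixed)
    return restored


# The /parse response from above, trimmed to the fields the helper uses.
parsed = {
    "entities": [
        {
            "start": 12,
            "end": 25,
            "value": "$ { webaddress }",
            "entity": "url",
            "extractor": "ner_crf",
        }
    ],
    "text": "navigate to ${webAddress}",
}

for entity in restore_entity_text(parsed):
    print(entity["entity"], entity["value"])  # prints: url ${webAddress}
```

That gets the original casing and spacing back on my side, but it doesn’t explain why ner_crf rewrites the value in the first place, so I’d still like to know whether there is a setting for it.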