Train the RASA NLU to extract the entities and fill slot based on regular expressions

hemamalini · April 12, 2019, 1:49pm

In my data.json, I have given a sample IP. If I give some other IP address, it still takes the sample IP given in data.json. How can I train the model to extract the correct entity?

{
        "text": "1.2.3.4",
        "intent": "get_ip_reputation",
        "entities": [
          {
            "start": 0,
            "end": 7,
            "value": "1.2.3.4",
            "entity": "ipAddr"
          }
        ]
      },

erohmensing · April 23, 2019, 9:04am

Hi @hemamalini, what does your training data look like? Can you also provide an example like this one, but where it extracts the wrong entity value?

hemamalini · April 24, 2019, 6:52pm

@erohmensing I tried giving different IP addresses. It takes 1.2.3.4 by default even if I type some other IP. A sample story is below. How can I use regular expressions to fill the slot? Whenever I type some other IP address, the ipAddr slot gets the value 1.2.3.4.

story_001

greeting

utter_greet

get_ip_reputation

utter_ask_ip_addr

get_ip_reputation{"ipAddr": "1.2.3.4"}

slot{"ipAddr": "1.2.3.4"}

get_ipaddr_reputation

utter_reply

utter_good_bye

erohmensing · April 25, 2019, 6:05am

Can you try setting the slot type to unfeaturized if it isn’t already so?

Seeing your NLU training data for intent: get_ip_reputation would be helpful too.

hemamalini · May 8, 2019, 12:09pm

Thanks … Do I need to change anything in the stories as well?

erohmensing · May 8, 2019, 1:25pm

Actually, I apologize: as long as the slot type was originally text and not something like categorical, keep it the way it was instead of switching it to unfeaturized. Can you show me your intent data for get_ip_reputation?

hemamalini · May 9, 2019, 8:15am

Hi, this is the same data for getting the IP. I need to get the IP value dynamically.

{
        "text": "what is the reputation of 10.1.1.1",
        "intent": "get_ip_reputation",
        "entities": [
          {
            "start": 26,
            "end": 34,
            "value": "10.1.1.1",
            "entity": "ipAddr"
          }
        ]
      },

erohmensing · May 9, 2019, 2:51pm

Yes, but how many examples of this entity do you have? In order for it to generalize, you’ll want to have at least 20 examples. With IP addresses, however, your best bet is probably a regex entity extractor, as you thought in your post title. You can use it by adding the regex_features to your training data as described here and adding the intent_entity_featurizer_regex (RegexFeaturizer if on NLU 0.15.0) to your pipeline.
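For readers following along, here is a minimal sketch of how one might generate varied examples with correct entity offsets and attach a regex feature in the legacy Rasa NLU JSON layout (`rasa_nlu_data` with `regex_features` and `common_examples`). The pattern, the templates, and the sample addresses are assumptions for illustration, not taken from the thread; the pattern also does not range-check each octet. As far as I know, the regex featurizer only supplies a hint to the entity extractor, so varied training examples are still needed.

```python
import json
import re

# Assumed pattern for dotted-quad IPv4 addresses (octets are not range-checked).
IP_PATTERN = r"\b(?:\d{1,3}\.){3}\d{1,3}\b"

# Hypothetical templates and sample addresses, for illustration only.
TEMPLATES = ["what is the reputation of {ip}", "{ip}", "check {ip} for me"]
SAMPLE_IPS = ["10.1.1.1", "192.168.0.25", "8.8.8.8", "172.16.254.3"]


def make_example(template, ip):
    """Build one training example with start/end offsets computed from the text."""
    text = template.format(ip=ip)
    start = text.index(ip)
    return {
        "text": text,
        "intent": "get_ip_reputation",
        "entities": [
            {"start": start, "end": start + len(ip), "value": ip, "entity": "ipAddr"}
        ],
    }


def build_training_data():
    """Combine every template with every sample IP and add the regex feature."""
    examples = [make_example(t, ip) for t in TEMPLATES for ip in SAMPLE_IPS]
    return {
        "rasa_nlu_data": {
            "regex_features": [{"name": "ipAddr", "pattern": IP_PATTERN}],
            "common_examples": examples,
        }
    }


if __name__ == "__main__":
    # Print the JSON so it can be saved as a training data file.
    print(json.dumps(build_training_data(), indent=2))
```

Computing the offsets from the text avoids hand-counting `start` and `end`, and varying the addresses keeps the model from memorizing a single value such as 1.2.3.4.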

hemamalini · May 10, 2019, 10:10am

I added nearly 10 examples. Do we have to modify the stories.md file as well? Any sample stories would be highly appreciated.

erohmensing · May 10, 2019, 3:17pm

Nope, it shouldn’t need to be in your stories! Did you try out the regex featurizer?

hemamalini · May 19, 2019, 1:10pm

Thanks, it worked. I figured out the issue. Regex was the issue!! Thanks a ton.
