Regexp entity email extraction

linediconsine · December 14, 2018, 7:51pm

I want to create a forgot-password bot as a learning exercise.

With this regexp

  "rasa_nlu_data": {
    "regex_features": [
      {
        "name" : "email",
        "pattern" : "[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,}"
      },
      {
        "name": "zipcode",
        "pattern": "[0-9]{5}"
      },

I receive this error while training

ValueError: Unknown data format for file NLU/nlu.json

I noticed in particular that the use of

\.

in the pattern is a problem for the JSON format.

Questions

Any help on how to solve this and extract email addresses from sentences?
Also, what is the difference between using the .md and the .json format for NLU training data?

Thanks
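A minimal sketch of the escaping issue raised above, using only Python's standard library. It assumes the training error comes from the file not being valid JSON; the snippet in the question is truncated, so other causes cannot be ruled out. The sentence and the helper name `is_valid_json` are hypothetical.

```python
import json
import re

# As typed in the question: "\." is not a valid JSON escape, so parsing fails.
broken = r'{"name": "email", "pattern": "[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,}"}'
# With the backslash doubled, the JSON string decodes to the regex "\.".
fixed = r'{"name": "email", "pattern": "[A-Z0-9._%+-]+@[A-Z0-9.-]+\\.[A-Z]{2,}"}'


def is_valid_json(text):
    """Return True if the text parses as JSON (hypothetical helper)."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


broken_ok = is_valid_json(broken)
fixed_ok = is_valid_json(fixed)

pattern = json.loads(fixed)["pattern"]
sentence = "my email is marco@example.com"  # hypothetical test sentence

# The character classes only list A-Z, so a lowercase address matches only
# with IGNORECASE (an assumption about how the pattern is meant to be used).
strict = re.search(pattern, sentence)
relaxed = re.search(pattern, sentence, re.IGNORECASE)
```

In the training file this would mean writing the pattern with `\\.` instead of `\.`. Since the pattern only lists uppercase letters, it may also be worth adding `a-z` to each character class.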

JiteshGaikwad · December 15, 2018, 4:47am

Hey @linediconsine,

1) You can use the Duckling entity extractor to extract email addresses. Details on how to set it up are here:

https://rasa.com/docs/nlu/0.13.8/components/#ner-duckling-http

https://github.com/facebook/duckling/blob/main/README.md#supported-dimensions

2) You can find the answer to your second question here:

https://rasa.com/docs/nlu/0.13.8/dataformat/#data-format

linediconsine · December 18, 2018, 3:58pm

Hi Jitesh, I really appreciate your help.

About the components,

Can I place Duckling anywhere in the pipeline, or is there an order I should follow?
How can I test whether Duckling, or the pipeline in general, is working? (Maybe this is a stupid question.)

Right now my pipeline is:


language: "en"

pipeline:
- name: "ner_duckling_http"
  # stack from https://haskell-lang.org/get-started/osx
  # https://rasa.com/docs/nlu/components/#ner-duckling-http
  # https://github.com/facebook/duckling#quickstart
  # url of the running duckling server
  url: "http://localhost:8000"
  # dimensions to extract
  dimensions: ["email", "time", "number", "amount-of-money", "distance"]
  # allows you to configure the locale, by default the language is used
  locale: "en_US"
  # if not set the default timezone of Duckling is going to be used
  # needed to calculate dates from relative expressions like "tomorrow"
  timezone: "Europe/Berlin"
- name: "nlp_spacy"                   # loads the spacy language model
- name: "tokenizer_spacy"             # splits the sentence into tokens
- name: "intent_featurizer_spacy"     # transform the sentence into a vector representation
- name: "intent_classifier_sklearn"   # uses the vector representation to classify using SVM

Thank you!

Marco
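One possible way to check that Duckling itself responds, independently of the rest of the pipeline, is to query its HTTP server directly. The sketch below assumes the server is reachable at the `url` from the config above and accepts form-encoded `text`, `locale`, `tz` and `dims` fields on `/parse`; the function names and the sample reply are hypothetical and hand-written, not captured output.

```python
import json
import urllib.parse
import urllib.request


def build_payload(text, dims=("email",), locale="en_US", tz="Europe/Berlin"):
    """Form fields for the /parse endpoint; dims is sent as a JSON list."""
    return {"text": text, "locale": locale, "tz": tz,
            "dims": json.dumps(list(dims))}


def query_duckling(text, url="http://localhost:8000", **kwargs):
    """POST the text to a running Duckling server and decode the JSON reply."""
    data = urllib.parse.urlencode(build_payload(text, **kwargs)).encode("utf-8")
    with urllib.request.urlopen(url + "/parse", data=data, timeout=5) as resp:
        return json.loads(resp.read().decode("utf-8"))


def emails_from(matches):
    """Return the matched text of every entity whose dimension is 'email'."""
    return [m["body"] for m in matches if m.get("dim") == "email"]


# Hand-written sample shaped like a Duckling reply (illustrative only).
sample = [{"body": "marco@example.com", "start": 12, "end": 29,
           "dim": "email", "value": {"value": "marco@example.com"}}]
found = emails_from(sample)
```

With the server running, `emails_from(query_duckling("my email is marco@example.com"))` should return the address if the `email` dimension is enabled. If the call fails to connect, the problem is with the Duckling server rather than with the pipeline configuration.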

dariofiore · January 9, 2019, 5:31pm

Hi Marco, I am also working on a password-reset chatbot with Rasa. Are you interested in a knowledge exchange? You can contact me at fioredar@students.zhaw.ch. Greetings, Dario

linediconsine · January 9, 2019, 7:15pm

Yes, this sounds amazing! I sent you my contact details.

Speak soon!

JiteshGaikwad · August 20, 2019, 11:35am

You can connect with me on LinkedIn.
