what if have different values of hundreds ?
There is alot of variations ?
There are not same length ?
Somethinglike these ??
colors
c3 ii
c4
tour
travato
vectra
via
view and view
view
vista
rialta
adventure
adventure cb500x
cruiser
cruiser 1300 custom line
cruiser ctx700n
cruiser fury
cruiser goldwing f6b
cruiser rebel
cruiser shadow line
dual sport
dual sport crf250l
super
super cb1000r
t@b
trailblazer (lite) by komfort
a.c.e.
@akelad what do i have to do ? if have hurdreds of data kike this data ? bcz loopup data is not working, do I have to train on each of these ??
I’m not sure if it’ll work for such large variation, never tried it out. However, I guess your training data must include all such variations for it to work.
Hi all, do I have to give examples for all the existing entities in my lookup table? I am failing to understand, if i have to give all the entities in the training sets how is lookup table helpful?
Hi,
I have very domain specific entities that are not well recognized in my model. So, I am trying to improve the accuracy of entity recognition by adding lookup tables. But I don’t see any improvement. Currently, I’m using DIET classifier and based on this post “Entity lookup in DIET Classifier” we can use lookup tables with DIET model as long as “RegexFeaturizer” is provided in the pipeline. I also added more training data with the same entity in my nlu data. But still I don’t see any changes. The other thing that I tried was adding “CRFEntityExtractor” which I’m not sure if it is a good idea or not, but still it didn’t help. Can you share your experience that what possibly could be the solution?
Just want to chime in that it’s Feb 2022 and this still does not seem to work. I have a lookup table that shares the same name as the entity I am extracting. The table contains 14 elements. My training data uses 4-5 of those elements in a good number of different sentence structures. When entering the same EXACT phrasing as in my training data, but only changing the entity that the lookup table should provide, it does not detect any entity at all.
Example:
In my NLU:
Tell me about the rockwell.
In my lookup table:
rockwell
destin
Train, start, ask:
Tell me about the destin
I get no matches on the entity.
Its just flat out not working at all. And yes I have more training data than the 1 example and a larger lookup table than the one listed.
use RegexEntityExtractor, it must resolve your problem.
pipeline:
- name: RegexEntityExtractor
# text will be processed with case insensitive as default
case_sensitive: False
# use lookup tables to extract entities
use_lookup_tables: True
# use regexes to extract entities
use_regexes: True
# use match word boundaries for lookup table
"use_word_boundaries": True
I am also unable to understand how is lookup table helpful if I have to add every single element of lookup table into examples because I have a list of all countries and was building a tell time bot and in lookup table I have added around 150 country names now do I also have to add 150 examples mentioning all of the countries?