I ask because when I extract a number, ner_crf gives a string whereas duckling gives an int, so the form actually ends up with a list, e.g. [‘12’, 12]. I could go through and parse this in the validate step, but it would be neat if I could just say ‘take the duckling one’. One might also end up trusting one extractor more than another for certain things (I imagine; I am not an expert).
The above could also just be me misunderstanding: should it be returning a list? I thought it would just pick whichever extractor matched first and ignore the rest.
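For what it's worth, the ‘take the duckling one’ idea could be sketched as a small helper that runs over the raw entity list before the slot is filled. This is only a rough sketch, not how Rasa resolves entities itself: `pick_entity_value` is a hypothetical helper, and it assumes each entity is a dict with `entity`, `value` and `extractor` keys, and that the extractor names are `ner_crf` and `ner_duckling_http`. Check what your pipeline actually reports.

```python
from typing import Any, Dict, List, Optional


def pick_entity_value(
    entities: List[Dict[str, Any]],
    entity_name: str,
    preferred_extractors: List[str],
) -> Optional[Any]:
    """Return the value of `entity_name` from the most trusted extractor.

    Hypothetical helper: `preferred_extractors` is ordered from most to
    least trusted. Assumes each entity dict has "entity", "value" and
    "extractor" keys.
    """
    candidates = [e for e in entities if e.get("entity") == entity_name]
    for extractor in preferred_extractors:
        for candidate in candidates:
            if candidate.get("extractor") == extractor:
                return candidate.get("value")
    # No preferred extractor matched: fall back to the first candidate, if any.
    return candidates[0].get("value") if candidates else None


# Example input mirroring the [‘12’, 12] case: two extractors, same entity.
# The extractor names here are assumptions.
entities = [
    {"entity": "number", "value": "12", "extractor": "ner_crf"},
    {"entity": "number", "value": 12, "extractor": "ner_duckling_http"},
]
chosen = pick_entity_value(entities, "number", ["ner_duckling_http", "ner_crf"])
print(chosen)
```

Something like this could be called from the validate step with the entities of the latest message, so the slot gets a single value instead of a list.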
I had a similar issue, but in my case, when I used ner_spacy’s built-in entities like CARDINAL, DATE, etc., it would always return a list of two values (not sure why). When I used ner_crf, it returned a single string.
Yeah, spaCy is probably responsible for this, because I only ran into the issue when using spaCy’s entities. Maybe you can try removing duckling and testing again to see if the issue remains.