Hi,
I’m using rasa 1.2.2 with supervised_embeddings
on english text and am having trouble picking weights (eg. 10 grams, ten grams, 100 grams). My model keeps separating inputs such as “10 mg” into “integer” and “dosage”. I should ideally have one entity called “dosage” with value “10 mg”. My model does this for some inputs and not others:
Input MValue Entities Entity Confidence
10 ounce ['10 ounce'] ['dosage'] 0.991
10 pound ['pound', '10'] ['dosage', 'integer'] 0.854
100 grams ['gram', '100'] ['dosage', 'integer'] 0.862
100 iu ['100 iu'] ['dosage'] 0.991
100 mcg ['100 microgram'] ['dosage'] 0.991
100 mg ['milligram', '100'] ['dosage', 'integer'] 0.949
1000 iu ['1000 iu'] ['dosage'] 0.991
1000 mg ['milligram', '1000'] ['dosage', 'integer'] 0.949
10000 iu ['10000 iu'] ['dosage'] 0.991
How should I proceed to ensure that whenever a number is followed by a weight unit is recognised as a single entity. Do I need to provide more training data or having some manual check in my code further down the line?