Hi there, I’m trying out using lookup tables to detect cities. In my lookups table, I have both “batu” (from Indonesia) and “batu caves” (from Malaysia) as examples. I’m using RegexEntityExtractor in my pipeline
Once the model is trained, I typed in: “What’s the weather in Batu Caves?”
However, the model decided to extract the entity “batu” instead of “batu caves”.
Is there a way to customize the regex behaviour to prefer longer matches?