What would be the best architecture of a component that enhances entity lookup tables with fuzzy matching?
I see how the regex_featurizer.py is using the lookup tables and I know I have to use FuzzyWuzzy somewhere. Some general thoughts are:
Would I I add something into regex_featurizer.py?
Would I make a new component and use FuzzyWuzzy to look into the lookup table files and see what are the best matches and pass them into the the RegexFeaturizer component?
The the bullets above might be completely off but I am just curious what the general plan would be.