A thing that seems like it would add a lot of out-of-the-box power to custom entity recognizers is the ability to pass token-level features to
The simplest version of this would be a
SpacyEntityFeaturizer that would make a token’s
.vector attribute available to
CRFEntityExtractor. That would let you use much more powerful features in classifying your custom entities than simply part-of-speech or the other current features.
I am working on a way to add this feature and would love comments/feedback/confirmation that others want this feature.