Ah! I think I may have found it, and I stand corrected. The CRF model does seem to ignore features; it even seems to ignore the sparse features in the pipeline.
Yeah, this is definitely a bug.
These two pipelines yield the same results.
```yaml
pipeline:
  - name: WhitespaceTokenizer
  - name: LanguageModelFeaturizer
    model_name: "roberta"
    model_weights: "roberta-base"
  - name: CRFEntityExtractor
```

```yaml
pipeline:
  - name: WhitespaceTokenizer
  - name: CRFEntityExtractor
```
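If anyone wants to reproduce this, here is a minimal sketch (assuming Rasa 2.x; the model paths are hypothetical placeholders for the two trained, unpacked models) that loads both models and compares their entity predictions directly:

```python
# Minimal sketch (Rasa 2.x assumed): load both trained models and compare
# their entity predictions. The model paths below are hypothetical
# placeholders for the unpacked trained models.
from rasa.nlu.model import Interpreter

with_lm = Interpreter.load("models/with_lm/nlu")        # hypothetical path
without_lm = Interpreter.load("models/without_lm/nlu")  # hypothetical path

examples = [
    "book a flight from Berlin to Lisbon",
    "my name is Sara and I live in Amsterdam",
]

for text in examples:
    entities_lm = with_lm.parse(text)["entities"]
    entities_plain = without_lm.parse(text)["entities"]
    # If the CRF actually used the LanguageModelFeaturizer's dense features,
    # the two predictions would generally differ.
    print(text, "->", "identical" if entities_lm == entities_plain else "different")
```

Comparing the parsed entities directly avoids relying on aggregate F1 scores, which could coincide by chance.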
Making a note on the GitHub issue as well.
Thank you for the `time` command; I didn't know about it, and it will be really helpful for my research.
I had already noticed that the sparse features are not taken into account, but I thought that the CRF module generated them automatically from the given tokens, something like the sketch below.
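(Illustration only, using the standalone sklearn-crfsuite library rather than Rasa's CRFEntityExtractor: this is the kind of token-derived feature generation I had in mind.)

```python
# Illustration only: standalone sklearn-crfsuite (NOT Rasa's
# CRFEntityExtractor), showing features derived purely from the tokens.
import sklearn_crfsuite

def token_features(tokens, i):
    word = tokens[i]
    return {
        "bias": 1.0,
        "word.lower": word.lower(),      # lowercased surface form
        "word.istitle": word.istitle(),  # capitalization pattern
        "word.isdigit": word.isdigit(),
        "suffix3": word[-3:],            # last three characters
    }

def sentence_features(tokens):
    return [token_features(tokens, i) for i in range(len(tokens))]

# Tiny toy dataset: one sentence with a single city entity.
X = [sentence_features(["fly", "to", "Berlin"])]
y = [["O", "O", "B-city"]]

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
crf.fit(X, y)
print(crf.predict(X))  # e.g. [['O', 'O', 'B-city']]
```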
So do you think that the dense features are being ignored by design (in that case, why would it be different when using DIET without a transformer?), or is it simply a bug?
From what I understood, the `time` command gives the time for the whole test run; is there any way to measure the time for the prediction step only?
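In case it helps, here is a rough sketch of the kind of measurement I'm after (assuming Rasa 2.x; `models/nlu` is a hypothetical path to an unpacked trained model), timing only the `parse` calls and excluding training and model loading:

```python
# Rough sketch: measure only the prediction step, excluding training and
# model loading. Assumes Rasa 2.x; "models/nlu" is a hypothetical path
# to an unpacked trained model.
import time
from rasa.nlu.model import Interpreter

interpreter = Interpreter.load("models/nlu")  # loading happens outside the timer

texts = ["book a table for two", "what is the weather in Paris"] * 50

start = time.perf_counter()
for text in texts:
    interpreter.parse(text)
elapsed = time.perf_counter() - start

print(f"{len(texts)} predictions in {elapsed:.2f}s "
      f"({elapsed / len(texts) * 1000:.1f} ms per message)")
```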
Also, it makes sense that the LM config takes more time in testing because, even if its output is not taken into account, the LanguageModelFeaturizer is still trained, which is computationally expensive.