Memory Issue

Using this pipeline

```yaml
pipeline:
  - name: "SpacyNLP"
  - name: "WhitespaceTokenizer"
  - name: "SpacyFeaturizer"
  - name: "RegexFeaturizer"
  - name: "EntitySynonymMapper"
  - name: "SklearnIntentClassifier"
  - name: "CRFEntityExtractor"
  - name: "DucklingHTTPExtractor"
    url: "http://localhost:8000"
    dimensions: ["time", "number", "distance", "email", "amount-of-money"]
    locale: "en_GB"
    timezone: "Europe/London"
policies:
```

Memory consumption increases as the size of the NLU data increases. Please recommend the best pipeline so that memory consumption is handled.

Thanks

> Memory consumption increases as the size of the NLU data increases. Please recommend the best pipeline so that memory consumption is handled.

What would you consider as “memory consumption is handled”?

@Tobias_Wochinger @dakshvar22 Here the CRF entity extractor is using more memory than the other entity extractors. How can we handle the memory consumption on AWS servers as the NLU data increases, since it blocks the instance?

> than the other entity extractors

Can you name a few examples? CRF is an actual machine-learning-based algorithm which has to be trained from scratch, in contrast to the others, which are either rule-based or pretrained.
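For context, a minimal sketch of what such alternatives look like in a pipeline (assuming a Rasa 1.x config; the `url` value is the local Duckling server from the pipeline above). Neither component trains on your NLU data, so their memory footprint does not grow with it:

```yaml
pipeline:
  # Pretrained: reuses spaCy's statistical NER model; nothing is
  # trained on your NLU data.
  - name: "SpacyEntityExtractor"
  # Rule-based: entity parsing is delegated to an external Duckling
  # server, so the NLU process itself stays lightweight.
  - name: "DucklingHTTPExtractor"
    url: "http://localhost:8000"
```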

Apart from giving the machines more power, you could try lowering the number of features you feed into the component (see the CRFEntityExtractor docs). Lowering max_iterations might also help; see the sketch below.
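As a rough illustration only (not a recommended configuration; the feature names and defaults are documented in the CRFEntityExtractor docs), a trimmed-down config might look like this:

```yaml
pipeline:
  - name: "CRFEntityExtractor"
    # Reduced sliding-window feature set: [previous token, current
    # token, next token]. Fewer features means a smaller model and
    # less memory during training.
    features:
      - ["low"]
      - ["bias", "low", "upper", "title", "digit"]
      - ["low"]
    # Fewer optimisation iterations (default is 50); trades some
    # accuracy for lower training cost.
    max_iterations: 20
```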