The CRFEntityExtractor component's memory use and training time grow as the NLU training set grows. Is there an alternative to this extractor that works efficiently for entity extraction and consumes less time and memory? The main concern here is memory, since it blocks the whole system…
we're working on a new architecture. But as a general rule, the more data you have, the more memory you need
what would be different in this new architecture? would it solve the problem of excessive memory and time consumption?
what do you mean by "excessive"? how much memory is it using already, and for how much data?
for an NLU training set of 15-20 MB, it took over 8 GB of the server's memory (the server's RAM is 8 GB) @Ghostvv
what is your pipeline?
it is hard to judge from MB alone — how many examples do you have?
```yaml
pipeline:
  - name: "SpacyNLP"
  - name: "WhitespaceTokenizer"
  - name: "SpacyFeaturizer"
  - name: "RegexFeaturizer"
  - name: "EntitySynonymMapper"
  - name: "SklearnIntentClassifier"
  - name: "CRFEntityExtractor"
  - name: "DucklingHTTPExtractor"
    dimensions: ["time", "number", "distance", "email", "amount-of-money"]

policies:
  - name: MemoizationPolicy
  - name: KerasPolicy
  - name: MappingPolicy
  - name: "FallbackPolicy"
    nlu_threshold: 0.3
    fallback_action_name: "utter_default_fallback"
```
This is the pipeline we are using. @Ghostvv
How many intent examples do you have?
@Ghostvv we have approximately 50-60 thousand intent examples. Our examples are generated dynamically from the permutations and combinations of entities and their (dynamic) synonym values
that's a lot. You can implement an online loader in order to reduce memory consumption
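The "online loader" idea can be sketched as a generator that yields one training example at a time instead of materializing all 50-60k permutations in memory up front. This is a minimal illustration, not Rasa API: the function names, template format, and data shapes below are hypothetical assumptions.

```python
import itertools

def generate_examples(templates, entity_values):
    """Lazily yield training examples from templates and entity synonym values.

    templates: list of strings with {entity} placeholders,
               e.g. "send {amount} to {person}"
    entity_values: dict mapping entity name -> list of synonym values

    Because this is a generator, peak memory stays at roughly one
    example, rather than the full cross-product of all synonyms.
    """
    names = sorted(entity_values)
    for template in templates:
        # Only expand entities that actually appear in this template.
        used = [n for n in names if "{" + n + "}" in template]
        for combo in itertools.product(*(entity_values[n] for n in used)):
            yield template.format(**dict(zip(used, combo)))

# Usage: iterate lazily; nothing forces the whole set into memory
# unless you wrap it in list().
templates = ["send {amount} to {person}"]
values = {"amount": ["$5", "$10"], "person": ["alice", "bob"]}
for example in generate_examples(templates, values):
    print(example)
```

The same pattern applies to reading pre-generated examples: stream them from disk (one file or line at a time) and feed them to training in batches, rather than loading the full 15-20 MB dataset and its featurized representation at once.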