ISSUE WHILE TRAINING - Everything freezes

Hi,

while I train my dataset I get these two warnings:

"UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no predicted samples. 'precision', 'predicted', average, warn_for)"

"c:\users\tjcol\appdata\local\programs\python\python37\lib\site-packages\sklearn\metrics\classification.py:1143: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no predicted samples. 'precision', 'predicted', average, warn_for)"

After I get them, the training freezes and I’m not able to train my model.

Please help. 🙂

This warning can occur during testing, not training.

Well, for me this occurs after I run “rasa train”, so how is it not happening during training?

I tried again with a slightly smaller dataset, but I still have the same problem. Could someone look at my dataset? Maybe there is a problem with it that I’m overlooking.

Hi rasafan,

have you tried using a virtual environment with e.g. Python 3.6.8?

I am suggesting this because the warning you see is familiar to me and usually doesn’t lead to a freeze. However, I have experienced quite different problems with Python versions 3.7 and above. Maybe this is worth a try.
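For example, a minimal sketch of what I mean, assuming the Windows “py” launcher and Python 3.6 are installed on your machine (adjust the interpreter path if not):

py -3.6 -m venv rasa_env
rasa_env\Scripts\activate
pip install rasa
rasa train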

If that doesn’t lead to a working system, feel free to publish your dataset / zip the bot so that I can test it.

Regards Julian

Hi,

yeah, we tried that too; it still doesn’t work.

rasa_data.zip (39.4 KB)

I’ve uploaded our dataset. Can you please try it out? Thanks.

@rasafan it could be a memory error

How can I solve that?

We’re working on a new version that will significantly reduce the amount of memory required. For now, could you please try the following NLU pipeline:

language: "en"

pipeline:
- name: "WhitespaceTokenizer"
- name: "RegexFeaturizer"
- name: "CRFEntityExtractor"
- name: "EntitySynonymMapper"
- name: "CountVectorsFeaturizer"
- name: "EmbeddingIntentClassifier"
  batch_strategy: sequence
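If it helps, save this as config.yml in your project root and point the training at it explicitly, e.g.:

rasa train nlu --config config.yml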

Hi, that pipeline is the supervised embeddings one, and that works for us all the time.

We tried your pipeline now and got a model with 93% accuracy that works poorly; it doesn’t understand anything that isn’t written literally as it appears in the dataset.

We are trying to train with the pretrained embeddings option, and that doesn’t work for us. Check it out:

language: "en"
pipeline: "pretrained_embeddings_spacy"

policies:
- epochs: 75
  max_history: 10
  name: KerasPolicy
- max_history: 10
  name: AugmentedMemoizationPolicy
- name: "FallbackPolicy"
  nlu_threshold: 0.2
  core_threshold: 0.1
  fallback_action_name: "action_default_fallback"
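In case the spaCy setup matters here: this pipeline also needs an English spaCy model linked as "en". We installed the dependencies roughly like this (spaCy 2.x era commands; adjust for your versions):

pip install rasa[spacy]
python -m spacy download en_core_web_md
python -m spacy link en_core_web_md en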

In addition, when I trained once again with this pipeline, I checked the memory usage at the moment it froze: it was around 1.5 GB, and my computer has 12 GB of memory.

Sorry, I didn’t see that you use the spaCy pipeline. In this case it is not a memory issue.

Yeah, I understand. Do you know what I should do? Do you know anyone who could assist?

@JulianGerhard, did you maybe find the time to check out our dataset and test it?

Any ideas?

Did you debug your code to find the spot where it freezes?

@Ghostvv

How do I debug training with Rasa, since that is on the Rasa side of the code? I googled and haven’t found anything on how to debug the training part…

Did you run in --debug mode? If that doesn’t help, I’m afraid the solution is to clone the GitHub repo and use the source code.

Can you please tell me step by step how to debug the source code after cloning it? I cannot find any info on how to do that when I’m using Rasa in another project as a dependency. Thank you.

You can find more people with the same error here:

I would suggest running your training locally with Rasa installed from the cloned repo, then debugging the parts of the Rasa code that are executed during training.
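Roughly like this (a sketch, not exact steps; the pdb “-m” flag assumes Python 3.7+):

git clone https://github.com/RasaHQ/rasa.git
cd rasa
pip install -e .
python -m pdb -m rasa train nlu --debug

Installing in editable mode means any breakpoints or print statements you add to the local source take effect immediately.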

I’ve done that, and my error still happens while running “rasa train nlu --debug”; it doesn’t offer any insight into the freezing error, as there is no additional information logged while training.

Training still freezes on:

Fitting 3 folds for each of 6 candidates, totalling 18 fits
[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent workers.
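For what it’s worth, that log looks like scikit-learn’s GridSearchCV output: the spaCy pipeline’s SklearnIntentClassifier runs a grid search (6 parameter candidates × 3 folds = 18 fits), so the hang appears to be inside the SVC fitting. A quick way to confirm where it is stuck, assuming the cloned-repo setup above:

rasa train nlu --debug
# wait for the freeze, then press Ctrl+C; the KeyboardInterrupt
# traceback shows exactly which call (e.g. GridSearchCV.fit) was running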