The amount of generated training data

I have a bunch of intents defined in my nlu.md. The file is 241 lines long. I also have a domain.yml file and a stories.md file.

I then run `rasa train` in the terminal and training starts. These are the first few lines:

Epoch 1/100
763/763 [==============================] - 1s 736us/sample - loss: 2.6756 - acc: 0.3486
Epoch 2/100
763/763 [==============================] - 0s 168us/sample - loss: 2.2360 - acc: 0.5007
Epoch 3/100
763/763 [==============================] - 0s 166us/sample - loss: 1.8602 - acc: 0.5007
Epoch 4/100
763/763 [==============================] - 0s 171us/sample - loss: 1.7426 - acc: 0.5007
Epoch 5/100
763/763 [==============================] - 0s 166us/sample - loss: 1.6763 - acc: 0.5007

I understand that these lines are output from Keras, but I wonder … where does the number 763 come from?

@koaning That should be the number of mini-batch iterations for each epoch of Rasa Core training.

Thanks for the response!

That sounds a bit strange, though. Are there really more mini-batches than data points going into the model?

For anybody else interested in this: the “extra” data points are generated by Rasa from the stories. It is not just the intents we’re learning from; the dialogue model also learns to predict what comes next given the past history of each story, so a single story contributes multiple training examples.
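To make that concrete, here is a toy sketch (not Rasa’s actual featurizer, and the story names and turns are made up): if every turn in a story becomes one training example of (history so far → next event), a handful of short stories already yields more examples than there are stories or intents.

```python
# Toy illustration: each dialogue turn becomes one training example
# mapping (history so far) -> (next event). This is why the sample
# count Keras reports can exceed the number of stories or intents.

stories = {
    "greet path": ["greet", "utter_greet", "ask_weather", "utter_weather"],
    "goodbye path": ["goodbye", "utter_goodbye"],
}

def training_examples(stories):
    examples = []
    for _name, turns in stories.items():
        for i in range(1, len(turns)):
            # features: everything seen so far; label: the next event
            examples.append((turns[:i], turns[i]))
    return examples

examples = training_examples(stories)
print(len(examples))  # 4 examples from only 2 stories
```

Rasa Core additionally augments stories (e.g. by gluing them together), which inflates the count further; that combination is where a number like 763 can come from.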