Using NLP for Training data selection

datistiquo · September 18, 2018, 8:24am

Since for training the NLU with sentences you need to take care of the balances and variations of examples to avoid overfitting. When you are in practice and have many data you don’t know in general what kind of sentences you already have in your data. In my mind came the idea to do Clustering of new sentences to look if I should add the new sentences to the data.

Do you use some algorithms for training data yet. What kind?

Topic		Replies	Views
How can I make the training dataset from over 400 question and answer? Rasa Open Source	1	591	August 24, 2018
Should one use all stories during training data? Rasa Open Source	1	259	August 14, 2020
NLU only best practices Getting Started with Rasa	1	345	August 12, 2019
Automated Training in RASA Rasa Open Source testing	6	1245	September 13, 2020
Is it possible to train partially new Intents when I add new data(intents, stories) Rasa Open Source	1	267	June 19, 2020

Using NLP for Training data selection

Related topics