Rasa-nlu-benchmark: Collection of dataset and corresponding benchmark for Rasa NLU

nghuyong · July 26, 2019, 2:24am

Project link : GitHub - nghuyong/rasa-nlu-benchmark: Collection of dataset and corresponding benchmark for Rasa NLU

Rasa NLU is a powerful and open-source natural language processing tool for intent classification and entity extraction in chatbots.

However, we found that there is no publicly available dataset with a corresponding benchmark. This makes it difficult to evaluate the performance of an NLU system built with Rasa.

Therefore, we started a project that aims to collect and organize datasets and baselines for Task-Oriented Dialogue. The datasets are provided in the data format required by Rasa NLU, so you can use them directly in your Rasa NLU system.

Stars and contributions are welcome~

MetcalfeTom · July 26, 2019, 8:22am

Wow! Amazing work @nghuyong!

I’m interested to see the supervised embeddings achieving fairly high accuracy with small amounts of data (on AskUbuntuCorpus) - actually, I’d request that you run these datasets through the NLU model comparison script and report how well the models perform with different occlusions! The script would also produce some informative graphs for the repository.
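For readers unfamiliar with the idea, here is a minimal sketch of what evaluating with different occlusions (excluded percentages of training data) could look like. It is not Rasa's implementation: the helper names `train_test_split` and `exclude_percentage`, the toy `(text, intent)` examples, and the chosen percentages are all assumptions for illustration. The test set is held out once, and each run then trains on a progressively smaller share of the remaining data.

```python
import random


def train_test_split(examples, test_fraction=0.2, seed=0):
    """Shuffle the examples and hold out a fixed test set (hypothetical helper)."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def exclude_percentage(train, percentage, seed=0):
    """Return a random subset of `train` with `percentage` percent of it removed."""
    if not 0 <= percentage < 100:
        raise ValueError("percentage must be in the range [0, 100)")
    rng = random.Random(seed)
    n_keep = len(train) - int(len(train) * percentage / 100)
    return rng.sample(train, n_keep)


# Toy (text, intent) pairs standing in for a real NLU dataset (assumed data).
examples = [
    (f"utterance {i}", "intent_a" if i % 2 else "intent_b") for i in range(100)
]

train, test = train_test_split(examples)

# One training subset per excluded percentage; the test set never changes.
subsets = {p: exclude_percentage(train, p) for p in (0, 25, 50, 75, 90)}

for p, subset in subsets.items():
    print(f"excluded {p}% -> {len(subset)} training examples, {len(test)} test examples")
```

Each subset would then be used to train a pipeline that is scored on the same held-out test set, so accuracy can be plotted against the amount of training data. A real comparison would probably also want to keep the intent distribution balanced when dropping examples, which this sketch does not do.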

I hope you know about the Rasa contributor program too. I believe this warrants a reward.

nghuyong · July 26, 2019, 8:39am

Thanks, I will add experiments and report how well the models perform on these datasets with different occlusions! And do you mean this work can be contributed to the Rasa repo?

MetcalfeTom · July 26, 2019, 2:00pm

Well, it’s not a direct contribution to the GitHub repo, but we consider it a contribution since you put in the work to help other Rasa Community members. We love to see these kinds of projects.

nghuyong · July 27, 2019, 1:30pm

We have added the "Comparing NLU Pipelines" experiments~

tuxalket · April 2, 2020, 9:04am

Hello, I am trying to benchmark my own dataset, but I do not know how to set the amount of data that Rasa uses for the benchmark. I have already split my data. Thanks
