Questions about TED model

miguel-kjh · February 8, 2022, 6:16pm

Hi, my team is trying to replicate the TED model in PyTorch, but we are having some problems with the model's performance. Could someone tell us which hyperparameters were used for the experiments that appear in the original paper? It seems like a good model to us, and we want to make a couple of modifications to it.

Thanks for the help

MatthiasLeimeister · February 10, 2022, 10:53am

Hi @miguel-kjh, welcome to the Rasa community forum

For more details on the TED experiments, I would point you to this repository, which contains experimental files and the commits that were used for the paper.

In order to find out the config for a particular run, you would have to combine information from various places though (see the linked files for examples):

In the relevant git commit, you can find the default hyperparameters of the TED algorithm, e.g. here.
In the config.yml file inside the experimental folder, you can see the custom set parameters that would overwrite the defaults.
Finally, the train.sh scripts show the command line arguments used for the experiments, pointing to the config file and datasets.
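As a rough illustration of how the first two sources combine, here is a minimal Python sketch in which values from config.yml replace the defaults from the commit. The parameter names and values below are hypothetical placeholders, not the actual TED defaults or the settings used in the paper; the real ones have to be read from the commit and the experiment folder.

```python
from copy import deepcopy

# Hypothetical defaults, standing in for the values defined in the relevant commit.
DEFAULTS = {"epochs": 1, "batch_size": 32, "transformer_size": 128}


def effective_config(defaults, overrides):
    """Return a copy of the defaults with every overridden key replaced."""
    merged = deepcopy(defaults)
    merged.update(overrides)
    return merged


# Hypothetical overrides, standing in for the custom parameters in config.yml.
overrides = {"epochs": 200}

config = effective_config(DEFAULTS, overrides)
print(config)
```

Any key that does not appear in config.yml keeps its default value, which is why both places need to be checked for a given run.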

Hope this helps

miguel-kjh · February 26, 2022, 8:56pm

Thank you for the answer. I have finished the TED model in PyTorch, but I have a problem with the transformer mask: I see that it is used in the encoder and in the error calculation, but I do not understand why. Also, in the paper the features are composed of the intent, the entities, the slots and the previous actions. Does the code use anything else?

For anyone who wants to take a look at it or use it, I am leaving the GitHub link here.
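On the mask question, one common reason for masking in sequence models (an assumption here, not something confirmed from the Rasa code) is that dialogues in a batch are padded to the same length, and the padded steps should not contribute to the error. A minimal sketch in plain Python, with made-up loss values and a hypothetical helper name:

```python
def masked_mean_loss(step_losses, mask):
    """Average per-step losses over real (non-padded) steps only.

    mask holds 1 for a real dialogue step and 0 for a padded one.
    """
    total = sum(loss * m for loss, m in zip(step_losses, mask))
    count = sum(mask)
    return total / count if count else 0.0


# Made-up per-step losses; the last two steps are padding.
losses = [0.5, 1.5, 9.0, 9.0]
mask = [1, 1, 0, 0]

masked = masked_mean_loss(losses, mask)
unmasked = sum(losses) / len(losses)
print(masked, unmasked)
```

Without the mask, the padded steps dominate the average. Under the same assumption, a padding mask in the encoder would serve a similar purpose by keeping attention from reading the padded positions.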
