DIET Architecture - Individual Token Pathway

rsklearner · November 5, 2021, 6:41pm

Hi,

I was reading about the DIET architecture and have the following question. The DIET architecture has two individual token pathways:

Pre-trained Embedding
Sparse Features + FFNN

But I don’t see any hyperparameters related to these two in DIETClassifier in the pipeline. Are they controlled by the pipeline components that come before DIETClassifier?

stephens · November 7, 2021, 2:56am

You can see the DIET hyperparameters here

rsklearner · November 7, 2021, 10:27am

@stephens Hi, thank you for the reply! I checked the parameters in the Rasa docs as well, but I still don’t understand the point below.

Please help.

ChrisRahme · November 8, 2021, 6:19am

All these on the left are INSIDE the DIET Classifier.

The LanguageModelFeaturizer and CountVectorsFeaturizer do not appear on that diagram.

An alternative link to the one proposed by Greg is this one. There you can find DIET’s components/hyperparameters. Here’s an example usage:

- name: DIETClassifier
  epochs: 141
  model_confidence: linear_norm
  loss_type: cross_entropy
  constrain_similarities: true
  number_of_transformer_layers: 2
  number_of_attention_heads: 4
  batch_size:
  - 64
  - 128
  evaluate_on_number_of_examples: 200
  evaluate_every_number_of_epochs: 5
  regularization_constant: 0.002
  random_seed: 1
  tensorboard_log_directory: ./.tensorboard/DIET
  tensorboard_log_level: epoch
  checkpoint_model: True

rsklearner · November 14, 2021, 4:30am

@ChrisRahme Sorry for the late reply. I still don’t understand this. According to the YouTube learning series, the DIET architecture has two token pathways: 1) sparse features and 2) pretrained embeddings. We can decide whether to include both or only one, and for the pretrained pathway we can even choose which language model to use. But I do not see these as hyperparameters in DIETClassifier, so I thought maybe the previous pipeline components decide this.

Could you please let me know how to control or change the language model in token pathway for DIET?
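
As a minimal sketch of how the two pathways are typically fed (assuming a Rasa 2.x-style config.yml; the specific tokenizer, n-gram settings, epochs, and model weights below are illustrative choices, not taken from this thread): the featurizers listed before DIETClassifier produce the features, with CountVectorsFeaturizer supplying the sparse ones and LanguageModelFeaturizer supplying the pretrained embeddings. DIETClassifier then consumes whatever features the earlier components have attached to the message.

```yaml
# Illustrative pipeline; values are assumptions, adjust to your data.
language: en
pipeline:
  - name: WhitespaceTokenizer
  # Sparse pathway: word-level and character n-gram bag-of-words features
  - name: CountVectorsFeaturizer
  - name: CountVectorsFeaturizer
    analyzer: char_wb
    min_ngram: 1
    max_ngram: 4
  # Dense pathway: pretrained language model embeddings
  - name: LanguageModelFeaturizer
    model_name: bert            # architecture of the language model
    model_weights: rasa/LaBSE   # pretrained weights to load
  # DIET uses the features produced by the components above
  - name: DIETClassifier
    epochs: 100
```

Under these assumptions, changing the language model means editing model_name and model_weights on LanguageModelFeaturizer, and using only one pathway means removing the featurizers for the other one from the pipeline.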
