Here is my config, i just upgraded to 3.x with JiebaTokenizer and also with bert, it actually has errors, any thought?
version: "3.0"
language: zh
pipeline:
- name: JiebaTokenizer
dictionary_path: ./data/jieba_dict/
- name: LanguageModelFeaturizer
# Name of the language model to use
model_name: "bert"
# Pre-Trained weights to be loaded
model_weights: "bert-base-chinese"
- name: CountVectorsFeaturizer
- name: CountVectorsFeaturizer
analyzer: char_wb
min_ngram: 1
max_ngram: 4
- name: DIETClassifier
epochs: 300
constrain_similarities: true
entity_recognition: false
evaluate_on_number_of_examples: 6000
evaluate_every_number_of_epochs: 5
tensorboard_log_directory: "./tensorboard"
tensorboard_log_level: "epoch"
ranking_length: 5
number_of_negative_examples: 20
policies:
- name: MemoizationPolicy
- name: TEDPolicy
max_history: 5
epochs: 100
- name: RulePolicy
Here are the log outputs
2022-02-03 00:34:07 INFO transformers.modeling_tf_utils - loading weights file https://cdn.huggingface.co/bert-base-chinese-tf_model.h5 from cache at /root/.cache/torch/transformers/86a460b592673bcac3fe5d858ecf519e4890b4f6eddd1a46a077bd672dee6fe5.e6b974f59b54219496a89fd32be7afb020374df0976a796e5ccd3a1733d31537.h5
2022-02-03 00:34:12 INFO transformers.modeling_tf_utils - Layers from pretrained model not used in TFBertModel: ['nsp___cls', 'mlm___cls']
2022-02-03 00:36:21 INFO rasa.engine.training.hooks - Restored component 'CountVectorsFeaturizer' from cache.
2022-02-03 00:38:11 INFO rasa.engine.training.hooks - Restored component 'CountVectorsFeaturizer' from cache.
2022-02-03 00:40:17 INFO rasa.engine.training.hooks - Starting to train component 'DIETClassifier'.
Epochs: 0% 0/300 [00:00<?, ?it/s]Traceback (most recent call last):
File "/usr/local/lib/python3.7/dist-packages/rasa/engine/graph.py", line 458, in __call__
output = self._fn(self._component, **run_kwargs)
File "/usr/local/lib/python3.7/dist-packages/rasa/nlu/classifiers/diet_classifier.py", line 919, in train
shuffle=False, # we use custom shuffle inside data generator
File "/usr/local/lib/python3.7/dist-packages/rasa/utils/tensorflow/temp_keras_modules.py", line 181, in fit
tmp_logs = train_function(iterator)
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/def_function.py", line 885, in __call__
result = self._call(*args, **kwds)
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/def_function.py", line 917, in _call
return self._stateless_fn(*args, **kwds) # pylint: disable=not-callable
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/function.py", line 3040, in __call__
filtered_flat_args, captured_inputs=graph_function.captured_inputs) # pylint: disable=protected-access
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/function.py", line 1964, in _call_flat
ctx, args, cancellation_manager=cancellation_manager))
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/function.py", line 596, in call
ctx=ctx)
File "/usr/local/lib/python3.7/dist-packages/tensorflow/python/eager/execute.py", line 60, in quick_execute
inputs, attrs, num_outputs)
tensorflow.python.framework.errors_impl.InvalidArgumentError: ConcatOp : Dimensions of inputs should match: shape[0] = [64,33,128] vs. shape[1] = [64,32,768]
[[node rasa_sequence_layer_text/rasa_feature_combining_layer_text/concatenate_sparse_dense_features_text_sequence/concat (defined at /lib/python3.7/dist-packages/rasa/utils/tensorflow/rasa_layers.py:339) ]] [Op:__inference_train_function_719741]
Errors may have originated from an input operation.
Input Source operations connected to node rasa_sequence_layer_text/rasa_feature_combining_layer_text/concatenate_sparse_dense_features_text_sequence/concat:
rasa_sequence_layer_text/rasa_feature_combining_layer_text/concatenate_sparse_dense_features_text_sequence/dropout/dropout/Mul_1 (defined at /lib/python3.7/dist-packages/rasa/utils/tensorflow/rasa_layers.py:309)
IteratorGetNext (defined at /lib/python3.7/dist-packages/rasa/utils/tensorflow/temp_keras_modules.py:181)
Function call stack:
train_function
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/bin/rasa", line 8, in <module>
sys.exit(main())
File "/usr/local/lib/python3.7/dist-packages/rasa/__main__.py", line 121, in main
cmdline_arguments.func(cmdline_arguments)
File "/usr/local/lib/python3.7/dist-packages/rasa/cli/train.py", line 59, in <lambda>
train_parser.set_defaults(func=lambda args: run_training(args, can_exit=True))
File "/usr/local/lib/python3.7/dist-packages/rasa/cli/train.py", line 103, in run_training
finetuning_epoch_fraction=args.epoch_fraction,
File "/usr/local/lib/python3.7/dist-packages/rasa/api.py", line 117, in train
finetuning_epoch_fraction=finetuning_epoch_fraction,
File "/usr/local/lib/python3.7/dist-packages/rasa/model_training.py", line 171, in train
**(nlu_additional_arguments or {}),
File "/usr/local/lib/python3.7/dist-packages/rasa/model_training.py", line 232, in _train_graph
is_finetuning=is_finetuning,
File "/usr/local/lib/python3.7/dist-packages/rasa/engine/training/graph_trainer.py", line 105, in train
graph_runner.run(inputs={PLACEHOLDER_IMPORTER: importer})
File "/usr/local/lib/python3.7/dist-packages/rasa/engine/runner/dask.py", line 101, in run
dask_result = dask.get(run_graph, run_targets)
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 558, in get_sync
**kwargs,
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 496, in get_async
for key, res_info, failed in queue_get(queue).result():
File "/usr/lib/python3.7/concurrent/futures/_base.py", line 428, in result
return self.__get_result()
File "/usr/lib/python3.7/concurrent/futures/_base.py", line 384, in __get_result
raise self._exception
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 538, in submit
fut.set_result(fn(*args, **kwargs))
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 234, in batch_execute_tasks
return [execute_task(*a) for a in it]
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 234, in <listcomp>
return [execute_task(*a) for a in it]
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 225, in execute_task
result = pack_exception(e, dumps)
File "/usr/local/lib/python3.7/dist-packages/dask/local.py", line 220, in execute_task
result = _execute_task(task, data)
File "/usr/local/lib/python3.7/dist-packages/dask/core.py", line 119, in _execute_task
return func(*(_execute_task(a, cache) for a in args))
File "/usr/local/lib/python3.7/dist-packages/rasa/engine/graph.py", line 467, in __call__
) from e
rasa.engine.exceptions.GraphComponentException: Error running graph component for node train_DIETClassifier4.