Can we control the number of stories generated by checkpoint?

Hi there!

We currently use a lot of checkpoints in our stories (probably too many). Right now we have 151 story blocks, which produce 380,000 processed trackers during training and, in turn, 493,971 training samples for Keras.

I’m not exactly sure how the stories are generated, but is there a way to:

  1. Control the total number of stories that would be generated?
  2. Control which checkpoints generate more stories than others?

Let me give an example of what I mean by point (2). With stories like the ones below, one of the “targets” of a checkpoint might be much more relevant than the others because we expect it to occur much more often.

## select
* show_me_items
  - action_display_item_list
> select_item

## select by clicking on button -> most should do that
> select_item
* /select{"item_id": 123456}
> post_selection

## select by natural language -> few would do that
> select_item
* chose_item_by_position
  - action_set_selection
  - utter_ask_confirmation
* affirm
> post_selection

## select by natural language -> few would do that
> select_item
* chose_item_by_position
  - action_set_selection
  - utter_ask_confirmation
* deny
  - utter_sorry
  - utter_select_item_again
> select_item
...

Basically, we would love to know how much control we have over checkpoints, so that adding a feature that points to > select_item, for example, does not blow up the number of generated stories.

Thanks a lot for the information! Kind regards, Nicolas

@nbeuchat In general we don’t recommend using checkpoints this heavily, as it leads to exactly the kinds of issues you’re describing. Which version of Rasa are you using? There’s an --augmentation flag that controls how many stories are generated in addition to the ones you wrote yourself; lowering it reduces that number, and setting it to 0 turns augmentation off. See the Command Line Interface docs.
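
To give a rough feel for why heavy checkpoint use inflates the training data, here is a minimal Python sketch that counts how many distinct block sequences the stories above can be chained into. It is a simplified model and not Rasa's actual story-generation code: the `BLOCKS` table, the `count_paths` helper, the `max_blocks` cap (needed because the deny story loops back to `select_item`), and the extra `browse` block are all assumptions made for illustration.

```python
from collections import defaultdict

# Each story block is (name, start_checkpoint, end_checkpoint).
# None means the block has no checkpoint on that side.
# This table mirrors the four stories in the question (names are made up).
BLOCKS = [
    ("select", None, "select_item"),
    ("click", "select_item", "post_selection"),
    ("nl_affirm", "select_item", "post_selection"),
    ("nl_deny", "select_item", "select_item"),  # loops back to the checkpoint
]


def count_paths(blocks, max_blocks):
    """Count chains of at most `max_blocks` blocks.

    A chain starts at a block without a start checkpoint and ends at a block
    whose end checkpoint is missing or is not picked up by any other block.
    Chains cut off by the length cap are not counted.
    """
    by_start = defaultdict(list)
    for _name, start, end in blocks:
        by_start[start].append(end)

    def walk(checkpoint, remaining):
        if remaining == 0:
            return 0  # length cap reached before the chain terminated
        total = 0
        for end in by_start.get(checkpoint, []):
            if end is None or end not in by_start:
                total += 1  # chain terminates here
            else:
                total += walk(end, remaining - 1)
        return total

    return walk(None, max_blocks)


# Hypothetical new feature that also leads into > select_item.
extended = BLOCKS + [("browse", None, "select_item")]

print(count_paths(BLOCKS, 4))    # 6
print(count_paths(extended, 4))  # 12
```

In this toy model, one extra block that ends in `> select_item` doubles the number of chains, because every continuation behind the checkpoint is reused for the new entry point. The real numbers in Rasa also depend on the augmentation setting and on how loops are handled, which this sketch does not attempt to reproduce.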