Rasa dataset versioning in CI/CD

Hello everyone,

I’m setting up a CI/CD pipeline for my Rasa project and I’m looking for advice on managing dataset versioning as part of this process. If anyone has experience with this, I’d love to hear your insights!

Specifically, how do you handle versioning for NLU data? I’m considering storing different dataset versions in AWS S3, but I’m curious if there are better options out there.
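
For illustration, here is a rough sketch of one way the S3 idea could work. It is an assumption for discussion, not an established Rasa practice. It derives a version ID from a content hash of the NLU data directory and embeds it in the object key. The names `dataset_version` and `s3_key`, the `nlu-datasets` prefix, and the `data.tar.gz` archive name are all hypothetical.

```python
import hashlib
from pathlib import Path


def dataset_version(data_dir: str) -> str:
    """Return a short content hash over all files under data_dir."""
    digest = hashlib.sha256()
    root = Path(data_dir)
    # Sort paths so the hash does not depend on filesystem ordering.
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        # Include the relative path so renames also change the version.
        digest.update(path.relative_to(root).as_posix().encode("utf-8"))
        digest.update(b"\0")
        digest.update(path.read_bytes())
        digest.update(b"\0")
    return digest.hexdigest()[:12]


def s3_key(data_dir: str, prefix: str = "nlu-datasets") -> str:
    """Build a hypothetical S3 object key that embeds the dataset version."""
    return f"{prefix}/{dataset_version(data_dir)}/data.tar.gz"
```

In a pipeline, the resulting key could be passed to something like boto3's `upload_file`, and the same version string recorded alongside the trained model so each model can be traced back to the data it was trained on.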

Any tips, examples, or best practices you can share would be really appreciated!

Thanks in advance!