Hi!
It would be nice to have an optional top3 (or even better topn) accuracy metric in the rasa_nlu.evaluate.py script. There are multiple use cases for this, like testing if your model is good in suggesting the top3 actions based on intents.
Thanks, L-P