Evaluate a model with a test set

How does a script to evaluate a model using a test set work? Thanks.

Please check the code here: rasa_core/evaluate.py at commit 3bc99cfd93a1572ab6b0a9571985a83cc932f05e in the RasaHQ/rasa_core repository on GitHub.
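
In case it helps before reading that file, here is a minimal sketch of the general idea behind an evaluation script: run the model on each held-out example, compare the prediction with the expected label, and summarise the results. This is not the code from rasa_core/evaluate.py. The names `evaluate`, `toy_predict`, and `toy_test_set`, as well as the labels, are hypothetical and only there for illustration.

```python
from typing import Callable, Dict, List, Tuple


def evaluate(
    predict: Callable[[str], str],
    test_set: List[Tuple[str, str]],
) -> Dict[str, object]:
    """Compare predictions with expected labels (hypothetical helper, not a Rasa API)."""
    failures = []
    correct = 0
    for example, expected in test_set:
        predicted = predict(example)
        if predicted == expected:
            correct += 1
        else:
            # Keep the mismatches so they can be inspected afterwards.
            failures.append((example, expected, predicted))
    # Guard against an empty test set to avoid dividing by zero.
    accuracy = correct / len(test_set) if test_set else 0.0
    return {"accuracy": accuracy, "failures": failures}


def toy_predict(text: str) -> str:
    """Stand-in for a trained model; a real script would load the model from disk."""
    return "utter_greet" if "hello" in text.lower() else "action_listen"


# Made-up test data: (input, expected label) pairs.
toy_test_set = [
    ("Hello there", "utter_greet"),
    ("bye", "utter_goodbye"),
    ("hello", "utter_greet"),
    ("ok", "action_listen"),
]

report = evaluate(toy_predict, toy_test_set)
print(report["accuracy"])
for example, expected, predicted in report["failures"]:
    print(f"{example!r}: expected {expected}, got {predicted}")
```

A real evaluation script would typically replace `toy_predict` with the loaded model and read the test set from a file, but the compare-and-count loop stays the same.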