How to configure new nlg server endpint

I have created rasa chatbot but it is template based. So I fine tuned gpt2 model on my dataset. and it is perfectly generating the response. But I don’t know how to use this new generation model to generate response . Somewhere i read that i need to change the NLG server end point. but I don’t know how to do this. Please help.