Utterance Duplication

Hello,

Is there anyway to check utterance duplication in training set? Especially working on large training sets and similar utterances to each other, it might be an issue.

Thanks

@serdar Hi, do you mean checking for duplicate utterance with different intents in training data? If they are assigned to the same intent, duplicate examples would be filtered out.

Hi @dakshvar22, I mean the utterance might fall under the different intents. Is there anyway to check it? Thanks

No, that wouldn’t be flagged. I think it’s a good idea to include it. Would you be up to open a feature request on our GH repo?

Hi @dakshvar22, I open a feature request on Git as you suggest. Thanks