Pipeline for indonesia language

What is the best pipeline settings for indonesian language?

Hi Diaz,

it’s hard to upfront say “this pipeline always works best in language X” because it mainly depends more on the specific dataset that you have. A Dutch/English/German/Indonesian chatbot for customer service might need a different pipeline if it used for customer service compared to when it is used for HR.

That said, despite not being very familiar with Indonesian, I am trying to make more compatible tools for Rasa to ensure that we also support Non-English languages. Have you seen the rasa nlu examples project? It offers dense embeddings for Indonesian via the BytePair Embeddings and FastText. Typically these embeddings boost entity detection performance.

1 Like