Datasets benchmark for Arabic language

I am looking for datasets for the Arabic language. I have many reasons for that:

  1. Train my chatbot on an existing dataset to check the performance.
  2. I don’t have enough data to train my model at the moment.

Does anyone know how can I get this/these dataset(s)?

@Pain Hi, Well I don’t know the Arabic language, but may be this repo will guide you with the initial project start: https://github.com/RedaElmar/CovidBot-Telegram

1 Like

If interested, my bot contains English, French, Armenian, and Arabic (with Arab and Latin letters): https://github.com/ChrisRahme/fyp-chatbot

1 Like