Hi @koaning, thank you so much for your answer! Actually, I'm not dealing with Arabic script, but with Arabic names written in Latin letters, for example “Maryam”, “Alaa”, “Samir”, etc. I need to handle those names as well as French and English names.
My chatbot is trained in French. I've managed to extract first names with a combination of CRFEntityExtractor, spaCy and lookup tables. But last names are much harder! A last name can consist of one, two or more words, last names vary a lot from country to country, and you can't just enumerate them all in a lookup table. That's where I'm stuck!
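
To make the problem concrete, here is a minimal sketch of the lookup-table idea in plain Python. It is not my actual Rasa pipeline: the `FIRST_NAMES` set and the `find_first_names` function are hypothetical and only illustrate the matching step. It finds a first name that is in the table, but a multi-word last name like “Ben Ali” is missed because it can't be listed in advance.

```python
import re

# Hypothetical lookup table; a real one would be loaded from a file.
FIRST_NAMES = {"maryam", "alaa", "samir", "marie", "john"}


def find_first_names(text, lookup=FIRST_NAMES):
    """Return the tokens of `text` found in the lookup table (case-insensitive)."""
    # Split into runs of letters (Unicode-aware, so accented letters are kept).
    tokens = re.findall(r"[^\W\d_]+", text)
    return [token for token in tokens if token.lower() in lookup]


# "Maryam" is matched, the last name "Ben Ali" is not in any table.
print(find_first_names("Bonjour, je m'appelle Maryam Ben Ali"))
```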