Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stemming "فاطفالهم" #22

Open
LeviticusMB opened this issue Feb 12, 2018 · 3 comments
Open

Stemming "فاطفالهم" #22

LeviticusMB opened this issue Feb 12, 2018 · 3 comments

Comments

@LeviticusMB
Copy link

First of all, I do not speak any Arabic whatsoever, but we're using Snowball in one of our products and we're now adding support for Arabic.

According to the README (and Google Translate seems to agree with this), the stem of "فاطفالهم" should be "اطفال". However, both our code, after linking the current master of this project, and http://arabicstemmer.com/ stems this word as "فاطفال".

Is this a bug or am I doing something wrong?

@assem-ch
Copy link
Owner

Yes, something is wrong, it should be اطفال, we ll investigate it , thanks

@LeviticusMB
Copy link
Author

Any news?

@assem-ch
Copy link
Owner

assem-ch commented Mar 9, 2018

@LeviticusMB Sorry for being late to reply, It is fixed now to give the same as in the example, though فاطفالهم is a wrong writing for فأطفالهم (Alef instead of Alef-Hamza), so I will think if we need to consider common typos in the stemmer or not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants