NAME
    Lingua::AR::Tashkeel - Subroutines for handling Arabic Vowels and Vowel marks

SYNOPSIS
        use Lingua::AR::Tashkeel qw(strip prune fix);

        # Strip all short vowels
        strip('مَكَرُونَة'); ‎# => مكرونة

        # Heuristic for removing short vowels without causing ambiguity
        prune('فَتَّة'); ‎# => فتّة

        # Heuristic for fixing mixed up short and long vowels
        fix('ماحشي'); ‎# => مَحشي

DESCRIPTION
    Subroutines for working with Arabic long vowels (حروف علة) and short
    vowels (حركات تشكيل).

METHODS AND ARGUMENTS
    strip($string)
        Strips away all Arabic short vowels (Tashkeel).

    prune($string)
        Heuristic for pruning the short vowels that a native speaker
        wouldn't write, because leaving them out doesn't introduce
        ambiguity. This is often preferable to strip, since Shaddas, and
        Dammas that indicate a passive verb, are useful clues that one
        might want to keep.

    fix($string)
        Transliterating from a romanized representation of Arabic to actual
        Arabic script often gives incorrect results regarding short and
        long vowels. This subroutine implements a heuristic for fixing such
        mix-ups.

GIT REPOSITORY
    http://github.com/athreef/Lingua-AR-TASHKEEL

AUTHOR
    Ahmad Fatoum <athreef@cpan.org>, http://a3f.at

COPYRIGHT AND LICENSE
    Copyright (C) 2016 Ahmad Fatoum

    This library is free software; you can redistribute it and/or modify it
    under the same terms as Perl itself.
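
EXAMPLE
    To make concrete what stripping short vowels amounts to, here is a
    minimal sketch that does not use this module. The helper name
    strip_marks is hypothetical, and the code point range U+064B to U+0652
    (tanween, fatha, damma, kasra, shadda, sukun) is an assumption about
    which marks count as Tashkeel; the module's own strip may cover a
    different set.

```perl
use strict;
use warnings;
use utf8;                              # source contains literal Arabic text
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical helper, NOT part of Lingua::AR::Tashkeel.
# Removes the combining marks U+064B..U+0652 (assumed Tashkeel range).
sub strip_marks {
    my ($text) = @_;
    $text =~ s/[\x{064B}-\x{0652}]//g;
    return $text;
}

print strip_marks('مَكَرُونَة'), "\n";   # prints مكرونة
```

    A plain range removal like this also drops shaddas, which is the kind
    of loss the prune heuristic described above is meant to avoid.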