Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.
Data Brief. 2017 Apr;11:147-151
Authors: Zerrouki T, Balla A
Abstract Arabic diacritics are often missed in Arabic scripts. This feature is a handicap for new learner to read َArabic, text to speech conversion systems, reading and semantic analysis of Arabic texts. The automatic diacritization systems are the best solution to handle this issue.