Understanding Text Readability Measures in Sesotho
Exploring the application of classical readability formulas to measure text readability in Sesotho, along with considerations for automation and translation to English and Dutch. Various classic readability metrics such as Kincaid, Flesch, SMOG, Coleman-Liau, ARI, LIX, Dale-Chall are discussed in relation to Sesotho language text readability evaluation.
Download Presentation
Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
E N D
Presentation Transcript
Using classical readability formulas to measure text readability in Sesotho Sibeko Johannes Sibeko Johannes
Recognized language Official language Where is Sesotho spoken
QUICK BACKGROUND How could text readability be measured in Sesotho? How could this be automated? Translated texts English as helper language? Dutch as helper language? Classical readability metrics
CLASSIC READABILITY FORMULAS #?????? #?????????+ 11.8 #????????? #?????? 15.59 Kincaid = 0.39 #?????? #?????????+ 84.6 #????????? #?????? Flesch = 206.835 1.015 Syllable-based 30 SMOG #????????? = 3.1291 + 1.043 #???????????? ????? ? #?????? #?????????+ 100 #??????? ????? #????? ] Fog = 0.4[
CLASSIC READABILITY FORMULAS #??????? #??????? 0.296 #????????? #??????? 15.8 Coleman-Liau = 0.0588 #??????? #????? #????? #????????? 21.43 ARI = 4.7 + 0.5 Word-length based #???? ????? #????? #????? #????????? LIX = ? 100 + =#???? ????? #????????? RIX
CLASSIC READABILITY THEORIES Frequency-list based #????? #?????????+ 11.8 #????????? ????? #????? ? 0.1579 + 3.6365 Dale-Chall = 0.0496
CONNECTION TO CLARIN Syllabification system Syllabification corpus Machine Translation corpus Exam text domain corpus End goal: Text readability system