Important New Developments in Arabographic Optical Character Recognition

Matthew Thomas Miller

doi:10.7916/alusur.v25i1.6996

Vol. 25 No. 1 (2017), Research Articles

Vol. 25 No. 1 (2017)

Important New Developments in Arabographic Optical Character Recognition

Research Articles

https://doi.org/10.7916/alusur.v25i1.6996

Published 2017-11-15

Matthew Thomas Miller

Matthew Thomas Miller

PDF

Abstract

The Open Islamicate Texts Initiative (OpenITI) team1 —building on the foundational opensource OCR work of the Leipzig University (LU) Alexander von Humboldt Chair for Digital Humanities—has achieved Optical Character Recognition (OCR) accuracy rates for printed classical Arabic-script texts in the high nineties. These numbers are based on our tests of seven different Arabic-script texts of varying quality and typefaces, totaling over 7,000 lines

https://doi.org/10.7916/alusur.v25i1.6996

PDF

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Most read articles by the same author(s)

Matthew Thomas Miller, The Poetics of the Sufi Carnival , Al-ʿUsur al-Wusta: Vol. 30 (2022)