Important New Developments in Arabographic Optical Character Recognition

Abstract

The Open Islamicate Texts Initiative (OpenITI) team, building on the foundational open-source Optical Character Recognition (OCR) work of the Leipzig University (LU) Alexander von Humboldt Chair for Digital Humanities, has achieved OCR accuracy rates in the high nineties for printed classical Arabic-script texts. These numbers are based on our tests of seven different Arabic-script texts of varying quality and typefaces, totaling over 7,000 lines.

https://doi.org/10.7916/alusur.v25i1.6996

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Copyright (c) 2017 Matthew Thomas Miller