Digital Philology and Manuscript Sustainability: A Semantic Annotation Model for Classical Arabic Texts
Background. Efforts to sustain classical Arabic manuscripts are often confined to digitization that concentrates on image preservation and limited text markup. Such approaches do not fully capture the semantic richness and epistemological structure of the Islamic intellectual heritage.
Purpose. This study aims to develop a semantic annotation model tailored to classical Arabic texts, supporting digital philology and strengthening manuscript sustainability through machine-readable, concept-linked interpretations.
Method. Following a developmental qualitative research design, the study semantically annotated three classical manuscripts with a custom-built model based on RDF and an Islamic ontology. Domain experts in philology and computational linguistics evaluated the model against four criteria: semantic accuracy, contextual relevance, interoperability, and usability.
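As an illustration of what an RDF-based annotation might look like, the following minimal sketch serializes one annotated term as N-Triples. It is not the study's model: the namespaces under example.org, the `Annotation` class, the `denotes` property, and the sample term and concept are all hypothetical, chosen only to show the subject-predicate-object shape of such a record.

```python
from dataclasses import dataclass

# Hypothetical namespaces; the study does not publish its vocabulary.
MS = "http://example.org/manuscript/"
ONT = "http://example.org/islamic-ontology/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


@dataclass(frozen=True)
class Annotation:
    manuscript_id: str  # identifier of the manuscript, e.g. "ms1"
    span_id: str        # identifier of the annotated text span
    surface: str        # the Arabic term as it appears in the text
    concept: str        # local name of the (hypothetical) ontology concept


def escape_literal(text: str) -> str:
    """Escape backslashes, quotes, and line breaks for an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(a: Annotation) -> list[str]:
    """Return three triples: the type, the Arabic label, and the concept link."""
    subject = f"<{MS}{a.manuscript_id}/span/{a.span_id}>"
    return [
        f"{subject} <{RDF_TYPE}> <{ONT}Annotation> .",
        f'{subject} <{RDFS_LABEL}> "{escape_literal(a.surface)}"@ar .',
        f"{subject} <{ONT}denotes> <{ONT}{a.concept}> .",
    ]


# Illustrative input only, not an annotation taken from the study.
example = Annotation("ms1", "s42", "قياس", "Qiyas")
for line in to_ntriples(example):
    print(line)
```

Plain N-Triples is used here only because it needs no third-party library; a production pipeline would more plausibly rely on an established RDF toolkit and a published ontology.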
Results. The model achieved annotation accuracy above 91% across all manuscripts. Experts rated semantic precision (4.7/5) and contextual relevance (4.6/5) as its strongest aspects. The system successfully mapped technical terms and logical concepts within classical texts and linked them across manuscripts. A case study demonstrated the model’s effectiveness in identifying relationships between epistemological terms and enabling thematic exploration.
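The cross-manuscript linking reported here can be pictured with a small sketch that groups annotations by concept and keeps the concepts attested in more than one manuscript. The function name `link_across_manuscripts` and the sample rows are hypothetical and are not data from the study; the actual system presumably performs this kind of lookup over its RDF store.

```python
from collections import defaultdict

# Hypothetical (manuscript, span, concept) rows; not data from the study.
annotations = [
    ("ms1", "s1", "Qiyas"),
    ("ms1", "s2", "Burhan"),
    ("ms2", "s7", "Qiyas"),
    ("ms3", "s3", "Burhan"),
    ("ms3", "s9", "Hadd"),
]


def link_across_manuscripts(rows):
    """Map each concept to the sorted manuscripts that mention it,
    keeping only concepts attested in at least two manuscripts."""
    by_concept = defaultdict(set)
    for manuscript, _span, concept in rows:
        by_concept[concept].add(manuscript)
    return {c: sorted(ms) for c, ms in by_concept.items() if len(ms) >= 2}


links = link_across_manuscripts(annotations)
print(links)
# {'Qiyas': ['ms1', 'ms2'], 'Burhan': ['ms1', 'ms3']}
```

A concept that appears in only one manuscript, such as the illustrative "Hadd" above, is dropped because it offers no link to follow.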
Conclusion. This semantic annotation model advances digital philology by enabling structured, concept-based analysis of classical Arabic texts. It bridges computational methods with Islamic textual traditions and opens new pathways for collaborative, sustainable, and meaningful engagement with manuscript heritage.
Copyright (c) 2025 Surip Stanislaus, Mona Abdallah

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.