International Journal of Computer Theory and Engineering

Editor-In-Chief: Prof. Mehmet Sahinoglu
Frequency: Quarterly
ISSN: 1793-8201 (Print), 2972-4511 (Online)
Publisher:IACSIT Press

OPEN ACCESS
4.1
CiteScore

⚠️ Important Security Notice: Beware of Fraudulent Emails Impersonating IJCTE Officials
IJCTE 2025 Vol.17(4): 189-201
DOI: 10.7763/IJCTE.2025.V17.1381

Beyond OCR: GAN-Driven Restoration of Severely Degrading Document

Yosua Kristianto* and Benfano Soewito
Computer Science Department, BINUS Graduate Program—Master of Computer Science, Bina Nusantara University, Jakarta 11480, Indonesia
Email: yosua.kristianto@binus.ac.id (Y.K.); bsoewito@binus.edu (B.S.)
*Corresponding author

Manuscript received November 20, 2024; revised January 31, 2025; accepted June 23, 2025; published November 11, 2025

Abstract—The recent advancements in deep learning have opened avenues for substantial improvements in text extracting accuracy. These technological innovations have significantly enhanced the capabilities of existing tools, particularly in the realm of Optical Character Recognition (OCR). The OCR plays a pivotal role in digitizing image-based texts. To enhance its efficacy, and advanced preprocessing model aimed at improving text clarity and readability should be implemented prior to the text extraction process. Our works explore the application of Generative Adversarial Networks (GANs) for super resolution tasks, aiming to enhance image quality by increasing resolution. Various GAN architectures and training techniques are being experimented including the usage of state-of-the-art model, Super Resolution Generative Adversarial Network (SRGAN). While the results showed some improvements quantitatively, it also highlighted areas for further optimizations. The findings suggest that GAN-based approach holds promise for super resolution tasks in enhancing text extraction result for document image. Significantly, the experiment showed that OCR successfully improved even when dealing with photos that had experienced 75% damage, which in this stage, an image had experienced substantial information loss. Future work will focus on addressing the identified challenges and enhancing model performance.

Keywords—image enhancement, super resolution, generative adversarial network

[PDF]

Cite: Yosua Kristianto and Benfano Soewito, "Beyond OCR: GAN-Driven Restoration of Severely Degrading Document," International Journal of Computer Theory and Engineering, vol. 17, no. 4, pp. 189-201, 2025.

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions

Menu