Alat Koreksi dan Rekontruksi Tulisan pada Dokumen Lama Bahasa Indonesia Berbasis Mini PC
Main Article Content
Abstract
In the digital era, preserving old documents to prevent damage is a significant challenge. One solution to this problem is to reconstruct damaged or lost documents using image processing and natural language processing technologies. This article discusses the design of a tool for correcting and reconstructing writing in old papers and documents that can be implemented on a mini PC. The tool uses state-of-the-art algorithms such as Convolutional Neural Network (CNN) for character recognition and Optical Character Recognition (OCR), as well as Image Inpainting and Sequence-to-Sequence (Seq2Seq) algorithms for document reconstruction. Test results show that this tool can recognize characters with high accuracy and reconstruct damaged or lost documents effectively.
Downloads
Article Details
Please find the rights and licenses in the Journal of Information Technology and Computer Engineering (JITCE).
1. License
The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author(s)’ Warranties
The author(s) warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary permissions to quote from other sources have been obtained by the author(s).
3. User Rights
JITCE adopts the spirit of open access and open science, which disseminates articles published as free as possible under the Creative Commons license. JITCE permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users will also need to attribute authors and JITCE on distributing works in the journal.
4. Rights of Authors
Authors retain the following rights:
- Copyright, and other proprietary rights relating to the article, such as patent rights,
- the right to use the substance of the article in future own works, including lectures and books,
- the right to reproduce the article for own purposes,
- the right to self-archive the article.
- the right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (Journal of Information Technology and Computer Engineering).
5. Co-Authorship
If the article was jointly prepared by other authors; upon submitting the article, the author is agreed on this form and warrants that he/she has been authorized by all co-authors on their behalf, and agrees to inform his/her co-authors. JITCE will be freed on any disputes that will occur regarding this issue.
7. Royalties
By submitting the articles, the authors agreed that no fees are payable from JITCE.
8. Miscellaneous
JITCE will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed and JITCE or its sublicensee has become obligated to have the article published. JITCE may adjust the article to a style of punctuation, spelling, capitalization, referencing and usage that it deems appropriate. The author acknowledges that the article may be published so that it will be publicly accessible and such access will be free of charge for the readers.
References
[2] Li, H., Chen, Z., Zhang, Z., & Zhang, H. “Document Image Restoration and Recognition: A Survey”. Journal of Imaging, 6(4), 46. 2020.
[3] Shi, W., Zhang, X., Wang, Y., & Gao, H. “Sequence-to-Sequence Based Document Image Rectification”. IEEE Transactions on Image Processing, 29, 6388-6398. 2020.
[4] Suryani, E., Iqbal, M., & Kurniawan, R. Perancangan Alat Koreksi dan Rekonstruksi Tulisan pada Makalah dan Dokumen Lama Berbasis Mini PC. Jurnal Informatika, 1(1), 1-10. 2022.
[5] Zhang, Y., Yan, H., Liu, Z., & Zhang, W. “An Image Restoration Method Based on Deep Learning and Inpainting for Ancient Document”. IEEE Access, 9, 17061-17071.2021.
[6] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. “You only look once: Unified, real-time object detection”. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. pp. 779-788
[7] Karpathy, A., & Fei-Fei, L. “Deep visual-semantic alignments for generating image descriptions”. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2015. pp. 3128-3137.
[8] Goodfellow, I., Bengio, Y., & Courville, A. Deep learning. MIT press. 2016.
[9] LeCun, Y., Bengio, Y., & Hinton, G. Deep learning. Nature, 521(7553), 436-444. 2015.
[10] He, K., Zhang, X., Ren, S., & Sun, J. “Deep residual learning for image recognition”. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. pp. 770-778.
[11] Simonyan, K., & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. 2014.
[12] Bahdanau, D., Cho, K., & Bengio, Y. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473. 2014.
[13] Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., ... & Rabinovich, A. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2015. pp. 1-9.
[14] Turgut, H. Optical Character Recognition for Historical Documents. Procedia Computer Science, 176, 292-299. 2020.
[15] Hong, S., Baek, J., Kim, J., & Lee, H. “Text Recognition in the Wild: A Survey”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(3), 770-788. 2020.
[16] Sutskever, I., Vinyals, O., & Le, Q. V. Sequence to Sequence Learning with Neural Networks. Advances in Neural Information Processing Systems, 27, 3104-3112. 2014.
[17] Cho, K., Van Merriënboer, B., Gulcehre, C., Bougares, F., Schwenk, H., & Bengio, Y. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv preprint arXiv:1406.1078. 2014.
[18] Chen, C., Chen, Y., Yang, J., & Lai, M. Design and Implementation of a Portable Device for the Correction and Reconstruction of Historical Documents Based on a Mini PC. Applied Sciences, 9(13), 2767. 2019.
[19] Chu, C. T., Kim, S. K., Lin, Y. A., Yu, Y., Bradski, G. R., & Ng, A. Y. “Video search results using text queries”. Proceedings of the 2007 ACM SIGMOD international conference on Management of data, 923-934. 2007.