Algorithms for contactless scanning of book monuments
https://doi.org/10.20913/2618-7575-2021-3-9-15
Abstract
The article is devoted to the questions of cultural heritage preservation by creating the digital collection of book monuments. The original documents are monuments of book culture and their dilapidated state requires careful handling, splitting of documents for scanning is extremely undesirable. The market does not present the equipment for contactless scanning of books without embroidering, therefore an algorithm that allows digitalizing book monuments in a contactless way has been developed. The technique has been constructed using an algorithm based on the projection of the light grid on the object scanned. The authors propose a sequence of actions consisting of image processing and comparing the results between two images. The first snapshot determines the initial parameters of the grid; the second snapshot determines the actual distortion of the test snapshot. Subsequent mathematical processing allows getting scanned images without absence of geometric distortions of the scanned page due to the system of using the two-dimensional array of corrections. The application of the system has been modeled on the example of «The legend of the destruction of Siberian cities of Tara and Tyumen by the lesser Tatars / / Collection of moral stories, words, lives and other articles [hand.]». The evaluation parameters of the simulation result have been the following: text distinctness, absence of geometric distortions, color quality, uniformity of document scanning quality within a single book, etc., as checked and recognized as high by the experts.
The experience described opens possibilities of book monuments digitization using the new algorithm. The development of the system is aimed at expanding the database of objects of material culture to be digitized, perfecting the software, improving the quality of digital images, as well as the capabilities of image recognition and search for the document itself and information it contains.
About the Authors
S. I. SivkovRussian Federation
Sivkov Stepan Igorevich, Head of the Department of Technical Control and Management Systems
Lesnoy, Sverdlovsk Region
S. P. Simakov
Russian Federation
Simakov Sergey Pavlovich, Director
Tyumen
A. I. Vinokur
Russian Federation
Vinokur Aleksey Iosifovich, Professor of the Department of Informatics and Information Technologies
Moscow
References
1. UNESCO Charter on the Preservation of the Digital Heritage. United Nations. URL: http://www.un.org/russian/documen/declarat/digital_charter.pdf (accessed 15.03.2021).
2. Meeting of the Presidium of the Presidential Council for Economic Modernization and Innovative Development of Russia: transcript. Pravitel’stvo Rossii. URL: http://government.ru/news/4008/ (accessed 15.03.2021). (In Russ.).
3. Federal Law no. 463-FZ of December 22, 2020 “On amendments to the Federal Law “On librarianship” in terms of improving the procedure for state registration of book monuments”. Ofitsial’nyi internet-portal pravovoi informatsii. URL: http://publication.pravo.gov.ru/Document/View/0001202012220093 (accessed 15.03.2021). (In Russ.).
4. Decree of the President of the Russian Federation no. 204 o 07.05.2018 “On the national goals and strategic objectives of the Russian Federation development for the period upto 2024”. Ofitsial’nyi internet-portal pravovoi informatsii. URL: http://publication.pravo.gov.ru/Document/View/0001201805070038?index=0&rangeSize=1 (accessed 15.03.2021). (In Russ.).
5. Vinokur A. I., Artyushina I. L. Information systems: problems of image registration and reproduction. Izvestiya vuzov. Problemy poligrafii i izdatel’skogo dela, 2011, 4: 75–82. (In Russ.).
6. Shabanov A. V. Comparison of installations to digitize Russian old-printed and handwritten books and methods of image processing. Bibliosphere, 2010, 2: 30–32. (In Russ.).
7. Yezhova N. M. On the issue of search engine possibilities in electronic libraries of digital educational resources. Sovremennye informatsionnye tekhnologii i pis’mennoe nasledie: ot drevnikh tekstov k elektronnym bibliotekam: materialy Mezhdunar. nauch. konf. Kazan, 2008: 99-104. (In Russ.).
8. Zholobov O. F. Handwritten heritage and electronic and computer technologies. Sovremennye informatsionnye tekhnologii i pis’mennoe nasledie: ot drevnikh tekstov k elektronnym bibliotekam: materialy Mezhdunar. nauch. konf. Kazan, 2008: 111–113. (In Russ.).
9. Kornienko S. I., Cherepanov F. M., Yasnitsky L. N. Text recognition of handwritten and old printed books based on neural network technologies. Sovremennye informatsionnye tekhnologii i pis’mennoe nasledie: ot drevnikh tekstov k elektronnym bibliotekam: materialy Mezhdunar. nauch. konf. Kazan, 2008: 155–156. (In Russ.).
10. Emanov A. G. (ed.) Povest’ o gorodakh Tare i Tyumeni [The story of Tara and Tyumen cities]. Tyumen, Tyumen State Univ., 2021. 296 p. (In Russ.).
11. A XVII century manuscript on the nomadic raid on Tara and Tyumen was first published at the Urals. TASS. URL: https://tass.ru/ural-news/11185017 (accessed 15.03.2021). (In Russ.).
Review
For citations:
Sivkov S.I., Simakov S.P., Vinokur A.I. Algorithms for contactless scanning of book monuments. Proceedings of SPSTL SB RAS. 2021;(3):9-15. (In Russ.) https://doi.org/10.20913/2618-7575-2021-3-9-15