HYBRID MULTIMODAL TEXT DIGITIZATION FOR PUBLISHING AND PRINTING

Authors

Kh. Kulchytska
Lviv Polytechnic National University, Ukraine
https://orcid.org/0000-0002-6184-988X

Abstract

This paper investigates AI application in text input for the publishing sector. It establishes a classification system for digitization methods based on text complexity and defines key selection criteria. To improve the processing of complex content, the author proposes a hybrid Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) approach, alongside a specialized multimodal algorithm integrated into publishing workflows.

Author Biography

Kh. Kulchytska, Lviv Polytechnic National University

Associate Professor, Multimedia Technologies Department, Institute of Printing Art and Media Technologies


Поліграфічні, мультимедійні та web-технології у цифровому середовищі. Том 1: колективна монографія

Pages

335–344

Published

June 5, 2026

Details about this monograph

ISBN-13 (15)

978-617-8254-58-2