ДОСЛІДЖЕННЯ МЕТОДІВ ТА МОДЕЛЕЙ АВТОМАТИЧНОГО РОЗПІЗНАВАННЯУЧАСНИКІВ АУДІО РОЗМОВИ
Abstract
The object of the study is the process of automatic speech recognition (ASR) and one of the branch of it – speaker recognition or diarization (SD).
The purpose of the work is to study the stages of the process of speech and speakers recognition, the analysis of methods and models of machine learning and neural networks, as well as modern frameworks for training speech recognition models and diarization. Based on the received results of the study it would be selected the existing models, systems or products that would play a role of starting point in the creation of a custom ASR and SD models for
enhancing those processes and bringing more values and benefits in different areas of human being.

Радіоелектроніка та молодь у XXI столітті. Т. 6 : Конференція "Інформаційні інтелектуальні системи": матеріали 28-го Міжнар. молодіж. форуму, 16–18 квітня 2024 р.
Downloads
Pages
428-429
Published
December 12, 2024
Copyright (c) 2024 Press of the Kharkiv National University of Radioelectronics
Details about this monograph
ISBN-13 (15)
978-966-659-396-5