ТЕСТУВАННЯ ЛОГІЧНИХ ЗДІБНОСТЕЙ ВЕЛИКИХ МОВНИХ МОДЕЛЕЙ
Abstract
This work is devoted to investigating the performance of Large Language Models (LLMs) in solving logical problems in Ukrainian, where key words are replaced with nonsensical ones to challenge the models' reliance on prior knowledge. It highlights a study comparing the abilities of four models—ChatGPT 3.5, ChatGPT 4.0, Copilot, and Gemini—across different testing scenarios, including both isolated and contextual problem-solving. The findings reveal that all models significantly outperform random guessing, with ChatGPT 4.0 showing exceptionally high accuracy, suggesting its potential in applications requiring complex logical reasoning.

Радіоелектроніка та молодь у XXI столітті. Т. 6 : Конференція "Інформаційні інтелектуальні системи": матеріали 28-го Міжнар. молодіж. форуму, 16–18 квітня 2024 р.
Downloads
Pages
86-88
Published
December 12, 2024
Copyright (c) 2024 Press of the Kharkiv National University of Radioelectronics
Details about this monograph
ISBN-13 (15)
978-966-659-396-5