Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11851/12717
Title: Büyük Dil Modeli Tabanli Veri Artirimi ile Türkçe Varlık İsmi Çıkarımı
Other Titles: Large Language Model Based Data Augmentation for Turkish Named Entity Recognition
Authors: Deniz, Oguz
Unsoy, N. Ceyda
Eravci, Bahaeddin
Keywords: Data Augmentation
Large Language Models (LLM)
Named Entity Recognition (NER)
Natural Language Processing
Binary Alloys
Large Datasets
Natural Language Processing Systems
Data Augmentation
Language Model
Language Processing
Large Language Model
Model-Based OPC
Named Entity Recognition
Natural Language Processing
Natural Languages
Turkishs
Cost Effectiveness
Publisher: Institute of Electrical and Electronics Engineers Inc.
Abstract: Named Entity Recognition (NER) plays a fundamental role in identifying and classifying named entities within texts. However, in resource-scarce languages and applications - particularly in Turkish - the lack of annotated data leads to a decline in model performance. In this study, synthetic examples were generated using Large Language Models (LLMs) to augment the existing primary dataset, with the aim of enhancing the k-shot learning performance of NER models. Experimental results demonstrate that models trained on the augmented dataset achieve performance improvements by a factor of 40 to 60 compared to those trained on the original dataset, indicating that the proposed method offers a cost-effective and viable alternative for resource-scarce applications. © 2025 Elsevier B.V., All rights reserved.
Description: Isik University
URI: https://doi.org/10.1109/SIU66497.2025.11112036
https://hdl.handle.net/20.500.11851/12717
ISBN: 9798331566555
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Show full item record



CORE Recommender

Page view(s)

8
checked on Oct 20, 2025

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.