Dönüstürücü dil modellerine etkili hassas ayar yapmak için veri mühendisligi yöntemleri

Zengin, Muhammed Said

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11851/10582

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Kutlu, Mücahid	-
dc.contributor.author	Zengin, Muhammed Said	-
dc.date.accessioned	2023-08-20T19:42:27Z	-
dc.date.available	2023-08-20T19:42:27Z	-
dc.date.issued	2022	-
dc.identifier.uri	https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=RsTBl6RWK25OBMIKtIgYYaEOlQc7_woO7U_oVzBpk90EV8RUvZXg_OuX38f9JpI4	-
dc.identifier.uri	https://hdl.handle.net/20.500.11851/10582	-
dc.description.abstract	Geleneksel yöntemlerle metinden öznitelik çıkarmak ve bir doğal dil işleme görevini yerine getirmek mümkündür, fakat kısıtlı miktardaki etiketli veriyle cümle yapısı ve kelime vektörleri yeterince öğrenilmediği için model performansı kısıtlı kalmaktadır. Bu sebeple son yıllarda araştırmacılar önceden eğitilmiş dil modellerine hassas ayar yapmayı, geleneksel yöntemlere göre daha çok tercih etmektedir. Büyük miktarda veriyle hazırlanan önceden eğitilmiş dönüştürücü dil modelini kullanarak belirli bir görev üzerinde hassas ayar yapmak birçok doğal dil işleme görevinde en yüksek performansı vermektedir. Etiketli veri hazırlamak maliyetli bir işlem olduğu ve etiketli veri sınırlı bir kaynak olduğu için araştırmacılar az veri kullanarak daha iyi sonuç alma yöntemlerini incelemektedir. Bu sebeple veri arttırma yöntemleri olarak aktif öğrenme ve zayıf denetim yolları kullanılmıştır. Aynı zamanda yarı denetimli ve denetimsiz yöntemler de üzerinde çalışılan araştırma konuları olmuştur. Bu tezin kapsamı ise, kısıtlı miktardaki etiketli veri kullanılarak, dönüştürücü dil modellerine en etkili hassas ayar yapma yöntemini araştırmaktır. Etkili hassas ayar yapmak için çapraz dilli eğitim, zayıf denetim, geri çeviri, aşırı örnekleme, aktif öğrenme gibi veri mühendisliği yöntemleri kullanılmıştır. Bu tez kapsamında incelenen konu üç farklı doğal dil işleme görevi üzerinde incelenmiştir. Bu görevler, kontrole değer iddiaların tespiti, taraf tespiti ve konum tespitidir.	en_US
dc.description.abstract	It is possible to extract features from the text and perform a natural language processing task with traditional methods, but the model performance is limited because the sentence structure and word vectors are not learned enough with the limited amount of labeled data. For this reason, in recent years, researchers prefer to fine-tune pre-trained language models more than traditional methods. Fine-tuning a particular task using a pre-trained transformers language model prepared with large amounts of data yields state of the art results in many natural language processing tasks. Because preparing labeled data is a costly process and labeled data is a limited resource, researchers are examining ways to get better results using less data. For this reason, active learning and weak supervision methods were used as data augmentation methods. At the same time, semi-supervised and unsupervised methods have also been studied research topics. The scope of this thesis is to investigate the most effective fine-tuning method for transformers language models using a limited amount of labeled data. Data engineering methods such as cross-language training, weak supervision, back translation, oversampling, active learning have been used for effective fine-tuning. The subject examined in this thesis is detailed on three different natural language processing tasks. These tasks are detecting check-worthy claims, stance detection, and geolocation detection.	en_US
dc.language.iso	tr	en_US
dc.publisher	TOBB ETÜ	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	en_US
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.subject	Doğal dil işleme	en_US
dc.subject	Natural language processing	en_US
dc.title	Dönüstürücü dil modellerine etkili hassas ayar yapmak için veri mühendisligi yöntemleri	en_US
dc.title.alternative	Data engineering methods for effective fine tuning transformers language models	en_US
dc.type	Master Thesis	en_US
dc.department	Enstitüler, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Ana Bilim Dalı	en_US
dc.department	Institutes, Graduate School of Engineering and Science, Computer Engineering Graduate Programs	en_US
dc.identifier.startpage	1	en_US
dc.identifier.endpage	64	en_US
dc.institutionauthor	Zengin, Muhammed Said	-
dc.relation.publicationcategory	Tez	en_US
dc.identifier.yoktezid	752523	en_US
item.fulltext	No Fulltext	-
item.grantfulltext	none	-
item.languageiso639-1	tr	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.openairetype	Master Thesis	-
item.cerifentitytype	Publications	-
Appears in Collections:	Bilgisayar Mühendisliği Yüksek Lisans Tezleri / Computer Engineering Master Theses

Show simple item record

CORE Recommender

Page view(s)

28

checked on May 6, 2024

Google Scholar^TM

Check

Page view(s)

Google ScholarTM

Google Scholar^TM