Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.11851/6906
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Erdoğdu, Utku | - |
dc.contributor.author | Tan, Mehmet | - |
dc.contributor.author | Alhajj, Reda | - |
dc.contributor.author | Polat, Faruk | - |
dc.contributor.author | Rokne, Jon | - |
dc.contributor.author | Demetrick, Douglas | - |
dc.date.accessioned | 2021-09-11T15:44:12Z | - |
dc.date.available | 2021-09-11T15:44:12Z | - |
dc.date.issued | 2013 | en_US |
dc.identifier.issn | 1748-5673 | - |
dc.identifier.issn | 1748-5681 | - |
dc.identifier.uri | https://doi.org/10.1504/IJDMB.2013.056090 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.11851/6906 | - |
dc.description.abstract | Obtaining enough samples for effective analysis and knowledge discovery has been a challenge in the research community, especially in gene expression data analysis. As a result, the approaches developed for data analysis have mostly suffered from a lack of data for training and testing the constructed models. We argue that sample generation can be automated successfully by employing sophisticated machine learning techniques, and that an automated sample generation framework can complement the collection of actual samples from real cases. This paper validates the argument by describing a framework that integrates multiple models (perspectives) for sample generation. We illustrate its applicability to producing new gene expression data samples, a highly demanding area that has not received attention. The three perspectives employed in the process are based on models that are not closely related. This independence eliminates the bias that would arise if the approach covered only certain characteristics of the domain and so produced samples skewed in one direction. The first model is based on the Probabilistic Boolean Network (PBN) representation of the gene regulatory network underlying the given gene expression data. The second model integrates a Hierarchical Markov Model (HIMM), and the third model employs a genetic algorithm. Each model learns as many characteristics of the domain being analysed as possible and tries to incorporate the learned characteristics when generating new samples. In other words, the models base their analysis on domain knowledge implicitly present in the data itself. The developed framework has been extensively tested by checking how the new samples complement the original samples. The results are very promising, showing the effectiveness, usefulness and applicability of the proposed multi-model framework. | en_US |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey (Turkiye Bilimsel ve Teknolojik Arastirma Kurumu, TUBITAK) [110E179] | en_US |
dc.description.sponsorship | This work is partially supported by the Scientific and Technological Research Council of Turkey under Grant No. 110E179. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Inderscience Enterprises Ltd | en_US |
dc.relation.ispartof | International Journal of Data Mining And Bioinformatics | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | gene expression data | en_US |
dc.subject | sample generation | en_US |
dc.subject | multiple perspectives | en_US |
dc.subject | learning | en_US |
dc.subject | HIMM | en_US |
dc.subject | hierarchical markov models | en_US |
dc.subject | genetic algorithms | en_US |
dc.subject | PBN | en_US |
dc.subject | probabilistic boolean networks | en_US |
dc.title | Integrating Machine Learning Techniques Into Robust Data Enrichment Approach and Its Application To Gene Expression Data | en_US |
dc.type | Article | en_US |
dc.department | Faculties, Faculty of Engineering, Department of Computer Engineering | en_US |
dc.department | Fakülteler, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | tr_TR |
dc.identifier.volume | 8 | en_US |
dc.identifier.issue | 3 | en_US |
dc.identifier.startpage | 247 | en_US |
dc.identifier.endpage | 281 | en_US |
dc.authorid | 0000-0003-0509-9153 | - |
dc.authorid | 0000-0002-1741-0570 | - |
dc.identifier.wos | WOS:000324166600001 | en_US |
dc.institutionauthor | Tan, Mehmet | - |
dc.identifier.pmid | 24417021 | en_US |
dc.identifier.doi | 10.1504/IJDMB.2013.056090 | - |
dc.relation.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | en_US |
dc.identifier.scopusquality | Q2 | - |
item.openairetype | Article | - |
item.languageiso639-1 | en | - |
item.grantfulltext | none | - |
item.fulltext | No Fulltext | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
item.cerifentitytype | Publications | - |
crisitem.author.dept | 02.1. Department of Artificial Intelligence Engineering | - |
Appears in Collections: | Department of Computer Engineering; PubMed Indexed Publications Collection; WoS Indexed Publications Collection |
Scopus citations: 4 (checked on Dec 21, 2024)
Web of Science citations: 3 (checked on Oct 5, 2024)
Page views: 100 (checked on Dec 23, 2024)