Enhancing grouping-scoring-modeling (G-S-M) approach through a statistical pre-scoring component: A case study for high-dimensional transcriptomic data analysis

KHOKHAR, MAHAM

dc.contributor.author	KHOKHAR, MAHAM
dc.date.accessioned	2025-04-10T14:11:02Z
dc.date.available	2025-04-10T14:11:02Z
dc.date.issued	2024	en_US
dc.date.submitted	2024-06-10
dc.identifier.uri	https://hdl.handle.net/20.500.12573/2465
dc.description.abstract	Rapid advancements in transcriptomic technologies have significantly increased the volume of data available for analysis, which presents challenges in terms of efficiency and computational demand. This thesis introduces a Pre-Scoring component to the Grouping-Scoring-Modeling (G-S-M) framework to address inefficiencies caused by the excessive number of gene groups generated by traditional GSM. By selectively prioritizing gene groups based on their statistical significance, this innovation aims to reduce the computational demands associated with scoring these groups using machine learning models, thereby streamlining the analysis process. Assessed across nine diverse Gene Expression datasets, the Pre-Scoring G-S-M framework not only maintained accuracy comparable to the traditional approach but did so with significantly fewer genes. This refinement conserves resources while maintaining the robustness and reliability of the data analysis, crucial for advancing research in personalized medicine and therapeutic strategies. The findings suggest that the modified G-S-M framework serves as a valuable tool in bioinformatics, offering a more efficient approach to handling large-scale genomic datasets. Future work will focus on adapting this enhanced framework to incorporate diverse types of omics knowledge, such as proteomics and metabolomics, further optimizing its performance to broaden its applicability in both clinical and research settings	en_US
dc.description.abstract	Transkriptomik teknolojilerdeki hızlı ilerlemeler, analiz için kullanılabilir veri miktarını önemli ölçüde artırmış, bu da verimlilik ve hesaplama talepleri açısından zorluklar oluşturmuştur. Bu tez, geleneksel GSM tarafından üretilen aşırı sayıdaki gen gruplarından kaynaklanan verimsizlikleri ele almak için Gruplandırma-Puanlama- Modelleme (G-S-M) çerçevesine bir Ön-Puanlama bileşeni tanıtmaktadır. İstatistiksel öneme göre seçici bir şekilde gen gruplarını önceliklendirerek, bu yenilik, bu grupların makine öğrenimi modelleri kullanılarak puanlanmasıyla ilişkili hesaplama taleplerini azaltmayı hedeflemekte ve böylece analiz sürecini daha verimli hale getirmektedir. Dokuz çeşitli Gen İfadesi veri seti üzerinde değerlendirildiğinde, Ön Puanlama G-S- M çerçevesi, geleneksel yaklaşımla karşılaştırılabilir doğrulukta performans göstermekle kalmamış, aynı zamanda önemli ölçüde daha az gen ile bunu başarmıştır. Bu iyileştirme, kişiselleştirilmiş tıp ve tedavi stratejilerinde araştırmaları ilerletmek için hayati olan veri analizinin sağlamlığını ve güvenilirliğini korurken kaynakları korur.	en_US
dc.language.iso	eng	en_US
dc.publisher	Abdullah Gül Üniversitesi / Sosyal Bilimler Enstitüsü	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Gene Selections	en_US
dc.subject	Machine Learning	en_US
dc.subject	Grouping Scoring Modeling	en_US
dc.subject	Transcriptomics	en_US
dc.subject	Feature Selection	en_US
dc.subject	Gen Seçimi	en_US
dc.subject	Makine Öğrenimi	en_US
dc.subject	Gruplandırma Puanlama Modelleme	en_US
dc.subject	Transkriptomik	en_US
dc.subject	Özellik Seçimi	en_US
dc.title	Enhancing grouping-scoring-modeling (G-S-M) approach through a statistical pre-scoring component: A case study for high-dimensional transcriptomic data analysis	en_US
dc.title.alternative	Istatistiksel ön puanlama bileşeni ile gruplama puanlama modellemesi (GSM) yaklaşımın geliştirilmesi: Yüksek boyutlu transkriptomik veri analizi için bir vaka çalışması	en_US
dc.type	masterThesis	en_US
dc.contributor.department	AGÜ, Sosyal Bilimler Enstitüsü, İşletme ve Ekonomi İçin Veri Bilimi Ana Bilim Dalı	en_US
dc.relation.publicationcategory	Tez	en_US

Files in this item

Name:: 875300.pdf
Size:: 1.332Mb
Format:: PDF
Description:: Yüksek Lisans Tezi

View/Open

This item appears in the following Collection(s)

Veri Bilimi Anabilim Dalı Tez Koleksiyonu [7]

Show simple item record