Moleküler Biyoloji ve Genetik Bölümü Koleksiyonu

Moleküler Biyoloji ve Genetik Bölümü Koleksiyonu https://hdl.handle.net/20.500.12573/209 Fri, 08 May 2026 17:58:15 GMT 2026-05-08T17:58:15Z Matching variants for functional characterization of genetic variants https://hdl.handle.net/20.500.12573/2541 Matching variants for functional characterization of genetic variants Cevik,Sabiha; Zhao,Pei; Zorluer,Atiyye; Pir, Mustafa S.; Bian, Wenyin; Kaplan, Oktay I. Rapid and low-cost sequencing, as well as computer analysis, have facilitated the diagnosis of many genetic diseases, resulting in a substantial rise in the number of disease-associated genes. However, genetic diagnosis of many disorders remains problematic due to the lack of interpretation for many genetic variants, especially missenses, the infeasibility of high-throughput experiments on mammals, and the shortcomings of computational prediction technologies. Additionally, the available mutant databases are not well-utilized. Toward this end, we used Caenorhabditis elegans mutant resources to delineate the functions of eight missense variants (V444I, V517D, E610K, L732F, E817K, H873P, R1105K, and G1205E) and two stop codons (W937stop and Q1434stop), including several matching variants (MatchVar) with human in ciliopathy associated IFT-140 (also called CHE-11)//IFT140 (intraflagellar transport protein 140). Moreover, MatchVars carrying C. elegans mutants, including IFT-140(G680S) and IFT-140(P702A) for the human (G704S) (dbSNP: rs150745099) and P726A (dbSNP: rs1057518064 and a conflicting variation) were created using CRISPR/Cas9. IFT140 is a key component of IFT complex A (IFT-A), which is involved in the retrograde transport of IFT along cilia and the entrance of G protein-coupled receptors into cilia. Functional analysis of all 10 variants revealed that P702A and W937stop, but not others phenocopied the ciliary phenotypes (short cilia, IFT accumulations, mislocalization of membrane proteins, and cilia entry of nonciliary proteins) of the IFT-140 null mutant, indicating that both P702A and W937stop are phenotypic in C. elegans. Our functional data offered experimental support for interpreting human variants, by using ready-to-use mutants carrying MatchVars and generating MatchVars with CRISPR/Cas9. Sun, 01 Jan 2023 00:00:00 GMT https://hdl.handle.net/20.500.12573/2541 2023-01-01T00:00:00Z Integrating Biological Domain Knowledge with Machine Learning for Identifying Colorectal-Cancer-Associated Microbial Enzymes in Metagenomic Data https://hdl.handle.net/20.500.12573/2540 Integrating Biological Domain Knowledge with Machine Learning for Identifying Colorectal-Cancer-Associated Microbial Enzymes in Metagenomic Data Bakir-Gungor, Burcu; Ersoz, Nur Sebnem; Yousef, Malik Advances in metagenomics have revolutionized our ability to elucidate links between the microbiome and human diseases. Colorectal cancer (CRC), a leading cause of cancer-related mortality worldwide, has been associated with dysbiosis of the gut microbiome. This study aims to develop a method for identifying CRC-associated microbial enzymes by incorporating biological domain knowledge into the feature selection process. Conventional feature selection techniques often evaluate features individually and fail to leverage biological knowledge during metagenomic data analysis. To address this gap, we propose the enzyme commission (EC)-nomenclature-based Grouping-Scoring-Modeling (G-S-M) method, which integrates biological domain knowledge into feature grouping and selection. The proposed method was tested on a CRC-associated metagenomic dataset collected from eight different countries. Community-level relative abundance values of enzymes were considered as features and grouped based on their EC categories to provide biologically informed groupings. Our findings in randomized 10-fold cross-validation experiments imply that glycosidases, CoA-transferases, hydro-lyases, oligo-1,6-glucosidase, crotonobetainyl-CoA hydratase, and citrate CoA-transferase enzymes can be associated with CRC development as part of different molecular pathways. These enzymes are mostly synthesized by Eschericia coli, Salmonella enterica, Klebsiella pneumoniae, Staphylococcus aureus, Streptococcus pneumoniae, and Clostridioides dificile. Comparative evaluation experiments showed that the proposed model consistently outperforms traditional feature selection methods paired with various classifiers. Wed, 01 Jan 2025 00:00:00 GMT https://hdl.handle.net/20.500.12573/2540 2025-01-01T00:00:00Z TextNetTopics-SFTS-SBTS: TextNetTopics Scoring Approaches Based Sequential Forward and Backward https://hdl.handle.net/20.500.12573/2535 TextNetTopics-SFTS-SBTS: TextNetTopics Scoring Approaches Based Sequential Forward and Backward Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, Malik TextNetTopics is a text classification-based topic modeling approach that performs topic selection rather than word selection to train a machine learning algorithm. However, one main limitation of TextNetTopics is that its scoring component (the S component) assesses each topic independently and ranks them accordingly, neglecting the potential relationship between topics. In order to address this limitation and improve the classification performance, this study introduces an enhancement to TextNetTopics. TextNetTopics-SFTS-SBTS integrates two novel scoring approaches: Sequential Forward Topic Scoring (SFTS) and Sequential Backward Topic Scoring (SBTS), which consider topic interactions by assessing sets of topics simultaneously. This integration aims to streamline the topic selection process and enhance classifier efficiency for text classification. The results obtained across three datasets offer valuable insights into the context-dependent effectiveness of the new scoring mechanisms across diverse datasets and varying numbers of topics involved in the analysis. Mon, 01 Jan 2024 00:00:00 GMT https://hdl.handle.net/20.500.12573/2535 2024-01-01T00:00:00Z Classification of Breast Cancer Molecular Subtypes with Grouping-Scoring-Modeling Approach that Incorporates Disease-Disease Association Information https://hdl.handle.net/20.500.12573/2532 Classification of Breast Cancer Molecular Subtypes with Grouping-Scoring-Modeling Approach that Incorporates Disease-Disease Association Information Qumsiyeh, Emma; Bakir-Gungor, Burcu; Yousef, Malik This study uses modern sequencing technology and large biological databases to investigate the molecular intricacies of complicated diseases like cancer. Using gene expression databases and biomarkers, the research aims to improve breast cancer molecular subtype identification for better patient outcomes. Using BRCA LumAB_ Her2Basal dataset, this study compares an integrative machine learning-based strategy (GediNET) to traditional feature selection approaches across machine learning classifiers. GediNET excels at uncovering crucial disease-disease connections and potential biomarkers using the Grouping-Scoring-Modeling (GSM) approach, which favors gene groupings above individual genes. Our comparative analysis highlights GediNET's exceptional performance, notably in terms of accuracy and Area Under the Curve metrics, underscoring its effectiveness in uncovering the genetic intricacies of breast cancer. GediNET's promise to improve disease classification and biomarker identification by improving biological mechanism understanding goes beyond exceeding traditional approaches. The work shows that GediNET's integrative method can promote bioinformatics research by identifying the most informative genes associated with certain diseases, enabling focused and customized medicine. Mon, 01 Jan 2024 00:00:00 GMT https://hdl.handle.net/20.500.12573/2532 2024-01-01T00:00:00Z