Paper Title

A machine learning integrated bioinformatics analysis for tissue specific breast cancer gene classification

Authors

Ghazala Sultan , Dr. Swaleha Zubair

Keywords

Breast Cancer, Machine Learning, Supervised Learning, Unsupervised Clustering, Gene Classification

Abstract

Machine learning techniques has been extensively utilized at early stages of biomedical research to analyze large datasets. This study aimed to develop machine learning models with strong prediction power and interpretability for gene classification between normal and cancer samples based on their expression level in different origins of tis-sues. We collected various candidate features from the clinical features of samples and generated filtered relatable features from original features set. Best features were selected through feature evaluation for classification of cancer specific genes. We used 30% of the data as a test dataset and 70% cases of data as a training and validation dataset on 7110 features from epithelial and stromal tissue. To develop the cancer gene prediction model, we considered five ma-chine learning algorithms: Logistic Regression, random forest (RF), support vector machine (SVM) and k-nearest neighbor (KNN) and C5.0. We found that random forest model shows the best learning model that produces the highest validation accuracy. In the random forest model, the classification accuracy of 95%, sensitivity is 0.926, specificity is 0.915, and AUC is 0.970. The developed prediction models show high accuracy, sensitivity, specificity and AUC in classifying among cancerous and healthy samples. This model could be used to predict BRCA in other patients with epithelial or stromal origin cancer. This study suggests that combination of multiple learning models may increase the cancer prediction accuracy.

How To Cite

"A machine learning integrated bioinformatics analysis for tissue specific breast cancer gene classification", IJSDR - International Journal of Scientific Development and Research (www.IJSDR.org), ISSN:2455-2631, Vol.8, Issue 1, page no.93 - 98, January-2023, Available :https://ijsdr.org/papers/IJSDR2301017.pdf

Issue

Volume 8 Issue 1, January-2023

Pages : 93 - 98

Other Publication Details

Paper Reg. ID: IJSDR_203361

Published Paper Id: IJSDR2301017

Downloads: 000347224

Research Area: Science & Technology

Country: Aligarh, Uttar Pradesh, India

Published Paper PDF: https://ijsdr.org/papers/IJSDR2301017

Published Paper URL: https://ijsdr.org/viewpaperforall?paper=IJSDR2301017

About Publisher

ISSN: 2455-2631 | IMPACT FACTOR: 9.15 Calculated By Google Scholar | ESTD YEAR: 2016

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 9.15 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Publisher: IJSDR(IJ Publication) Janvi Wave

Article Preview

academia
publon
sematicscholar
googlescholar
scholar9
maceadmic
Microsoft_Academic_Search_Logo
elsevier
researchgate
ssrn
mendeley
Zenodo
orcid
sitecreex