Batch effect detection and removal for human liver RNA-Seq with an unsupervised learning approach

Shamima Nasrin; Md. Zahangir Alom; Tarek M. Taha

doi:10.1117/12.2677962

5 October 2023 Batch effect detection and removal for human liver RNA-Seq with an unsupervised learning approach

Shamima Nasrin, Md. Zahangir Alom, Tarek M. Taha

Proceedings Volume 12675, Applications of Machine Learning 2023; 126750N (2023) https://doi.org/10.1117/12.2677962
Event: SPIE Optical Engineering + Applications, 2023, San Diego, California, United States

Abstract

In Bioinformatics, batch effect detection is a challenging task where the clustering approaches have been explored most of the time. In this study, we proposed a novel approach to identify batch effects and visualization with unsupervised analysis methods. We used the most significant gene sets 500,1500, and 2500 genes out of 35238 genes for the human-liver RNA seq dataset by applying standard deviation (SD). The skmeans and kmeans methods were explored on the selected gene subsets. Then, principal component analysis (PCA) was used for embedding to the 10-dimensional subspace. Finally, the Uniform Manifold Approximation and Project (UMAP) was applied to cluster and visualize the outputs. The experimental results demonstrate the robust representation and achieve the best clustering and visualization for features extracted from 1500 genes. These findings are not only useful for batch effect detection and removal tasks but also can be used to label new samples to train the supervised machine learning methods.

Conference Presentation

Citation Download Citation

Shamima Nasrin, Md. Zahangir Alom, and Tarek M. Taha "Batch effect detection and removal for human liver RNA-Seq with an unsupervised learning approach", Proc. SPIE 12675, Applications of Machine Learning 2023, 126750N (5 October 2023); https://doi.org/10.1117/12.2677962

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
PRESENTATION

WATCH
PRESENTATION SAVE TO MY LIBRARY

GET CITATION

KEYWORDS

Machine learning

Biological samples

Databases

Liver

Principal component analysis

Visualization

Bioinformatics

Show All Keywords

RELATED CONTENT

Human activity recognition by smartphones regardless of device orientation
Proceedings of SPIE (February 18 2014)

Real time detection of patient head position and cephalometric landmarks...
Proceedings of SPIE (April 04 2022)

A 25 reader performance study for hepatic metastasis detection ...
Proceedings of SPIE (April 04 2022)

Image segmentation evaluation for very-large datasets
Proceedings of SPIE (March 24 2016)

Automated clustering of EM side channel emissions to detect anomalous...
Proceedings of SPIE (April 24 2020)

Spectral images browsing using principal component analysis and set partitioning...
Proceedings of SPIE (November 23 2011)

Content based retrieval using MPEG 7 visual descriptor and hippocampal...
Proceedings of SPIE (December 06 2005)

Subscribe to Digital Library

Receive Erratum Email Alert

Show All Keywords

Keywords/Phrases

Search In:

Publication Years