Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer. / Lopacinska-Jørgensen, Joanna; Petersen, Patrick H. D.; Oliveira, Douglas V. N. P.; Høgdall, Claus K.; Høgdall, Estrid V.

I: PLoS ONE, Bind 18, Nr. 5, e0282576, 2023.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Lopacinska-Jørgensen, J, Petersen, PHD, Oliveira, DVNP, Høgdall, CK & Høgdall, EV 2023, 'Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer', PLoS ONE, bind 18, nr. 5, e0282576. https://doi.org/10.1371/journal.pone.0282576

APA

Lopacinska-Jørgensen, J., Petersen, P. H. D., Oliveira, D. V. N. P., Høgdall, C. K., & Høgdall, E. V. (2023). Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer. PLoS ONE, 18(5), [e0282576]. https://doi.org/10.1371/journal.pone.0282576

Vancouver

Lopacinska-Jørgensen J, Petersen PHD, Oliveira DVNP, Høgdall CK, Høgdall EV. Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer. PLoS ONE. 2023;18(5). e0282576. https://doi.org/10.1371/journal.pone.0282576

Author

Lopacinska-Jørgensen, Joanna ; Petersen, Patrick H. D. ; Oliveira, Douglas V. N. P. ; Høgdall, Claus K. ; Høgdall, Estrid V. / Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer. I: PLoS ONE. 2023 ; Bind 18, Nr. 5.

Bibtex

@article{94c7a322c9924e359bb457e3d42d4a99,
title = "Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer",
abstract = "MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus which miRNAs should be used aiming standardization. Currently, U6-snRNA is widely adopted as a normalization control in RT-qPCR when investigating miRNAs in EOC; despite its variable expression across cancers being reported. Therefore, our goal was to compare different missing data and normalization approaches to investigate their impact on the choice of stable endogenous controls and subsequent survival analysis while performing expression analysis of miRNAs by RT-qPCR in most frequent subtype of EOC: high-grade serous carcinoma (HGSC). 40 miRNAs were included based on their potential as stable endogenous controls or as biomarkers in EOC. Following RNA extraction from formalin-fixed paraffin embedded tissues from 63 HGSC patients, RT-qPCR was performed with a custom panel covering 40 target miRNAs and 8 controls. The raw data was analyzed by applying various strategies regarding choosing stable endogenous controls (geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder), missing data (single/multiple imputation), and normalization (endogenous miRNA controls, U6-snRNA or global mean). Based on our study, we propose hsa-miR- 23a-3p and hsa-miR-193a-5p, but not U6-snRNA as endogenous controls in HGSC patients. Our findings are validated in two external cohorts retrieved from the NCBI Gene Expression Omnibus database. We present that the outcome of stability analysis depends on the histological composition of the cohort, and it might suggest unique pattern of miRNA stability profiles for each subtype of EOC. Moreover, our data demonstrates the challenge of miRNA data analysis by presenting various outcomes from normalization and missing data imputation strategies on survival analysis. ",
author = "Joanna Lopacinska-J{\o}rgensen and Petersen, {Patrick H. D.} and Oliveira, {Douglas V. N. P.} and H{\o}gdall, {Claus K.} and H{\o}gdall, {Estrid V.}",
note = "Publisher Copyright: {\textcopyright} 2023 Lopacinska-J{\o}rgensen et al.",
year = "2023",
doi = "10.1371/journal.pone.0282576",
language = "English",
volume = "18",
journal = "PLoS ONE",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "5",

}

RIS

TY - JOUR

T1 - Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer

AU - Lopacinska-Jørgensen, Joanna

AU - Petersen, Patrick H. D.

AU - Oliveira, Douglas V. N. P.

AU - Høgdall, Claus K.

AU - Høgdall, Estrid V.

N1 - Publisher Copyright: © 2023 Lopacinska-Jørgensen et al.

PY - 2023

Y1 - 2023

N2 - MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus which miRNAs should be used aiming standardization. Currently, U6-snRNA is widely adopted as a normalization control in RT-qPCR when investigating miRNAs in EOC; despite its variable expression across cancers being reported. Therefore, our goal was to compare different missing data and normalization approaches to investigate their impact on the choice of stable endogenous controls and subsequent survival analysis while performing expression analysis of miRNAs by RT-qPCR in most frequent subtype of EOC: high-grade serous carcinoma (HGSC). 40 miRNAs were included based on their potential as stable endogenous controls or as biomarkers in EOC. Following RNA extraction from formalin-fixed paraffin embedded tissues from 63 HGSC patients, RT-qPCR was performed with a custom panel covering 40 target miRNAs and 8 controls. The raw data was analyzed by applying various strategies regarding choosing stable endogenous controls (geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder), missing data (single/multiple imputation), and normalization (endogenous miRNA controls, U6-snRNA or global mean). Based on our study, we propose hsa-miR- 23a-3p and hsa-miR-193a-5p, but not U6-snRNA as endogenous controls in HGSC patients. Our findings are validated in two external cohorts retrieved from the NCBI Gene Expression Omnibus database. We present that the outcome of stability analysis depends on the histological composition of the cohort, and it might suggest unique pattern of miRNA stability profiles for each subtype of EOC. Moreover, our data demonstrates the challenge of miRNA data analysis by presenting various outcomes from normalization and missing data imputation strategies on survival analysis.

AB - MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus which miRNAs should be used aiming standardization. Currently, U6-snRNA is widely adopted as a normalization control in RT-qPCR when investigating miRNAs in EOC; despite its variable expression across cancers being reported. Therefore, our goal was to compare different missing data and normalization approaches to investigate their impact on the choice of stable endogenous controls and subsequent survival analysis while performing expression analysis of miRNAs by RT-qPCR in most frequent subtype of EOC: high-grade serous carcinoma (HGSC). 40 miRNAs were included based on their potential as stable endogenous controls or as biomarkers in EOC. Following RNA extraction from formalin-fixed paraffin embedded tissues from 63 HGSC patients, RT-qPCR was performed with a custom panel covering 40 target miRNAs and 8 controls. The raw data was analyzed by applying various strategies regarding choosing stable endogenous controls (geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder), missing data (single/multiple imputation), and normalization (endogenous miRNA controls, U6-snRNA or global mean). Based on our study, we propose hsa-miR- 23a-3p and hsa-miR-193a-5p, but not U6-snRNA as endogenous controls in HGSC patients. Our findings are validated in two external cohorts retrieved from the NCBI Gene Expression Omnibus database. We present that the outcome of stability analysis depends on the histological composition of the cohort, and it might suggest unique pattern of miRNA stability profiles for each subtype of EOC. Moreover, our data demonstrates the challenge of miRNA data analysis by presenting various outcomes from normalization and missing data imputation strategies on survival analysis.

U2 - 10.1371/journal.pone.0282576

DO - 10.1371/journal.pone.0282576

M3 - Journal article

C2 - 37141239

AN - SCOPUS:85158866911

VL - 18

JO - PLoS ONE

JF - PLoS ONE

SN - 1932-6203

IS - 5

M1 - e0282576

ER -

ID: 368900380