Comparative Study of Transcriptomic Profiling and Functional Enrichment in Ovarian Cancer Cell Lines

Authors Affiliation(s)

¹Department of Computational Biology and Bioinformatics, Kariavattom campus, Thiruvananthapuram, Kerala 695581, INDIA

Can J Biotech, Volume 1, Special Issue, Page 65, DOI: https://doi.org/10.24870/cjb.2017-a52

Presenting author: tripathinisha84@gmail.com

Abstract

High-throughput cDNA sequencing (RNA-seq) has emerged as a sophisticated tool for transcriptomic studies, especially for identifying differentially expressed genes (DEGs) and quantifying transcripts across sample groups or conditions. Several pipelines and tools are available for this task, but there is still no general consensus on the protocol to be used for the analysis. In this comparative study, transcriptomic profiling of ovarian cancer cell line data sets was carried out using two different pipelines: the ‘Tuxedo’ protocol (TopHat, Cufflinks-Cuffdiff, CummeRbund) and the ‘new Tuxedo’ protocol (HISAT, StringTie, DESeq2). Both were used to estimate transcript abundances and to analyse differential expression. The ‘new Tuxedo’ protocol was found to be faster and more efficient than the ‘Tuxedo’ protocol: run times on a PC with 8 GB RAM were ~2 hr and ~6 days, respectively. A total of 613 and 371 DEGs were obtained with the ‘Tuxedo’ and ‘new Tuxedo’ pipelines, respectively. Functional profiling was then performed through a comparative study of high-throughput functional enrichment tools (ClueGO, DAVID, Enrichr, FunRich, g:Profiler, GSEA, PANTHER and WebGestalt) to identify the functions and pathways of the most enriched genes in ovarian cancer cell lines. The biological pathways and Gene Ontology (GO) terms common to all tools were extracted, together with their shared genes, to obtain the most enriched genes with their GO functional terms. The biological pathways and GO categories (Biological Process and Molecular Function) of the most enriched gene sets involved in ovarian cancer cell lines were thereby characterized.
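The step of extracting terms and genes common to all enrichment tools can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the input layout (one dictionary per tool, mapping a term name to its set of genes) and the case-insensitive matching of term names are assumptions made for illustration, and the tool labels and gene identifiers are placeholders rather than results from the study.

```python
def normalize(term: str) -> str:
    """Lower-case a term and collapse whitespace so spelling variants match."""
    return " ".join(term.lower().split())


def common_terms(results_by_tool: dict[str, dict[str, set[str]]]) -> dict[str, set[str]]:
    """Return the terms reported by every tool, each mapped to the genes
    that all tools agree on for that term."""
    # Re-key each tool's results by the normalized term name.
    normalized = [
        {normalize(term): set(genes) for term, genes in terms.items()}
        for terms in results_by_tool.values()
    ]
    if not normalized:
        return {}
    # Terms present in every tool's output.
    shared = set.intersection(*(set(d) for d in normalized))
    # For each shared term, keep only the genes reported by every tool.
    return {t: set.intersection(*(d[t] for d in normalized)) for t in sorted(shared)}


# Placeholder data: hypothetical tools, terms and gene identifiers.
toy = {
    "tool_a": {"Cell cycle": {"GENE1", "GENE2", "GENE3"}, "Apoptosis": {"GENE4"}},
    "tool_b": {"cell cycle": {"GENE2", "GENE3"}, "DNA repair": {"GENE5"}},
    "tool_c": {"CELL CYCLE": {"GENE2", "GENE3", "GENE6"}, "Apoptosis": {"GENE4"}},
}

print(common_terms(toy))
```

In practice, term names differ between tools in more than letter case, so matching on stable identifiers (for example GO accession numbers) would likely be more reliable than matching on names as done here.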

References

Trapnell, C., Roberts, A., Goff, L., Pertea, G., Kim, D., Kelley, D.R., Pimentel, H., Salzberg, S.L., Rinn, J.L. and Pachter, L. (2012) Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc 7: 562-578.
Pertea, M., Kim, D., Pertea, G.M., Leek, J.T. and Salzberg, S.L. (2016) Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc 11: 1650-1667.
Conesa, A., Madrigal, P., Tarazona, S., Gomez-Cabrero, D., Cervera, A., McPherson, A., et al. (2016) A survey of best practices for RNA-seq data analysis. Genome Biol 17: 13.
Huang, D.W., Sherman, B.T. and Lempicki, R.A. (2008) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37: 1-13.
Mi, H., Huang, X., Muruganujan, A., Tang, H., Mills, C., Kang, D. and Thomas, P.D. (2017) PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res 45: D183-D189.
Bindea, G., Mlecnik, B., Hackl, H., Charoentong, P., Tosolini, M., Kirilovsky, A., et al. (2009) ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25: 1091-1093.
Dennis, G., Sherman, B.T., Hosack, D.A., Yang, J., Gao, W., Lane, H.C. and Lempicki, R.A. (2003) DAVID: database for annotation, visualization, and integrated discovery. Genome Biol 4: P3.