QuickRNAseq: Integrated and Interactive RNA-seq Analysis Report
Overview
It provides the QC metrics and expression tables, including overall read mapping statistics, QC report for a subset of GTEx samples, interactive gene expression visualization and analytical tools to gain insights about the data. Please click for interactive plots.
Team
Shanrong Zhao, Li Xi, Jie Quan, Hualin Xi, Ying Zhang, David von Schack, Michael Vincent and Baohong Zhang
Sequencing Method
RNA-seq was performed using the Illumina TruSeq library construction protocol (non-stranded, polyA+ selection).

Total RNA was quantified using the Quant-iTTM RiboGreen®RNA Assay Kit and normalized to 5 ng per µL. An aliquot of 200 ng for each sample was transferred into library preparation, which was an automated variant of the Illumina Tru SeqTM RNA sample preparation protocol (Revision A, 2010). This method used oligo dT beads to select mRNA from the total RNA sample followed by heat fragmentation and cDNA synthesis from the RNA template. The resultant cDNA then went through library preparation (end repair, base 'A' addition, adapter ligation, and enrichment) using Broad Institute-designed indexed adapters substituted in for multiplexing. After enrichment, the libraries were quantified with qPCR using the KAPA Library Quantification Kit for Illumina Sequencing Platforms and then pooled equimolarly. The entire process was performed in 96-well plates and all pipetting was performed by either Agilent Bravo or Hamilton Starlet liquid handlers with electronic tracking throughout the process in real-time, including reagent lot numbers, specific automation used, time stamps for each process step, and automatic registration.

Pooled libraries were normalized to 2 nM and denatured using 0.1 N NaOH prior to sequencing. Flow cell cluster amplification and sequencing were performed according to the manufacturer’s protocols using either the HiSeq 2000 or HiSeq 2500. Sequencing generated 76bp paired-end reads and an eight-base index barcode read, and was run with a coverage goal of 50M reads (the median achieved was ~82M total reads).

Parallel Plot of QC
Click on the image above to access the interactive plot. Raw Data
Raw Data Files
sample.annotation.txtsample annotation file
star-mapping-summary.txtread mapping summary for STAR run
fc-counting-summary.txtread counting summary for featureCounts run
RSeQC-read-distribution.txtThe distribution of mapped reads along gene elements
Gene-count-table.txtcounting table
Gene-count-table.flt.txtfiltered counting table. A gene is filtered if having 0 read in more than 50% samples
Gene-fpkm-table.txtRPKM table, calculated from Gene-count-table.txt
Gene-fpkm-table.flt.txtfiltered RPKM table, calculated from Gene-count-table.flt.txt
RNASeq-expr-QC.txtcorrelation based QC of expression profile
RNASeq-expr-corr.txtAll-against-all gene expression correlation matrix
RNASeq-snp-corr.txtAll-against-all SNP correlation matrix
RNASeq-merged-metrics.txtMerged RNA-seq metrics from mapping, counting, distribution and QC
read_map_sum.10x7.pngplot of star-mapping-summary.txt. Intended for powerpoint presentation.
read_map_sum.pngplot of star-mapping-summary.txt. Intended for interactive HTML presentation.
read_count_sum.10x7.pngplot of fc-counting-summary.txt. Intended for powerpoint presentation.
read_count_sum.pngplot of fc-counting-summary.txt. Intended for interactive HTML presentation.
read_dist_sum.10x7.pngplot of RSeQC-read-distribution.txt. Intended for powerpoint presentation.
read_dist_sum.pngplot of RSeQC-read-distribution.txt. Intended for interactive HTML presentation.
expr_count_RPKM.10x7.pngplot the number of genes with a give RPKM cutoff. Intended for powerpoint presentation.
expr_count_RPKM.pngplot the number of genes with a give RPKM cutoff. Intended for interactive HTML presentation.
RNASeq-expr-corr.txt.corr.9x7.pngcorrelation plot for RNASeq-expr-corr.txt.Intended for powerpoint presentation.
RNASeq-expr-corr.txt.corr.pngcorrelation plot for RNASeq-expr-corr.txt.Intended for interactive HTML presentation.
RNASeq-snp-corr.txt.corr.9x7.pngcorrelation plot for RNASeq-snp-corr.txt.Intended for powerpoint presentation.
RNASeq-snp-corr.txt.corr.pngcorrelation plot for RNASeq-snp-corr.txt
References
  1. S. Zhao, L. Xi, J. Quan, H. Xi, D. von Schack, M. Vincent, B. Zhang. QuickRNASeq lifts large-scale RNA-seq data analysis to next level of automation and visualization. BMC Genomics. 2016, 17:39.
  2. W. He, S. Zhao, C. Zhang, M.S. Vincent, B. Zhang. QuickRNASeq: User Guide for Pipeline Implementation and for Interactive Results Visualization. Methods in Molecular Biology, 2018, 1751:57-70.
  3. Nils Gehlenborg. Nozzle.R1 Package (2013)
  4. Wang, L., Wang, S., & Li, W. RSeQC: quality control of RNA-seq experiments. Bioinformatics, 2012, 28(16), 2184–2185.
  5. N Gehlenborg, MS Noble, G Getz, L Chin and PJ Park. Nozzle: a report generation toolkit for data analysis pipelines. Bioinformatics 29:1089-1091 (2013)
  6. D3.js - Data-Driven Documents