Quality Assessment and Filtration of NGS Sequencing Data

We used FastQC v0.11.8 (FastQC, RRID:SCR_014583) [12 ] to assess overall sequencing quality for MGI and Illumina sequencing platforms. PCR duplications (reads were considered duplicates when forward read and reverse read of the 2 PE reads were identical) were detected by PRINSEQ v0.20.4 (PRINSEQ, RRID:SCR_005454) [26 (link)]. The random sequencing error rate was calculated by measuring the occurrence of “N” bases at each read position in raw reads. Reads with sequencing adapter contamination were examined according to the manufacturer's adapter sequences (Illumina sequencing adapter left = “GATCGGAAGAGCACACGTCTGAACTCCAGTCAC,” Illumina sequencing adapter right = “GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT,” MGI sequencing adapter left = “AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA,” and MGI sequencing adapter right = “AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG”). We conducted base quality filtration of raw reads using the NGS QC Toolkit v2.3.3 (cut-off read length for high quality 70; cut-off quality score, 20) (NGS QC Toolkit, RRID:SCR_005461) [27 (link)]. We used clean reads after removing low-quality reads and adapter-containing reads for the mapping step.

Free full text: Click here

Kim H.M., Jeon S., Chung O., Jun J.H., Kim H.S., Blazyte A., Lee H.Y., Yu Y., Cho Y.S., Bolser D.M, & Bhak J. (2021). Comparative analysis of 7 short-read sequencing platforms using the Korean Reference Genome: MGI and Illumina sequencing benchmark for whole-genome sequencing. GigaScience, 10(3), giab014.

Publication 2021

FastQC

Filtration

Corresponding Organization : Genome Research Foundation

Top 5 similar protocols

Variable analysis

independent variables

Sequencing platforms (MGI and Illumina)

dependent variables

Overall sequencing quality
PCR duplication rate
Random sequencing error rate
Adapter contamination

control variables

Sequencing adapter left and right sequences for Illumina and MGI platforms
Quality score threshold (20)
Read length threshold (70)

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!