FastQC

Developed by the Babraham Institute (Babraham Bioinformatics)

Sourced in United Kingdom

FastQC is a quality control tool for high throughput sequence data. It provides a modular set of analyses that can be used to provide a quick overview of whether your data has any problems of which you should be aware before doing any further analysis.
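To make the idea of a single analysis module concrete, here is a minimal Python sketch of a per-position mean quality check. It is not FastQC's own implementation; the function names and the demo reads are made up for illustration, and it assumes four-line FASTQ records with Phred+33 quality encoding.

```python
import io
from statistics import mean


def read_fastq(handle):
    """Yield (header, sequence, quality) from a 4-line-per-record FASTQ stream."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return
        seq = handle.readline().rstrip("\n")
        handle.readline()  # skip the '+' separator line
        qual = handle.readline().rstrip("\n")
        yield header, seq, qual


def mean_quality_per_position(handle, offset=33):
    """Return the mean Phred score at each read position (Phred+33 assumed)."""
    columns = []  # columns[i] collects the scores seen at position i
    for _, _, qual in read_fastq(handle):
        for i, ch in enumerate(qual):
            if i == len(columns):
                columns.append([])
            columns[i].append(ord(ch) - offset)
    return [mean(col) for col in columns]


if __name__ == "__main__":
    # Two hypothetical reads of different length.
    demo = io.StringIO("@r1\nACGT\n+\nIIII\n@r2\nACG\n+\nI5!\n")
    print(mean_quality_per_position(demo))
```

A falling mean toward the 3' end of the reads is the kind of pattern such a check is meant to surface before further analysis.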

Automatically generated - may contain errors

Lab products found in correlation

Multiqc, by Illumina (4 mentions)
Trim galore, by Illumina (4 mentions)
Trimmomatic, by Illumina (3 mentions)
Novaseq control software, by Illumina (2 mentions)
Real time analysis component, by Illumina (2 mentions)
Trueseq stranded mrna ht library preparation kit, by Illumina (2 mentions)
Hiseq 2000, by Illumina (2 mentions)
Bcl2fastq, by Illumina (2 mentions)
Agilent 2100 bioanalyzer, by Agilent Technologies (2 mentions)
Nanodrop 1000 spectrophotometer, by Thermo Fisher Scientific (2 mentions)
Bcl2fastq script, by Illumina (1 mention)
Nextseq 400m high output kit, by Illumina (1 mention)
Superscript 2, by Thermo Fisher Scientific (1 mention)
Hiseq control software (hcs), by Illumina (1 mention)
Truseq stranded mrna sample prep kit, by Illumina (1 mention)
Kapa library quantification kit, by Clinisciences (1 mention)
Casava 1, by Illumina (1 mention)
Ampure xp bead, by Beckman Coulter (1 mention)
Dna 1000 chip, by Agilent Technologies (1 mention)
Acgt101 mir v3, by LC Sciences (1 mention)

67 protocols using FastQC

Evaluating Quality and Contamination in Sequencing Data

Cited 1 time

The technical quality and potential sample contamination in Illumina PE reads were evaluated using FastQC v0.11.8 (FastQC, RRID:SCR_014583) and FastQ Screen v0.11.1 (FastQ Screen, RRID:SCR_000141), respectively. The technical quality of PacBio raw data was checked using the “QC module” in the PacBio SMRT Analysis Software SMRT Link version 8.0 (SMRT-Analysis, RRID:SCR_002942). Iso-Seq reads were clustered into high-quality (accuracy 99.9%, HQ) transcripts using the “Iso-Seq Analysis” Application in PacBio SMRT Analysis Software (SMRT Link v10.1.0.119588). The technical quality of Hi-C data was checked using HiCUP v0.8.0 (HiCUP, RRID:SCR_005569) [53].

Qi W., Lim Y.W., Patrignani A., Schläpfer P., Bratus-Neuenschwander A., Grüter S., Chanez C., Rodde N., Prat E., Vautrin S., Fustier M.A., Pratas D., Schlapbach R, & Gruissem W. (2022). The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features. GigaScience, 11, giac028.
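The 99.9% accuracy figure quoted above can be related to the Phred scale that FastQC reports. The sketch below uses the standard definition Q = -10 · log10(error probability); the function names are illustrative only, and nothing here is taken from the cited pipeline.

```python
import math


def accuracy_to_phred(accuracy):
    """Convert a per-base accuracy (0 < accuracy < 1) to a Phred quality score."""
    if not 0.0 < accuracy < 1.0:
        raise ValueError("accuracy must lie strictly between 0 and 1")
    return -10.0 * math.log10(1.0 - accuracy)


def phred_to_accuracy(q):
    """Convert a Phred quality score back to a per-base accuracy."""
    return 1.0 - 10.0 ** (-q / 10.0)


if __name__ == "__main__":
    # 99.9% accuracy is one expected error per 1,000 bases, i.e. about Q30.
    print(round(accuracy_to_phred(0.999), 2))
    print(phred_to_accuracy(20))
```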

Time-Resolved Transcriptomic Profiling of Human Cells

Raw reads were trimmed to remove low quality base calls and Illumina universal adapters using Trim Galore! (Version 0.6.5) with default parameters and then assessed using fastQC (version 0.11.4) and multiqc (version 1.10.1). Reads were then aligned to the human genome (GRCh38) using STAR with default parameters. Alignment quality control was performed using RSeQC and Qualimap. Quantification was performed using RSEM. Quantification quality control was performed using EDASeq (version 2.3) and NOISeq (version 2.4). Time-course differential expression analysis was performed using msSigPro (version 1.68). Clustering of differential time-course genes was performed by identifying the optimal number of clusters using mclust (version 5.4.1) and then clustering using k-means method. Gene ontology analysis was performed using clusterProfiler (version 4.4.4) where cell-type enrichments utilized MSigDB (version 7.5.1). The code used for the analysis is provided in Additional file 2.

Stransky S., Cutler R., Aguilan J., Nieves E, & Sidoli S. (2022). Investigation of reversible histone acetylation and dynamics in gene expression regulation using 3D liver spheroid model. Epigenetics & Chromatin, 15, 35.
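The first three steps of the protocol above (trim, per-sample QC, aggregate report) could be chained roughly as follows. This is a sketch under assumptions, not the authors' code (which they provide in their Additional file 2): the file names and the `qc` working directory are hypothetical, single-end input is assumed, and only the output-directory options of each tool are used.

```python
import shlex


def qc_commands(raw_reads, trimmed_reads, workdir="qc"):
    """Return argv lists for Trim Galore, FastQC and MultiQC, in run order.

    raw_reads     -- input FASTQ files (hypothetical names)
    trimmed_reads -- files Trim Galore is expected to write; passed in
                     explicitly here rather than inferred from its naming scheme
    """
    trim = ["trim_galore", "--output_dir", workdir, *raw_reads]
    fastqc = ["fastqc", "--outdir", workdir, *trimmed_reads]
    multiqc = ["multiqc", workdir, "--outdir", workdir]
    return [trim, fastqc, multiqc]


if __name__ == "__main__":
    for cmd in qc_commands(["sample.fastq.gz"], ["qc/sample_trimmed.fq.gz"]):
        print(shlex.join(cmd))
        # To execute for real (tools must be installed and on PATH):
        # import subprocess; subprocess.run(cmd, check=True)
```

Building the commands as lists rather than shell strings avoids quoting problems when file names contain spaces.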

Genome Sequencing of Evolved Nitrite Strains

Cited 2 times

We sequenced the genomes of evolved isolates using methodology described elsewhere [20]. Briefly, we streaked each evolved nitrite (NO₂⁻) cross-feeding co-culture onto LB agar plates containing 10 μg ml⁻¹ of gentamicin and 0.1 mM of IPTG and picked a single colony of the nitrite producing and reducing strain (each colony expressed a different fluorescent protein) from each co-culture for genome sequencing. We grew the single clones in LB medium overnight and extracted the DNA with a Wizard Genomic DNA purification kit (Promega, Madison, WI). We then sent the extracted DNA to the ETH Quantitative Genomics Facility (Basel, Switzerland) for sequencing. The genomes were sequenced with an Illumina HiSeq 200 sequencer (Illumina, San Diego, CA) with 100 cycles of paired-end sequencing. Primary data analysis, de-multiplexing and quality control analysis of the sequencing data were performed using FastQC (Illumina, San Diego, CA). We reported the complete set of parameters used for quality control elsewhere [17].

Lilja E.E, & Johnson D.R. (2019). Substrate cross-feeding affects the speed and trajectory of molecular evolution within a synthetic microbial assemblage. BMC Evolutionary Biology, 19, 129.

Quality Assessment and Filtration of NGS Sequencing Data

Cited 2 times

We used FastQC v0.11.8 (FastQC, RRID:SCR_014583) [12] to assess overall sequencing quality for MGI and Illumina sequencing platforms. PCR duplications (reads were considered duplicates when forward read and reverse read of the 2 PE reads were identical) were detected by PRINSEQ v0.20.4 (PRINSEQ, RRID:SCR_005454) [26]. The random sequencing error rate was calculated by measuring the occurrence of “N” bases at each read position in raw reads. Reads with sequencing adapter contamination were examined according to the manufacturer's adapter sequences (Illumina sequencing adapter left = “GATCGGAAGAGCACACGTCTGAACTCCAGTCAC,” Illumina sequencing adapter right = “GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT,” MGI sequencing adapter left = “AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA,” and MGI sequencing adapter right = “AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG”). We conducted base quality filtration of raw reads using the NGS QC Toolkit v2.3.3 (cut-off read length for high quality 70; cut-off quality score, 20) (NGS QC Toolkit, RRID:SCR_005461) [27]. We used clean reads after removing low-quality reads and adapter-containing reads for the mapping step.

Kim H.M., Jeon S., Chung O., Jun J.H., Kim H.S., Blazyte A., Lee H.Y., Yu Y., Cho Y.S., Bolser D.M, & Bhak J. (2021). Comparative analysis of 7 short-read sequencing platforms using the Korean Reference Genome: MGI and Illumina sequencing benchmark for whole-genome sequencing. GigaScience, 10(3), giab014.
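The three checks described in this protocol (identical read pairs, "N" frequency per position, adapter presence) can be sketched in plain Python. This is an illustration only, not PRINSEQ or the authors' scripts: the function names are invented, reads are taken as in-memory strings, and the `min_overlap` prefix length for adapter matching is an assumed parameter.

```python
from collections import Counter

# Adapter sequence as quoted in the protocol text above.
ILLUMINA_ADAPTER_LEFT = "GATCGGAAGAGCACACGTCTGAACTCCAGTCAC"


def duplicate_pairs(pairs):
    """Count read pairs that exactly repeat an earlier (forward, reverse) pair."""
    counts = Counter(pairs)
    return sum(c - 1 for c in counts.values())


def n_rate_per_position(reads):
    """Fraction of reads carrying an 'N' at each 0-based position."""
    longest = max((len(r) for r in reads), default=0)
    n_counts = [0] * longest
    depth = [0] * longest  # number of reads long enough to cover position i
    for read in reads:
        for i, base in enumerate(read.upper()):
            depth[i] += 1
            if base == "N":
                n_counts[i] += 1
    return [n / d for n, d in zip(n_counts, depth)]


def has_adapter(read, adapter, min_overlap=12):
    """True if the read contains the first min_overlap bases of the adapter.

    min_overlap=12 is an assumption for this sketch, not a value from the paper.
    """
    return adapter[:min_overlap] in read.upper()


if __name__ == "__main__":
    print(n_rate_per_position(["ACGN", "NCGT"]))
    print(duplicate_pairs([("ACGT", "TTGA"), ("ACGT", "TTGA"), ("ACGT", "CCGA")]))
    print(has_adapter("TTTT" + ILLUMINA_ADAPTER_LEFT[:15], ILLUMINA_ADAPTER_LEFT))
```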

NextSeq Sequencing and Data Processing

The libraries were sequenced using a 75 bp Illumina NextSeq 400M high output kit (parameters of the sequencing run can be found in Table S2). In addition, 5% PhiX was used as a spike-in control. Illumina’s bcl2fastq script was used to generate the fastq files, which were subsequently quality controlled using FastQC (Andrews, 2010). The data were further filtered, quantified (run with the option --min-reads 1000 to discard sequencing background from the downstream analysis), and sorted using the inDrop analysis pipeline (parameters of the yaml can be found in Table S2).

van Hoogmoed C.G., van der Kuijl-Booij M., van der Mei H.C, & Busscher H.J. (2000). Inhibition of Streptococcus mutans NS Adhesion to Glass with and without a Salivary Conditioning Film by Biosurfactant-Releasing Streptococcus mitis Strains. Applied and Environmental Microbiology, 66(2), 659-663.

RNA-seq Libraries Preparation with TruSeq

The RNA-seq libraries were prepared with the TruSeq® Stranded mRNA sample prep kit (Illumina). Samples depleted of rRNA were fragmented and reverse-transcribed with random hexamers, Superscript II (Life Technologies) and actinomycin D. During the generation of the second strand, dTTP was replaced with dUTP. Double-stranded cDNAs were adenylated at their 3′ ends before ligation with Illumina indexed adapters. Ligated cDNAs were amplified by 15 cycles of PCR and purified with AMPure XP Beads (Beckman Coulter Genomics). Libraries were validated with a DNA1000 chip (Agilent) and quantified with the KAPA Library quantification kit (Clinisciences). Twelve libraries were pooled in equimolar amounts in a single lane and were sequenced on a HiSeq2000 machine, with the single-read protocol (50 nt). Image analysis and base-calling were performed with Illumina HiSeq Control Software and the Real-Time Analysis component. Demultiplexing was performed with Illumina sequencing analysis software (CASAVA 1.8.2). Data quality was assessed with FastQC from the Babraham Institute and the sequencing analysis viewer (SAV) from Illumina software.

Mouammine A., Pages S., Lanois A., Gaudriault S., Jubelin G., Bonabaud M., Cruveiller S., Dubois E., Roche D., Legrand L., Brillard J, & Givaudan A. (2017). An antimicrobial peptide-resistant minor subpopulation of Photorhabdus luminescens is responsible for virulence. Scientific Reports, 7, 43670.

Small RNA Sequencing Data Analysis

Cited 1 time

The raw data were processed with data cleaning analysis using ACGT101-miR v3.5 (LC Sciences, Houston, TX). In brief, the quality of raw data was measured by Illumina FastQC to obtain Q30 data. Clean full-length reads were collected after removing all low-quality reads, adapter contaminants, reads smaller than 18 nt, and junk sequences (≥2 N, ≥7 A, ≥8 C, ≥6 G, ≥7 T, ≥10 Dimer, ≥6 Trimer or ≥5 Tetramer). In addition, the clean data were filtered using various RNA databases, such as mRNA, RFam (release 9.1) and Repbase (version 15.07) databases, and rRNA, scRNA, snoRNA, snRNA, tRNA, etc. were found and removed as much as possible. The remaining unique sequences were mapped to the precursors in miRBase 21.0 by the fast gapped-read alignment software Bowtie 2 [88]. The unique sequences mapping to specific species mature miRNAs in hairpin arms were identified as known miRNAs. The unannotated sRNAs were expanded to about 250 nt and their structures were predicted using Mfold software (http://unafold.rna.albany.edu/?q=mfold). Novel miRNAs were obtained according to Meyers and Li prediction criteria [39, 89].

Yang Z., Zhu P., Kang H., Liu L., Cao Q., Sun J., Dong T., Zhu M., Li Z, & Xu T. (2020). High-throughput deep sequencing reveals the important role that microRNAs play in the salt response in sweet potato (Ipomoea batatas L.). BMC Genomics, 21, 164.
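The length and junk-sequence thresholds listed in this protocol can be expressed as a simple read filter. The sketch below is not ACGT101-miR: it assumes that "≥7 A" and the like refer to runs of consecutive identical bases, which the source does not state explicitly, and it leaves out the dimer, trimer and tetramer repeat rules. All names are illustrative.

```python
import re

MIN_LENGTH = 18        # reads shorter than 18 nt are discarded
MAX_N_EXCLUSIVE = 2    # reads with 2 or more N bases are junk
# Assumed interpretation: minimum run length that marks a read as junk.
HOMOPOLYMER_LIMITS = {"A": 7, "C": 8, "G": 6, "T": 7}

# One compiled pattern per base, e.g. "A{7,}" for seven or more A in a row.
_RUN_PATTERNS = [
    re.compile("%s{%d,}" % (base, limit))
    for base, limit in HOMOPOLYMER_LIMITS.items()
]


def is_clean(read):
    """Return True if the read passes the length, N-count and homopolymer rules."""
    read = read.upper()
    if len(read) < MIN_LENGTH:
        return False
    if read.count("N") >= MAX_N_EXCLUSIVE:
        return False
    return not any(p.search(read) for p in _RUN_PATTERNS)


if __name__ == "__main__":
    reads = ["ACGTACGTACGTACGTACGT", "ACGT", "AAAAAAACGTCGTCGTCGT"]
    print([r for r in reads if is_clean(r)])
```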

Complete Bacterial Genome Assembly

Cited 4 times

Total DNA was purified from overnight culture using Qiagen Genomic-tip 100/G columns (Qiagen, Germantown, MD, United States) per the manufacturer’s instructions. Whole-genome sequencing (WGS) was performed on the Illumina HiSeq 2500 platform and the Oxford Nanopore MinION platform. FastQC (version 0.11.9) and NanoQC (version 0.9.4) were used to assess the quality of short reads generated by Illumina and long reads generated by MinION, respectively. High-quality long reads were assembled de novo using Canu (version 2.1.1) (Koren et al., 2017). Contigs were circularized by Circlator using the following parameters: merge_min_id, 85; merge_breaklen, 1,000; bwa_opts, -x ont2d; assembler, canu (Hunt et al., 2015). High-quality short reads were used to correct circularized contigs with two iterations of Pilon (version 1.24) (Walker et al., 2014) correction and one round of Racon (version 1.4.3) (Vaser et al., 2017) polishing. All programs were run with default parameters unless otherwise specified.

Yao M., Zhu Q., Zou J., Shenkutie A.M., Hu S., Qu J., He Z, & Leung P.H. (2022). Genomic Characterization of a Uropathogenic Escherichia coli ST405 Isolate Harboring blaCTX-M-15-Encoding IncFIA-FIB Plasmid, blaCTX-M-24-Encoding IncI1 Plasmid, and Phage-Like Plasmid. Frontiers in Microbiology, 13, 845045.

Illumina Paired-End Read Processing

The initial quality of paired-end raw reads obtained from the Illumina sequencer was confirmed using the FASTQC (https://www.bioinformatics.babraham.ac.uk/projects/FASTQC/) tool (Illumina). Unwanted regions in the reads (adapters, low-quality reads, and ambiguous bases ‘N’) were trimmed, and high-quality trimmed reads were obtained for further analysis. The reads from each sample were normalized and assembled de novo separately using Trinity [65] (K-mer 25 [GitHub, San Francisco, CA, USA]). Trinity-generated assemblies were clustered based on sequence similarity. Transcripts were clustered using CD-HIT (cluster database at high identity with tolerance [GitHub]) at 95% identity and query coverage to reduce the redundancy without exclusion of sequence diversity. Clustered transcripts were used for further annotation.

Manimekalai R., Selvi A., Narayanan J., Vannish R., Shalini R., Gayathri S, & Rabisha V.P. (2023). Comparative physiological and transcriptome analysis in cultivated and wild sugarcane species in response to hydrogen peroxide-induced oxidative stress. BMC Genomics, 24, 155.

RNA-seq Variant Calling Pipeline

Cited 1 time

Sequence reads passing quality filter from Illumina RTA were first checked using FastQC [64] and then mapped to GENCODE (https://www.gencodegenes.org/) annotation database (V25) and human reference genome (GRCh38.p7) using Tophat2 [65] with a lenient alignment strategy allowing at most two mismatches per read to accommodate potential editing events. The mapped bam files were further QCed using RSeQC [66]. Then, all samples were run through the GATK best practices pipeline of SNV calling (https://gatkforums.broadinstitute.org/gatk/discussion/3892/the-gatk-best-practices-for-variant-calling-on-rnaseq-in-full-detail) using RNA-seq data to obtain a list of candidate variant sites. All known SNPs from dbSNP (V144) [67] were removed from further analyses.

Sharma S., Wang J., Alqassim E., Portwood S., Cortes Gomez E., Maguire O., Basse P.H., Wang E.S., Segal B.H, & Baysal B.E. (2019). Mitochondrial hypoxic stress induces widespread RNA editing by APOBEC3G in natural killer cells. Genome Biology, 20, 37.
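The final step of this protocol, discarding candidate sites that match known SNPs, amounts to a set difference over variant coordinates. The sketch below is a simplified illustration, not the authors' pipeline: the `Variant` record, the example coordinates and the choice to match on chromosome, position, reference and alternate allele are all assumptions.

```python
from typing import Iterable, List, NamedTuple


class Variant(NamedTuple):
    """Minimal variant key (hypothetical layout for this sketch)."""
    chrom: str
    pos: int
    ref: str
    alt: str


def drop_known_snps(candidates: Iterable[Variant],
                    known: Iterable[Variant]) -> List[Variant]:
    """Keep candidates, in their original order, that are absent from the known set."""
    known_set = set(known)
    return [v for v in candidates if v not in known_set]


if __name__ == "__main__":
    # Made-up coordinates, for demonstration only.
    candidates = [Variant("chr1", 100, "C", "T"), Variant("chr2", 250, "G", "A")]
    known = [Variant("chr2", 250, "G", "A")]
    print(drop_known_snps(candidates, known))
```

In practice the known set would be loaded from a dbSNP VCF file rather than built by hand.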
