Illumina Universal adapters were removed and reads were trimmed using Trim Galore [63] with the minimum read length set to 50 bp. The resulting reads were filtered using Kraken [37], as described below in Section 4.3, with a custom database built from the PhiX genome (NCBI Reference Sequence: NC_001422.1). PhiX removal is recommended because PhiX is a common contaminant in Illumina sequencing data [64]. Trimmed non-PhiX reads were used in the subsequent matrix filtering and microbial identification steps.
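The trimming and PhiX-filtering steps can be sketched as two command lines assembled in Python. This is an illustrative sketch under stated assumptions, not the exact invocation used in the study: it assumes single-end reads, the Kraken 1 command-line interface, and a PhiX-only database that has already been built; the file and directory names and the helper function names are hypothetical. Only the 50 bp minimum length, the Illumina adapter choice, and the use of a PhiX database come from the text.

```python
import shlex

MIN_READ_LENGTH = 50  # bp, the minimum read length stated in the text


def trim_galore_cmd(fastq, out_dir):
    """Build a Trim Galore command for adapter removal and trimming.

    Assumes single-end input; paired-end data would need --paired
    and two input files.
    """
    return [
        "trim_galore",
        "--illumina",                       # trim the Illumina universal adapter
        "--length", str(MIN_READ_LENGTH),   # discard reads shorter than 50 bp
        "--output_dir", str(out_dir),
        str(fastq),
    ]


def kraken_phix_filter_cmd(trimmed_fastq, phix_db, non_phix_out, kraken_out):
    """Build a Kraken (v1 CLI, assumed) command against a PhiX-only database.

    Reads left unclassified by a PhiX-only database are the non-PhiX
    reads, so they are written to non_phix_out.
    """
    cmd = ["kraken", "--db", str(phix_db), "--fastq-input"]
    if str(trimmed_fastq).endswith(".gz"):
        cmd.append("--gzip-compressed")
    cmd += [
        "--unclassified-out", str(non_phix_out),
        "--output", str(kraken_out),
        str(trimmed_fastq),
    ]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    trim = trim_galore_cmd("sample.fastq.gz", "trimmed")
    filt = kraken_phix_filter_cmd(
        "trimmed/sample_trimmed.fq.gz", "phix_db",
        "sample.non_phix.fastq", "sample.kraken.txt",
    )
    print(shlex.join(trim))
    print(shlex.join(filt))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` once the tools are installed. Building the commands as lists rather than shell strings avoids quoting problems with unusual file names.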