The largest database of trusted experimental protocols

19 protocols using newbler 2

1

Comparative Viral Metagenome Assembly Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
Four different assembly methods, exhaustive iterative assembly (Schurch et al., 2014 (link)), CLC Genomics Workbench 6.0.4 assembler (CLC bio, Aarhus, Denmark), Genovo version 0.4 (Laserson et al., 2011 (link)), and Newbler 2.5 (Roche), were compared in their efficiency of detecting viral reads in the three metagenome datasets. The originally used method was iterative exhaustive assembly. Iterative exhaustive assembly of sequences is part of a virus discovery pipeline written in the python programming language (Python 2.7) that includes trimming of reads and initial assembly with Newbler (454GS Assembler version 2.7, Roche), with standard parameters. Trimmed reads and initial contigs were subjected to assembly by CAP3 (VersionDate: 12/21/07) (Huang and Madan, 1999 (link)) with standard parameters. The resulting singletons and contigs were iteratively assembled by CAP3 until no new contigs were formed.
Subsequently, the trimmed reads were mapped back to the identified taxonomic units with Newbler (454 GSMapper version 2.7, Roche) with standard parameters (Schurch et al., 2014 (link)). CLC Genomics Workbench 6.0.4 assembler (CLC bio, Aarhus, Denmark) was run with the previously trimmed reads with automatic bubble and word size. Genovo version 0.4 was run with 40 iterations and otherwise default values (Laserson et al., 2011 (link)). Newbler 2.5 (Roche) was run with default values.
+ Open protocol
+ Expand
2

Sequencing and Analysis of Ectromelia Virus

Check if the same lab product or an alternative is used in the 5 most similar protocols
Two µg of viral DNA, obtained from semi-purified viral stocks as described [34 (link)], were used for the construction of a library using the GS-FLX Titanium system (454 Life Sciences, Roche, Branford, CT, USA) and sequenced with a FLX Genome Sequencer in the Scientific Park of Madrid, Spain. Reads were de novo assembled and also mapped to ECTV-M or ECTV-N reference genomes with Newbler 2.5.3 (Roche Diagnostics, Branford, CT, USA). For Illumina sequencing, five µg of viral DNA of ECTV-M was used to construct a TruSeq library and reads obtained with a Genome Analyzer IIx in the Scientific Park of Madrid, Spain. These Illumina reads were mapped to ECTV-M genome (AF012825.2) as reference using Bowtie2 with default parameters (http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml; Version 2.1.0) (Table 2). A PCR amplification of the region containing the Direct Repeat III (DRIII) region of ECTV-MK was carried out as described [27 (link)]. The product was analyzed in a 2% agarose gel with a 100 bp ladder marker (Invitrogen) considering the most intense band for the estimation of the number of repeats.
+ Open protocol
+ Expand
3

Whole Genome Sequencing Using 454 Platform

Check if the same lab product or an alternative is used in the 5 most similar protocols
WGS was performed using the Roche 454 platform (Roche, Indianapolis, IN, USA) by the Biotechnology Core Facilities Branch, Genomic Sequencing Laboratory (Atlanta, GA, USA). Sequencing libraries were prepared per manufacturers protocols. Contigs were assembled de novo using either Newbler 2.5.3 (Roche) or CLC Genomics Workbench 7.0.3 (Qiagen).
Raw sequence reads were imported into CLC Genomics Workbench 7.0.3 and sequencing QC, raw read mapping, and alignments performed. Sequencing QC reports were to ensure raw read files for each strain had equivalent read length distribution, GC content, overrepresentation of sequence reads, and individual 5-mer distribution. Read mapping parameters included a mismatch cost of 2, insertion cost of 3, deletion cost of 3, length fraction of 0.5, and similarity fraction of 0.8. Reads that mapped to more than 1 region of the reference were allowed to map randomly.
+ Open protocol
+ Expand
4

De Novo Assembly of Antarctic RNA Viruses

Check if the same lab product or an alternative is used in the 5 most similar protocols
Primer sequences were removed from 5 0 and 3 0 ends using the adaptor-clipping tool of BIOPIECES (http://code.-google.com/p/biopieces/). PRINSEQ suite of tools (Schmieder & Edwards 2011) were used for quality fil-tering with the next parameters: -derep 124 -range_len 50-'virome mode + standard deviation' -lc_method entropy -lc_threshold 50 -ns_max_p 1 -ns_max_n 3 -trim_qual_left 20 -trim_qual_right 25 -trim_qual_type mean -trim_qual_window 2 -trim_qual_step 1 -min_-qual_mean 25. Sequences were assembled de novo under strict parameters (97% of minimum identity along 90% of read length) with NEWBLER 2.5.3 (Roche) and CLC GENO-MIC WORKBENCH 5 (http://www.clcbio.com/). The fulllength genomes of four Antarctic RNA viruses were obtained using information from both assemblers and joining contigs by PCR amplification and Sanger sequencing (Table S2, Supporting information). The 3 0 ends were PCR-amplified with specific and poly-T primers and sequenced with Sanger technology.
Taxonomic binning of reads and contigs was based on the best hit of BLASTX searches (E-value <0.001) against the GenBank nr protein database and TBLASTX searches (E-value <0.001) against reference viral genomes (4602 reference viral genomes downloaded from the GenBank on the 10th of April 2013) using in-house computational resources.
+ Open protocol
+ Expand
5

De novo and Reference-based Rhizobium Genome Assembly

Check if the same lab product or an alternative is used in the 5 most similar protocols
Roche Newbler 2.3 assembler was run using the command line (runAssembly) option with 90% sequence identity and 40-bp minimum overlap as parameters to perform de novo assembly of each of the Rhizobium genomes. Shell scripts were written to run runAssembly on multiple datasets. GSMapper 2.3 with 90% sequence identity and 40-bp minimum overlap was used to perform reference-based assembly of each genome using 305 core genes from strain 3841 as the reference genes. These 305 genes were those shown to be common to all chromid-bearing bacteria analysed by Harrison et al. [18 (link)]. Shell scripts were written to run runMapper on multiple datasets. Nucleotide information of 305 core genes was extracted from every draft genome using a Perl script, and a shell script was used to merge this information with their respective genes present in fully sequenced Rhizobium genomes. Each of the 305 files was aligned at nucleotide level by MUSCLE [22 (link)] that was run locally on the University of York Biology Linux grid. Each alignment file was checked and gaps were added for strains that had no reads for a given gene. The final results of FASTA alignments were concatenated by strain to form a 305-gene alignment using Galaxy [23 ]. A 100-gene alignment was also created using only those genes that were represented in every isolate by at least one read of at least 100 nucleotides.
+ Open protocol
+ Expand
6

Multi-platform Genome Assembly Protocol

Check if the same lab product or an alternative is used in the 5 most similar protocols
The 454 reads were assembled using Newbler 2.3 (Roche) with default parameters. The Illumina assembly comprised of Illumina paired-end was obtained by using SOAPdenovo software48 (link) (http://soap.genomics.org.cn) with default parameters. The contigs longer than 100 bp in the Illumina assembly were combined together with 454 reads for the final assembly using Newbler with default parameters.
+ Open protocol
+ Expand
7

Novel Microbial Metabolite Discovery via eDNA

Check if the same lab product or an alternative is used in the 5 most similar protocols
Previously archived soil eDNA cosmid libraries were probed to recover BGCs predicted to encode novel MBAs. Construction, PCR screening with barcoded A-domain degenerate primers, amplicon sequencing and read processing for these cosmid libraries have been described in detail previously.22 (link) Using A-domain sequences from the three known MBAs as references, the eSNaPD (environmental Surveyor of Natural Product Diversity) software package was used to identify similar sequences among A-domain amplicon sequences generated from archived metagenomic libraries. The library well locations of hits found in this analysis were identified using the barcode parsing functionality of the eSNaPD software package. Clones associated with select eDNA A-domain hits were then recovered from the appropriate library wells using a previously described dilution PCR strategy.22 (link) Recovered cosmids were sequenced using a MiSeq Reagent Nano Kit v2 on a MiSeq sequencer (Illumina) and the resulting reads were assembled into contigs using Newbler 2.6 (Roche). Assembled complete and partial BGCs were analyzed using antiSMASH v5.1.2 and our manual in-house NRP predictor to predict the substrate specificity of each A-domain. In this analysis building blocks were predicted by comparing the Stachelhaus code of predicted A-domain to that of A-domains from known NRPs.
+ Open protocol
+ Expand
8

CSFV Genome Sequencing and Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
FastQC [27 ] was applied for pre-trimming evaluation of the Fastq files containing the raw sequence reads. Trimming and primer removal were performed using cut-adapt and prinseq-lite [28 (link)]. Fastq files were error-corrected using RC454 [29 (link)] with the CSFV strain “Koslov” nucleotide sequence (GenBank HM237795; [26 (link)]) as reference. The de novo assembly was performed using Newbler 2.6 (Roche software, Basel, Switzerland). Each error-corrected Fastq file was aligned using the BWA-MEM [30 ] algorithm or Mosaik [31 (link)]. Subsequently, the libraries were post-processed with Samtools [32 (link)] and SNPs were called using lofreq [33 (link)]. Downstream SNP effect analysis was performed with snpEff [34 (link)] with no fraction cut-off set. SNP plots were made in R. dN/dS and πN/πS analyses were performed by taking the filtered SNP calls and running them through SNPGenie [35 (link)] with a SNP cut-off set at 1%. Statistical analysis was performed with Graphpad Prism using the T-test with Holm–Sidak correction for multiple comparisons when appropriate.
+ Open protocol
+ Expand
9

Bioprospecting Soil Metagenomic Libraries

Check if the same lab product or an alternative is used in the 5 most similar protocols
In this study, previously archived soil eDNA cosmid libraries were probed to recover cyclomarin/rufomycin-like BGCs. Construction, PCR screening with barcoded A-domain degenerate primers, amplicon sequencing and read processing for these cosmid libraries have been described in detail previously.17 (link)-19 (link) Using the eDNA-derived AD01-p-Trp-like hits in the well-defined clade as references, these amplicon sequences were then analyzed by our previously developed bioinformatic platform eSNaPD (environmental Surveyor of Natural Product Diversity) software package,32 (link) thus generating a panel of p-Trp-like hits from cosmid libraries. The library well locations for targeted hits were identified by the barcode parsing functionality of the eSNaPD software. Then, specific primers targeting each unique sequence of interest were designed manually (Table S1). Single cosmids were recovered from library wells of interest using a serial dilution PCR strategy described previously.18 (link),19 (link) The recovered cosmids were sequenced using a MiSeq Reagent Nano Kit v2 on a MiSeq sequencer (Illumina). Then, sequence reads were assembled into contigs using Newbler 2.6 (Roche). The final assembled BGCs were analyzed using antiSMASH 5.0 to predict the amino acid specificity of each A-domain domain.33 (link)
+ Open protocol
+ Expand
10

Genome Sequencing of Bradyrhizobium Strains

Check if the same lab product or an alternative is used in the 5 most similar protocols
Library generation and genome sequencing of both strains were performed at “Darcy Fontoura de Almeida” Computational Genomics Unity (UGCDFA) of the National Laboratory of Scientific Computation (LNCC) (Petrópolis, Rio de Janeiro, Brazil). The genomes were sequenced using a whole-genome shotgun strategy, with a combination of Roche 454 GS-FLX shotgun and 3 kb-insert paired-end libraries. Libraries were prepared following GS FLX Titanium series protocols. 454 sequence reads were assembled using both Newbler 2.6 (454 Life Sciences, Roche Diagnostics Corporation, Branford, CT) and Celera (WGS, version 7.0) assemblers. The two assemblies were aligned to each other with Cross Match (Phred/Phrap/Consed package) since both yielded results that can be complementary and effective to close gaps. Sanger sequences previously obtained for strain CPAC 15 at Embrapa Soja (Brazil) [41 ] were also joined to the pyrosequence data. For closing gaps, a primer walking strategy was used, PCR products were Sanger sequenced at Embrapa Soja. All the consensus sequences of each contig and the sequence that closed each gap were aligned and joined using Consed (version 20.0). Information for scaffolding and gap closure was also obtained by mapping each assembly against the reference genomes of B. diazoefficiens USDA 110T (BA000040.2) and B. japonicum USDA 6T (AP012206.1).
+ Open protocol
+ Expand

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required

Sign up now

Revolutionizing how scientists
search and build protocols!