Protocol detail

Quantifying Strain and Gene Fitness

Find Similar Protocols

BarSeq reads were converted to a table of the number of times that each bar code was seen in each sample using a custom perl script (MultiCodes.pl). The script requires an exact match to the 8 nucleotides at the beginning of the read that identify the sample (“inline” indexes), or relies on Illumina software for demultiplexing (TruSeq P7 indexes), depending on the primer design (see “BarSeq” above). The script also requires an exact match for the 9 nucleotides upstream of the bar code. We did not check the quality scores for the bar code or the sequence downstream of the bar code (the -minQuality 0 option). However, bar codes that do not match exactly an expected bar code are ignored in later stages of the analysis.
Given a table of bar codes, where they map in the genome, and their counts in each sample, we estimate strain fitness and gene fitness values and their reliability with a custom R script (FEBA.R). Roughly, strain fitness is the normalized log₂ ratio of counts between the treatment sample (i.e., after growth in a certain medium) and the reference “time-zero” sample. Gene fitness is the weighted average of the strain fitness, and a t score is computed based on the consistency of the strain fitness values for each gene. Ideally, the time-zero and treatment samples are sequenced in the same lane. Also, we usually have multiple replicates of any given time zero, with independent extraction of genomic DNA and independent PCR with a different index. We sum the per-strain counts across replicate time-zero samples.

Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link: Access Free Full Text.

Wetmore K.M., Price M.N., Waters R.J., Lamson J.S., He J., Hoover C.A., Blow M.J., Bristow J., Butland G., Arkin A.P, & Deutschbauer A. (2015). Rapid Quantification of Mutant Fitness in Diverse Bacteria by Sequencing Randomly Bar-Coded Transposons. mBio, 6(3), e00306-15.

Publication 2015

Gene Gene fitness Genomic Growth medium Nucleotides Primer Replicate Strain

Top 5 similar protocols

Protocol cited in 35 other protocols

Variable analysis

independent variables

Treatment sample (i.e., after growth in a certain medium)

dependent variables

Strain fitness (normalized log₂ ratio of counts between the treatment sample and the reference 'time-zero' sample)
Gene fitness (weighted average of the strain fitness)
T score (consistency of the strain fitness values for each gene)

control variables

Reference 'time-zero' sample
Multiple replicates of any given time zero, with independent extraction of genomic DNA and independent PCR with a different index

controls

Positive control: Not explicitly mentioned
Negative control: Not explicitly mentioned

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!