We used Jellyfish [26 (link)] for counting k-mers and obtaining their frequency distributions. With these data, we drew frequency plots using k-mer lengths of 15, 17, 19 and 21. To assign the “true” coverage peak, we compared these plots to identify the peak that changed in height (“heterozygous peak”) and the one that did not (“coverage peak”). The latter was then used to calculate the genome size as the total k-mer number divided by the coverage-peak depth [27 (link)]. Finally, we assembled de novo the reads resulting from the quality filtering step using SOAPdenovo v1.05 [27 (link)] with parameters -K 31 -d 1 -M 1 -F–R. Then, we ran the Assemblathon 2 script [28 (link)] to obtain assembly statistics. Using this script, we compared the genome assemblies of M. galloprovincialis with those of A. californica, L. gigantea, P. fucata, and C. gigas (
De novo Genome Assembly of the Mediterranean Mussel
We used Jellyfish [26 (link)] for counting k-mers and obtaining their frequency distributions. With these data, we drew frequency plots using k-mer lengths of 15, 17, 19 and 21. To assign the “true” coverage peak, we compared these plots to identify the peak that changed in height (“heterozygous peak”) and the one that did not (“coverage peak”). The latter was then used to calculate the genome size as the total k-mer number divided by the coverage-peak depth [27 (link)]. Finally, we assembled de novo the reads resulting from the quality filtering step using SOAPdenovo v1.05 [27 (link)] with parameters -K 31 -d 1 -M 1 -F–R. Then, we ran the Assemblathon 2 script [28 (link)] to obtain assembly statistics. Using this script, we compared the genome assemblies of M. galloprovincialis with those of A. californica, L. gigantea, P. fucata, and C. gigas (
Corresponding Organization : Universidade de Vigo
Other organizations : Johns Hopkins Medicine, Johns Hopkins University, Instituto de Investigacións Mariñas, Consejo Superior de Investigaciones Científicas
Protocol cited in 3 other protocols
Variable analysis
- Sequencing library insert sizes (180 bp, 500 bp, 800 bp)
- Genome assembly statistics (e.g., contig length, N50, etc.)
- Genome size
- DNA extraction from a single mussel
- Sequencing platform (Illumina HiSeq2000)
- Sequencing method (paired-end, 100-bp reads)
- Quality filtering criteria (ambiguous bases, poly-A, low quality scores, adapter contamination, overlapping reads)
- Positive control: None mentioned
- Negative control: None mentioned
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!