In the ab initio method, the software packages Augustus (v2.5.5) and SNAP (v2006‐07‐28) were employed with default settings. Genes with incomplete open reading frames (ORFs) or a protein‐coding length less than 300 bp were filtered out. In the RNA-seq-based method, published gene sets from Aedes aegypti (GCF_002204515.2; from NCBI), Spodoptera frugiperda (GCF_011064685.1; from NCBI), P. japonica (GCA_013421045.1; from NCBI), Anoplophora glabripennis (GCF_000390285.2; from NCBI), Drosophila melanogaster (GCF_000001215.4; from NCBI), Bombyx mori (GCF_014905235.1; from NCBI), Photinus pyralis (GCF_008802855.1; from NCBI), Diabrotica virgifera (GCF_003013835.1; from NCBI), Apis mellifera (GCF_003254395.2; from NCBI), Tribolium castaneum (GCF_000002335.3; from NCBI), and H. axyridis (GCA_011033045.1; from NCBI) were downloaded from NCBI or their own databases and used for homology-based annotation. The longest transcript of each protein-coding gene was aligned to the H. vigintioctomaculata genome using BLAST (tblastn, v2.6.0) with an e-value of 1 × 10−5, and gene structures were predicted using GeneWise (v2.2.0).52 (link) In the RNA‐seq‐based gene approach, the de novo assembled transcripts were aligned to the H. vigintioctomaculata genome using BLAT (v35),45 (link) and PASA (v2.1.0)53 (link) was used to link the spliced alignments. Finally, the gene prediction results were integrated into a final gene set using EVidenceModeler (v1.1.1) software.54 (link)For gene function annotation, the predicted protein-coding genes were searched against the following public databases: Gene Ontology (
Annotating the Repeat-Masked Genome of H. vigintioctomaculata
In the ab initio method, the software packages Augustus (v2.5.5) and SNAP (v2006‐07‐28) were employed with default settings. Genes with incomplete open reading frames (ORFs) or a protein‐coding length less than 300 bp were filtered out. In the RNA-seq-based method, published gene sets from Aedes aegypti (GCF_002204515.2; from NCBI), Spodoptera frugiperda (GCF_011064685.1; from NCBI), P. japonica (GCA_013421045.1; from NCBI), Anoplophora glabripennis (GCF_000390285.2; from NCBI), Drosophila melanogaster (GCF_000001215.4; from NCBI), Bombyx mori (GCF_014905235.1; from NCBI), Photinus pyralis (GCF_008802855.1; from NCBI), Diabrotica virgifera (GCF_003013835.1; from NCBI), Apis mellifera (GCF_003254395.2; from NCBI), Tribolium castaneum (GCF_000002335.3; from NCBI), and H. axyridis (GCA_011033045.1; from NCBI) were downloaded from NCBI or their own databases and used for homology-based annotation. The longest transcript of each protein-coding gene was aligned to the H. vigintioctomaculata genome using BLAST (tblastn, v2.6.0) with an e-value of 1 × 10−5, and gene structures were predicted using GeneWise (v2.2.0).52 (link) In the RNA‐seq‐based gene approach, the de novo assembled transcripts were aligned to the H. vigintioctomaculata genome using BLAT (v35),45 (link) and PASA (v2.1.0)53 (link) was used to link the spliced alignments. Finally, the gene prediction results were integrated into a final gene set using EVidenceModeler (v1.1.1) software.54 (link)For gene function annotation, the predicted protein-coding genes were searched against the following public databases: Gene Ontology (
Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Corresponding Organization : Shaanxi Normal University
Other organizations : Qingdao Agricultural University, Northwestern Polytechnical University, South China Normal University
Variable analysis
- Ab initio annotation
- RNA‐seq‐based annotation
- Homologue‐based annotation
- Gene set annotation
- Repeat sequences were masked, and the repeat-masked genome was used for gene set annotation
- Genes with incomplete open reading frames (ORFs) or a protein‐coding length less than 300 bp were filtered out
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!