The largest database of trusted experimental protocols

Exome

Exome: The complete set of protein-coding DNA sequences within a genome.
Exome analysis has become a powerful tool for identifying genetic variants associated with human diseases and traits.
PubCompare.ai offers an AI-driven platform to efficiently locate the best exome analysis protocols from literature, preprints, and patents.
Leverage cutting-edege comparison tools to identify the optimal solutions and products for your exome research needs.
Unlock new insights and accelerate your discoveries with PubCompare.ai.

Most cited protocols related to «Exome»

We leveraged a variety of sources of internal and external validation data to calibrate filters and evaluate the quality of filtered variants (Supplementary Information Table 7). We adjusted the standard GATK variant site filtering37 (link) to increase the number of singleton variants that pass this filter, while maintaining a singleton transmission rate of 50.1%, very near the expected 50%, within sequenced trios. We then used the remaining passing variants to assess depth and genotype quality filters compared to >10,000 samples that had been directly genotyped using SNP arrays (Illumina HumanExome) and achieved 97–99% heterozygous concordance, consistent with known error rates for rare variants in chip-based genotyping38 (link). Relative to a “platinum standard” genome sequenced using five different technologies39 (link), we achieved sensitivity of 99.8% and false discovery rates (FDR) of 0.056% for single nucleotide variants (SNVs), and corresponding rates of 95.1% and 2.17% for insertions and deletions (indels). Lastly, we compared 13 representative Non-Finnish European exomes included in the call set with their corresponding 30x PCR-Free genome. The overall SNV and indel FDR was 0.14% and 4.71%, while for SNV singletons was 0.389%. The overall FDR by annotation classes missense, synonymous and protein truncating variants (including indels) were 0.076%, 0.055% and 0.471% respectively (Supplementary Information Table 5 and 6). Full details of quality assessments are described in the Supplementary Information Section 1.6.
Publication 2016
DNA Chips Europeans Exome Gene Deletion Genome Heterozygote Hypersensitivity INDEL Mutation Insertion Mutation Mutant Proteins Nucleotides Platinum Transmission, Communicable Disease TRIO protein, human
All samples were obtained under institutional IRB approval and with documented informed consent. A complete list of samples is given in Table S2. Whole-exome capture libraries were constructed and sequenced on Illumina HiSeq flowcells to average coverage of 118x. Whole-genome sequencing was done with the Illumina GA-II or Illumina HiSeq sequencer, achieving an average of ~30X coverage depth. Reads were aligned to the reference human genome build hg19 using an implementation of the Burrows-Wheeler Aligner, and a BAM file was produced for each tumor and normal sample using the Picard pipeline6 (link). The Firehose pipeline was used to manage input and output files and submit analyses for execution. The MuTect30 and Indelocator (Sivachenko, A. et al., manuscript in preparation) algorithms were used to identify somatic single-nucleotide variants (SSNVs) and short somatic insertions and deletions, respectively. Mutation spectra were analyzed using non-negative matrix factorization (NMF). Significantly mutated genes were identified using MutSigCV, which estimates the background mutation rate (BMR) for each gene-patient-category combination based on the observed silent mutations in the gene and noncoding mutations in the surrounding regions. Because in most cases these data are too sparse to obtain accurate estimates, we increased accuracy by pooling data from other genes with similar properties (e.g. replication time, expression level). Significance levels (p-values) were determined by testing whether the observed mutations in a gene significantly exceed the expected counts based on the background model. False Discovery Rates (q-values) were then calculated, and genes with q≤0.1 were reported as significantly mutated. Full methods details are listed in Supplementary Information.
Publication 2013
Diploid Cell DNA Replication Exome Gene Deletion Genes Genes, vif Genetic Background Genome, Human Insertion Mutation Multiple Acyl Coenzyme A Dehydrogenase Deficiency Mutation Neoplasms Nucleotides Patients Silent Mutation
To annotate variants with respect to their functional consequences on genes, ANNOVAR needs to download gene annotation data sets (gene/transcript annotations and FASTA sequences) from the UCSC Genome Browser (12 (link)) and save them to local disk. Several different gene annotation systems, including RefSeq genes, UCSC Genes and the Ensembl genes, can be utilized for annotation. The ‘–downdb’ argument can be utilized for downloading necessary files automatically, if the computer is connected to the Internet. The ‘wget’ system command will be utilized for downloading, or the Net::Ftp/LWP::UserAgent modules (standard Perl modules installed in most systems by default) can be alternatively utilized. The users can specify different genome builds, such as hg18 (human), mm9 (mouse) or bosTau4 (cow), as long as they are available from the UCSC Genome Browser annotation databases. When performing gene-based annotations by Ensembl gene definitions (13 (link)), ANNOVAR will download the FASTA sequences from Ensembl as they were not available from the UCSC Genome Browser.
For region-based annotations, ANNOVAR needs to download annotation databases from the various UCSC Genome Browser tables, based on a user-specified track name. Alternatively, users can specify a custom-built annotation database conforming to Generic Feature Format 3 (GFF3), and ANNOVAR can identify variants overlapping with features annotated in the given GFF3 file. For filter-based annotations, for example, comparing mutations to those detected in the 1000 Genomes Project or dbSNP, ANNOVAR will download specific files from the corresponding websites. ANNOVAR can also download pre-computed SIFT scores for all human non-synonymous mutations, to help annotate human exomes by filter-based annotation procedure.
Publication 2010
Exome Gene Annotation Generic Drugs Genes Genitalia Genome Homo sapiens Mice, Laboratory Missense Mutation Mutation
To illustrate the utility of ANNOVAR in identifying causal genes for Mendelian diseases with recessive inheritance, we synthesized a whole-genome data set with ∼4.2 million SNVs and ∼0.5 million indels. These variants include all variants generated by Illumina on a male Yoruba subject (ftp://ftp.sanger.ac.uk/pub/rd/NA18507/) (14 (link)), as well as two known causal mutations for Miller syndrome (G->A mutation at chr16: 70608443 and G->C mutation at chr16: 70612611, representing G152R and G202A in the DHODH gene). We tested the variants reduction procedure on this data set using ANNOVAR, to examine whether we can identify a small subset of candidate genes that include the causal gene DHODH.
To illustrate the utility of ANNOVAR in identifying causal genes for Mendelian diseases with dominant inheritance, we synthesized whole-exome data sets. Since exome data for four Freeman–Sheldon cases were not available to us, we downloaded the exome data for eight HapMap subjects reported in (11 (link)). We then extracted the exome data for the first four subjects, including two Yoruba subjects (NA18507, NA18517) and two European Americans (NA12156 and NA12878). We next added the four known causal mutations to each of the four HapMap subjects (three C–>T mutations at chr17:10485359 and one C–>T mutation at chr17:10485360, representing R672H and R672C mutations in MYH3). We tested whether ANNOVAR can identify MYH3 as the causal gene by examining exomes from these four subjects.
Publication 2010
Chromosome 11p Deletion Syndrome Dihydroorotate Dehydrogenase Europeans Exome Genes Genes, vif Genome HapMap INDEL Mutation Males Mutation Pattern, Inheritance
Total RNA was extracted from EBV transformed lymphoblastoid cell line pellets by the TRIzol reagent (Ambion), and mRNA and small RNA sequencing of 465 unique individuals was performed on the Illumina HiSeq2000 platform, with paired-end 75bp mRNA-seq and single-end 36bp small RNA-seq. Five samples were sequenced in replicate in each of the seven sequencing laboratories. The mRNA and small RNA reads were mapped with GEM31 and miraligner32 (link), respectively, with an average of 48.9M mRNA-seq reads and 1.2M miRNA reads per sample after QC. Numerous transcript features were quantified using Gencode v1233 (link) and miRBase v1834 (link) annotations: protein-coding and lincRNA genes (16,084 detected in >50% of samples), transcripts (67,603; with FluxCapacitor7 (link)), exons (146,498), annotated splice junctions (129,805; analyzed in detail in Ferreira et al. submitted), transcribed repetitive elements (47,409), and mature miRNAs (715). Data quality was assessed by sample correlations and read and gene count distributions, and technical variation was removed by PEER normalization35 (link) for the QTL and miRNA-mRNA correlation analyses11 (link). The samples clustered uniformly both before and after normalization. The genotype data was obtained from 1000 Genomes Phase 1 data set for 421 samples (80× average exome and 5× whole genome read depth), and the remaining 41 samples were imputed from Omni 2.5M SNP array data. Furthermore, we did functional reannotation for all the 1000 Genomes variants using Gencode v12. QTL mapping was done with linear regression, using genetic variants with >5% frequency in 1MB window and normalized quantifications transformed to standard normal. Permutations were used to adjust FDR to 5%. Full details are provided in Supplementary Methods.
Publication 2013
Cell Line, Transformed DNA Replication Exome Exons Genes Genetic Diversity Genome Genotype Long Intergenic Non-Protein Coding RNA MicroRNAs Pellets, Drug Proteins Repetitive Region RNA, Messenger RNA-Seq trizol

Most recents protocols related to «Exome»

Full exon sequencing (WES 1000 g) was performed by Agilent's liquid chip capture system. Genomic DNA extracted from peripheral blood for each sample was fragmented to an average size of 180–280 bp and subjected to DNA library creation using established Illumina paired-end protocols. The Agilent SureSelect Human All ExonV6 Kit (Agilent Technologies, Santa Clara, CA, USA) was used for exome capture according to the manufacturer’s instructions. The Illumina Novaseq 6000 platform (Illumina Inc., San Diego, CA, USA) was utilized for genomic DNA sequencing in Genechem Bioinformatics Technology Co., Ltd (Beijing, China) to generate 150-bp paired-end reads with a minimum coverage of 10 × for ~ 99% of the genome (mean coverage of 100 ×). After sequencing, base call files conversion and demultiplexing were performed with bcl2fastq software (Illumina). The resulting fastq data were submitted to in-house quality control software for removing low quality reads, and then were aligned to the reference human genome (hs37d5) using the Burrows-Wheeler Aligner (bwa), and duplicate reads were marked using Sambamba tools. ANNOVAR software was used to annotate the variants.
Filtering of rare variants was performed as follows: (1) variants with a MAF less than 0.01 in 1000 genomic data (1000g_all), esp6500siv2_all, gnomAD data (gnomAD_ALL and gnomAD_EAS) and in house Genechem-Zhonghua exome database from Genechem; (2) Only SNVs occurring in exons or splice sites (splicing junction 10 bp) are further analyzed since we are interested in amino acid changes. (3) Then synonymous SNVs which are not relevant to the amino acid alternation predicted by dbscSNV are discarded; The small fragment non-frameshift (< 10 bp) indel in the repeat region defined by RepeatMasker are discarded. (4) Variations are screened according to scores of SIFT, Polyphen, MutationTaster and CADD software. The potentially deleterious variations are reserved if the score of more than half of these four software support harmfulness of variations. Sites (> 2 bp) did not affect alternative splicing were removed.
Full text: Click here
Publication 2023
Amino Acids BLOOD DNA Chips DNA Library Exome Exons Frameshift Mutation Genome Genome, Human Homo sapiens INDEL Mutation Strains
To confirm the involvement of DNM1L variants in PD susceptibility, a meta-analysis combining public studies and our case–control study was conducted. In addition to our data, summary data from the PD variant browser were included in the meta-analysis, which comprised four studies: PD Genome Project, International Parkinson's Disease Genomic Consortium (IPDGC) Exomes, IPDGC Resequencing Project, and UK Biobank (19 (link)). Unlike our cohorts, these cohorts mainly contain European populations, which can increase the power to detect the association between DNM1L variants and PD. As no rare variants of DNM1L found in our cohort were seen in included public data, we only validated the significant association between significant common variants of DNM1L and PD in meta-analysis. We used the Hardy-Weinberg equilibrium model to estimate the SNPs in all cohorts and then excluded the variants with deviations in controls (p < 0.05). To assess the strength of the association between DNM1L variants and PD risk, a pooled OR and 95% confidence intervals (CIs) were calculated under five different models (allele, dominant, recessive, heterozygote, and homozygote model). The Cochrane Q-test and I2 statistics were used to assess study heterogeneity and a significant Q-test (p < 0.1 or I2 > 50%) indicated heterogeneity. Fixed- or random-effects models were selected based on the presence or absence of heterogeneity. A Z-test determined the significance of the pooled ORs. We used the FDR method to correct p-values for multiple comparisons for the variant association analysis. A p-value of < 0.05 was considered statistically significant. Moreover, the p-values of Egger's and Begg's tests were calculated to estimate the publication bias. All analyses were performed using the R package “meta.”
Full text: Click here
Publication 2023
Alleles DNM1L protein, human Europeans Exome Genetic Heterogeneity Genome Heterozygote Homozygote Parkinson Disease Population Group Single Nucleotide Polymorphism Susceptibility, Disease Vision
As shown in our previous study (15 (link)), the WES cohort used SureSelect Human All Exon Kit V6 (Agilent) to capture the whole-exome DNA, prepared the sample library, and then used Illumina HiSeq 10× for pair-end 2 × 150 bp sequencing. The average sequencing depth was 123×, achieving a coverage of at least 10× for 99.32% of the target region. WGS was performed using the Illumina Nova Sequencing platform in a pair-end 2 × 150 bp mode, and the average depth of coverage was about 12×. Sequencing data of both groups were processed and analyzed with the BWA-GATK-ANNOVAR pipeline (16 (link), 17 (link)). Quality control was conducted as described in our previous study (15 (link)). Samples would be removed if they had sex discrepancies, abnormal heterozygosity (>3 SD), pathogenic or likely pathogenic variants of PD-related genes, or unusual relatedness (descent > 0.15). In addition, we performed the principal component analysis (PCA) using PLINK v1.90 to assess potential population structure stratification. In subsequent analyses for the WGS cohort, gender, age, and the first five principal components of population stratification were used as covariates, whereas for the WES cohort (the control group included elderly people without neurological diseases), sex and the first five principal components for population stratification were used (18 (link)).
Full text: Click here
Publication 2023
Aged DNA Library Exome Exons Genetic Diversity Heterozygote Homo sapiens Nervous System Disorder pathogenesis
Genomic DNA was extracted from whole blood from all patients except P17, for whom DNA was obtained from SV40-transformed fibroblasts. The whole exome was sequenced at the Genomics Core Facility of the Imagine Institute (Paris, France), the Yale Center for Genome Analysis the New York Genome Center, and The American Genome Center (Uniformed Services University of the Health Sciences, Bethesda, MD, USA), and the Genomics Division–Institute of Technology and Renewable Energies of the Canarian Health System sequencing hub (Canary Islands, Spain), as previously reported (Asano et al., 2021 (link)). The whole-exome sequences of the patients were filtered against the complete International Union of Immunological Societies list of genes (Tangye et al., 2022 (link)), with the retention of variants with an allele frequency below 0.001. We excluded synonymous mutations, downstream, upstream, intron and non-coding transcript variants and intergenic variants. We also excluded variants predicted to be benign and we checked the quality of the exome sequences. The mutation significance cutoff (http://pec630.rockefeller.edu:8080/MSC/) was used to determine whether variants were likely to be damaging.
Full text: Click here
Publication 2023
BLOOD Exome Fibroblasts Genes Genome Introns Mutation Patients Retention (Psychology) Silent Mutation Simian virus 40 Strains
The UK Biobank is a cohort comprising ∼500,000 individuals recruited through the NHS registry at age 40–69 from across the UK. Individuals were were not selected on the basis of having disease, resulting in a broad cross-section of the UK population. For all participants, a number of baseline physical measurements were taken and an extensive questionnaire completed comprising information on lifestyle, demographic and socioeconomic factors. Additionally, blood and urine samples were collected and further tests including a heel-bone ultrasound, bio-impedance, hand-grip strength, spirometry, blood pressure and several cognitive tests were performed. Each individual was further genotyped and exome sequenced. The participants also agreed to ongoing linkage of their medical records [10 (link)].
Full text: Click here
Publication 2023
BLOOD Blood Pressure Calcaneus Cognitive Testing Exome Physical Examination Spirometry Ultrasonics Urine

Top products related to «Exome»

Sourced in United States, China, Germany, United Kingdom, Hong Kong, Canada, Switzerland, Australia, France, Japan, Italy, Sweden, Denmark, Cameroon, Spain, India, Netherlands, Belgium, Norway, Singapore, Brazil
The HiSeq 2000 is a high-throughput DNA sequencing system designed by Illumina. It utilizes sequencing-by-synthesis technology to generate large volumes of sequence data. The HiSeq 2000 is capable of producing up to 600 gigabases of sequence data per run.
Sourced in United States, China, Germany, United Kingdom, Canada, Switzerland, Sweden, Japan, Australia, France, India, Hong Kong, Spain, Cameroon, Austria, Denmark, Italy, Singapore, Brazil, Finland, Norway, Netherlands, Belgium, Israel
The HiSeq 2500 is a high-throughput DNA sequencing system designed for a wide range of applications, including whole-genome sequencing, targeted sequencing, and transcriptome analysis. The system utilizes Illumina's proprietary sequencing-by-synthesis technology to generate high-quality sequencing data with speed and accuracy.
Sourced in United States, China, United Kingdom, Japan, Germany, Canada, Hong Kong, Australia, France, Italy, Switzerland, Sweden, India, Denmark, Singapore, Spain, Cameroon, Belgium, Netherlands, Czechia
The NovaSeq 6000 is a high-throughput sequencing system designed for large-scale genomic projects. It utilizes Illumina's sequencing by synthesis (SBS) technology to generate high-quality sequencing data. The NovaSeq 6000 can process multiple samples simultaneously and is capable of producing up to 6 Tb of data per run, making it suitable for a wide range of applications, including whole-genome sequencing, exome sequencing, and RNA sequencing.
Sourced in United States, China, United Kingdom, Hong Kong, France, Canada, Germany, Switzerland, India, Norway, Japan, Sweden, Cameroon, Italy
The HiSeq 4000 is a high-throughput sequencing system designed for generating large volumes of DNA sequence data. It utilizes Illumina's proven sequencing-by-synthesis technology to produce accurate and reliable results. The HiSeq 4000 has the capability to generate up to 1.5 terabytes of data per run, making it suitable for a wide range of applications, including whole-genome sequencing, targeted sequencing, and transcriptome analysis.
Sourced in United States, Germany, China, United Kingdom, Australia, France, Italy, Canada, Japan, Austria, India, Spain, Switzerland, Cameroon, Netherlands, Czechia, Sweden, Denmark
The NextSeq 500 is a high-throughput sequencing system designed for a wide range of applications, including gene expression analysis, targeted resequencing, and small RNA discovery. The system utilizes reversible terminator-based sequencing technology to generate high-quality, accurate DNA sequence data.
Sourced in United States, China, Germany, Canada, Hong Kong, Italy, United Kingdom, Sweden, Japan, Spain, Australia, Norway, Denmark, Switzerland, India, France, Cameroon, Brazil
The HiSeq 2000 platform is a high-throughput DNA sequencing system designed for large-scale genomic research. It utilizes sequencing-by-synthesis technology to generate high-quality DNA sequence data. The HiSeq 2000 is capable of processing multiple samples simultaneously, producing large volumes of sequencing data in a single run.
Sourced in United States
The TruSeq Exome Enrichment Kit is a laboratory tool designed for targeted sequencing of the human exome. It provides a comprehensive solution for capturing and enriching the protein-coding regions of the human genome.
Sourced in United States, China, Germany, United Kingdom, Australia, Canada, India, Switzerland, Cameroon, Portugal, Brazil, Japan
The HiSeq platform is a high-throughput DNA sequencing system developed by Illumina. The core function of the HiSeq platform is to perform large-scale genomic analysis by generating high-quality sequence data efficiently.
Sourced in Germany, United States, United Kingdom, Netherlands, Spain, Japan, China, Canada, France, Australia, Switzerland, Italy, Belgium, Denmark, Sweden
The DNeasy Blood & Tissue Kit is a DNA extraction and purification kit designed for the efficient isolation of high-quality genomic DNA from a variety of sample types, including whole blood, tissue, and cultured cells. The kit utilizes a silica-based membrane technology to capture and purify DNA, providing a reliable and consistent method for DNA extraction.
Sourced in United States, China, Germany, Canada, Australia, Italy, India, Japan, United Kingdom, Cameroon, Brazil, Norway, Switzerland
The HiSeq 2500 platform is a high-throughput DNA sequencing system designed for a wide range of genomic applications. It utilizes sequencing-by-synthesis technology to generate high-quality sequence data. The HiSeq 2500 platform is capable of producing up to 1 billion sequencing reads per run, making it a powerful tool for researchers and clinicians working in the field of genomics.

More about "Exome"

Exome analysis is a powerful tool for identifying genetic variants associated with human diseases and traits.
The exome refers to the complete set of protein-coding DNA sequences within a genome, which accounts for approximately 1-2% of the entire human genome.
By focusing on the exome, researchers can efficiently and cost-effectively analyze the regions of the genome that are most likely to harbor disease-causing mutations.
Exome sequencing, often performed on platforms like the HiSeq 2000, HiSeq 2500, NovaSeq 6000, HiSeq 4000, and NextSeq 500, has become a widely adopted approach for genetic research and clinical diagnostics.
It allows researchers to identify single nucleotide variants (SNVs), small insertions and deletions (indels), and other genetic variations that may contribute to complex diseases, rare disorders, and inherited traits.
The TruSeq Exome Enrichment Kit is a popular tool used in exome sequencing workflows, allowing for the efficient capture and enrichment of the protein-coding regions of the genome.
Additionally, the DNeasy Blood & Tissue Kit is often used for high-quality DNA extraction, which is a crucial step in the exome sequencing process.
By leveraging the power of the HiSeq platform and other advanced sequencing technologies, researchers can generate high-quality exome data and utilize AI-driven platforms like PubCompare.ai to efficiently locate the best exome analysis protocols from literature, preprints, and patents.
This enables the identification of optimal solutions and products for exome research, ultimately unlocking new insights and accelerating scientific discoveries.