The largest database of trusted experimental protocols
> Physiology > Genetic Function > Polygenic Traits

Polygenic Traits

Polygenic Traits: Multifactorial characteristics determined by the combined effects of multiple genes, rather than a single gene.
These complex traits, which include common diseases and quantitatice phenotypes, are influenced by both genetic and environmental factors.
PubCompare.ai's cutting-edge AI-driven research protocol optimization can help you easily locate the best protocols and products acrosss literature, pre-prints, and patents to enhance reproducibility and drive your polygenic trait research forward.

Most cited protocols related to «Polygenic Traits»

Mixed linear models are an emerging method of choice when conducting association mapping in the presence of sample structure, including geographic population structure, family relatedness and/or cryptic relatedness1 (link)–12 (link). The basic approach is to build a genetic relationship matrix (GRM) modeling genome-wide sample structure, estimate its contribution to phenotypic variance using a random-effects model (with or without additional fixed effects), and compute association statistics that account for this component of phenotypic variance (Online Methods). We note that mixed linear models can also be used to estimate components of heritability explained by genotyped markers13 (link)–14 (link), and to predict complex traits using genetic data15 (link)–16 (link).
Mixed linear model association (MLMA) methods are effective in preventing false-positive associations due to sample structure in studies of humans and model organisms1 (link)–6 (link). In particular, simulations show that the correction for confounding is nearly perfect for common variants even when geographic population structure, which is a fixed effect, is modeled as a random effect based on overall covariance6 (link),17 (link)–19 (link) (however, rare variants pose a greater challenge for all methods, due to differential confounding of rare and common variants20 (link)). MLMA methods also provide an increase in power, by applying a correction that is specific to sample structure1 (link)–6 (link). In the case of geographic population structure, markers with large allele frequency differences between populations will receive a larger correction. In the case of relatedness structure, the contribution of related individuals to test statistics will be reduced, preventing overweighting of redundant information due to correlation structure.
An underappreciated point is that MLMA can also increase power in studies without sample structure, by implicitly conditioning on associated loci other than the candidate locus that are not genome-wide significant in the data being analyzed8 (link). For example, a GRM computed from all markers can be used to approximate the set of causal markers (implicitly assuming that all markers are causal), but this approximation can be generalized. The increase in power scales with the ratio N/M of the number of samples (N) to the effective number of independent markers (M), since the information about unknown associated loci depends on the number of samples. In simulations of a quantitative trait with no sample structure and no LD between markers (Online Methods), application of MLMA instead of linear regression increased average −log10P-values at causal markers from 2.89 to 2.94 (1.8% increase) when N=10,000 and M=100,000, and from 2.92 to 3.46 (18% increase) when N=10,000 and M=10,000. We note that this improvement is contingent on the exclusion of the candidate marker from the GRM (see below).
Publication 2014
CFC1 protein, human Genetic Structures Genome Homo sapiens Phenotype Polygenic Traits Population Group Reproduction
Details on the cohorts, phenotype measures, genotyping, quality-control filters, and association models are provided in Supplementary Note and Supplementary Table 2 to 5. As shown in Figure 3, there is substantial overlap in samples across the three GWAS meta-analyses.
All analyses were based on autosomal SNPs from cohorts with genotypes imputed against the 1000 Genomes reference panel. The input files in each meta-analysis were subject to a uniform set of quality-control and diagnostic procedures. These are described in the previous SSGAC study11 (link) and Supplementary Note.
As expected under polygenicity23 (link), we observe inflation of the median test statistic in each GWAS (λGC,DEP = 1.36, λGC,NEUR = 1.24, λGC,SWB = 1.28; Supplementary Figure 4, Supplementary Table 6). The intercept estimates from LD score regression are all below 1.02, however, suggesting that nearly all of the observed inflation is due to polygenic signal14 (link) (Supplementary Figure 5). When we report GWAS results, as in the SSGAC study11 (link) we account for the potential bias due to this small amount of stratification by inflating the standard errors of our GWAS estimates by the square root of the LD score regression intercept.
Manhattan plots from each of the GWAS meta-analyses are shown in Supplementary Figures 6a, b, and c. Our NEUR meta-analysis was based on the same cohort-level data as the SSGAC study11 (link) and unsurprisingly yielded substantively identical results: 10 lead SNPs. Consistent with what studies have reported for other complex traits, the increased discovery samples for DEP and SWB relative to the SSGAC study increased the number of lead SNPs: from 2 to 32 for DEP (Neff = 149,707 to 354,862) and from 3 to 13 for SWB (N = 298,420 to 388,538). Applying bivariate LD score regression6 (link) to the GWAS results, we estimate the genetic correlations to be 0.72 (s.e. = 0.026) between DEP and NEUR, −0.67 (s.e. = 0.027) between NEUR and SWB, and −0.69 (s.e. = 0.024) between DEP and SWB (Supplementary Table 7). The intercepts from each of these regressions are found in Supplementary Table 8. Lead SNPs with a P value less than 10−5 from the GWAS for each trait are listed in Supplementary Table 9.
Publication 2017
Genome Genome-Wide Association Study Genotype MLL protein, human Phenotype Plant Roots Polygenic Traits Reproduction Single Nucleotide Polymorphism Tests, Diagnostic
We cleaned and harmonized 963 publicly available GWAS summary-level datasets from 36 consortia, which included 82 diseases, 154 complex traits, 576 metabolites and 151 immune markers (Hemani et al, in preparation).
From this database pool, we chose datasets that fit the following selection criteria:

Non-sex-stratified

Meta-analyses of predominantly European populations. We include a few GWAS meta-analyses that contain a small proportion of non-European individuals in them in the LD Hub database. Whilst we believe the effect of these small numbers of non-European individuals on the LD Score regression analyses will be relatively minor, users should be aware that results from these meta-analyses may be less robust because of inconsistent patterns of linkage disequilibrium between individuals of different ancestry. In order to flag these studies to the user, we have included an additional field in the Test Center and the GWAShare Center (last column) that indicates the population ancestry of individuals in the corresponding meta-analysis, as well as a similar field in the LD Score regression results file (see also Table S1).

Meta-analyses using a GWAS backbone chip only (i.e. exclude meta-analyses involving immuno | metabo | psych | exome chip or GWAS + custom chip)

Number of SNPs is large (N > 450 000)

Number of individuals is large (N > 5000)

Mean Chi-square of the test statistics is larger than 1

As shown in Figure 2, after filtering on the selection criteria, genome-wide results for 173 traits were included in LD Hub, of which 18 are GWAS of diseases (Boraska et al., 2014 (link); Cross-Disorder Group of the Psychiatric Genomics Consortium, 2013 (link); Lambert et al., 2013 (link); Liu et al 2015 (link); Moffatt et al., 2007 (link); Morris et al., 2012 (link); Neale et al., 2010 (link); Nikpay et al., 2015 (link); Okada et al., 2013 (link); Paternoster et al., 2015 (link); Ripke et al., 2012 (link); Ripke et al., 2014 (link); Simon-Sanchez et al., 2009 (link); Sklar et al., 2011 (link)), 48 are medically relevant risk factors/complex traits (Benyamin et al., 2013 (link); Berndt et al., 2013 (link); Bradfield et al., 2012 (link); Dastani et al., 2012 (link); de Moor et al., 2010 (link); Dupuis et al., 2010 (link); Estrada et al., 2012 (link); Furberg et al., 2010 (link); Horikoshi et al., 2012 (link); Huffman et al., 2015 (link); Lango Allen et al., 2010 (link); Manning et al. 2012 (link); Moffatt et al., 2007 (link); Pattaro et al., 2016 ; Perry et al., 2014 (link); Rietveld et al., 2014 (link); Rietveld et al., 2013 (link); Saxena et al., 2010 (link); Shungin et al., 2015 (link); Soranzo et al., 2010 (link); Speliotes et al., 2010 (link); Taal et al., 2012 (link); Teslovich et al., 2010 (link); Teumer et al., 2016 (link); van den Berg et al., 2014 (link); van der Valk et al., 2014 (link)) and 107 are metabolites (Kettunen et al., 2016 ). Table S1, displays descriptive information for each of the GWAS in LD Hub, including, trait name, consortium name, ethnicity, gender, sample size, PubMed ID, year of publication and other relevant information.
Publication 2016
DNA Chips Ethnicity Europeans Exome Gender Genome Genome-Wide Association Study Mental Disorders Polygenic Traits Single Nucleotide Polymorphism Vertebral Column
We cleaned and harmonized 963 publicly available GWAS summary-level data sets from 36 consortia, which included 82 diseases, 154 complex traits, 576 metabolites and 151 immune markers (Hemani et al, in preparation).
From this database pool, we chose datasets that fit the following selection criteria:

Non-sex-stratified

Meta-analyses of predominantly European populations. We include a few GWAS meta-analyses that contain a small proportion of non-European individuals in them in the LD Hub database. Whilst we believe the effect of these small numbers of non-European individuals on the LD Score regression analyses will be relatively minor, users should be aware that results from these meta-analyses may be less robust because of inconsistent patterns of linkage disequilibrium between individuals of different ancestry. In order to flag these studies to the user, we have included an additional field in the Test Center and the GWAShare Center (last column) that indicates the population ancestry of individuals in the corresponding meta-analysis, as well as a similar field in the LD Score regression results file (see also Table S1).

Meta-analyses using a GWAS backbone chip only (i.e. exclude meta-analyses involving immuno | metabo | psych | exome chip or GWAS + custom chip)

Number of SNPs is large (N>450,000)

Number of individuals is large (N>5,000)

Mean Chi-square of the test statistics is larger than 1

As shown in Figure 2, after filtering on the selection criteria, genome-wide results for 173 traits were included in LD Hub, of which 18 are GWAS of diseases, 48 are medically relevant risk factors/complex traits and 107 are metabolites. Table S1, displays descriptive information for each of the GWAS in LD Hub, including, trait name, consortium name, ethnicity, gender, sample size, PubMed ID, year of publication, and other relevant information.
Publication 2016
DNA Chips Ethnicity Europeans Exome Gender Genome Genome-Wide Association Study Polygenic Traits Single Nucleotide Polymorphism Vertebral Column
Our tests relate to previously defined estimators of genetic correlation and covariance between traits. We consider two definitions of genetic covariance at a locus: 1) the covariance between the genetic component of expression and the genetic component of trait; 2) the covariance between the causal effect sizes for expression and the causal effect-sizes for trait. Under assumptions of independent effect-sizes, these definitions yield asymptotically identical quantities37 (link). Assuming a substantially large training set where the genetic component of expression can be perfectly predicted, the direct TWAS tests for a significant association between the genetic component of expression and the trait; equivalent to testing definition #1 for a polygenic trait. Likewise, the summary-based TWAS tests for a significant sum of products of the causal expression effect sizes and the causal trait effect sizes; equivalent to definition #2 up to a scaling factor. The TWAS approach therefore fits naturally with the broader study of shared genetic etiology of multiple phenotypes. At the sample sizes evaluated in this study, the TWAS approach is substantially better powered than an LD-based estimate of local genetic correlation (Supplementary Note).
Publication 2016
Gene Components Multiple Birth Offspring Phenotype Polygenic Traits Reproduction Seizures

Most recents protocols related to «Polygenic Traits»

The intracellular MAP load at 2 h and 7 d p. i. and the expression of NO-, EREG, Gal9, and C3 were the quantitative phenotypes analyzed. The variance components and h2 explained by all the SNPs were calculated using the genome-wide complex trait analysis (GCTA) software 1.93.2, according to the following formula
where σG2 Is the variance explained by all the SNPs and σe2 is the residual variance (28 (link)).
Publication 2023
EREG protein, human Genome LGALS9 protein, human MAP2 protein, human Phenotype Polygenic Traits Protoplasm Single Nucleotide Polymorphism
As an exploratory analysis, we estimated PPS for biomarkers of lipid metabolism.29 (link) The PPS estimates the sum of additive genetic effects across all alleles that affect the biomarkers of lipid metabolism at the patient level. We used the PPS to explore a potential genetic link between lipid metabolism, ALS, and survival time by assessing (1) how much of the variance in biomarker levels at diagnosis can be explained by genetic profile scores and (2) whether the genetic profile score itself is associated with overall survival time. Because PPS does not change over time,30 (link) a statistical association between the genetic profile score and survival may be evidence of abnormal lipid levels caused by genetic variation or hold potential for therapeutic interventions.30 (link) Moreover, their time invariance allowed us to estimate the link between the genetic profile score and overall survival time, defined as time between symptom onset and death.
For all individuals who were enrolled in both our population-based registry and our latest genome-wide association study (GWAS),5 (link) we calculated the PPS. PPS was based on summary statistics from a GWAS on biomarker levels of lipid metabolism in the UK Biobank.31 For each single-nucleotide polymorphism, we calculated a weight for each biomarker using the summary-BavesR module in the Genome-Wide Complex Trait Bayesian analysis toolkit (default parameters)29 (link) and a linkage-disequilibrium matrix originating from 50,000 unrelated individuals of inferred European ancestries included in the UK Biobank. Because the genotype data originated from several different cohorts in the ALS GWAS, we scaled the PPS per GWAS cohort to a mean of zero and a standard deviation of 1. Linear regression models were used to calculate how much of the variance in the biomarker level was explained by their PPS (expressed as adjusted R2); 95% confidence intervals were obtained by means of bootstrapping. Simple univariable Cox models for overall survival time (i.e., from onset to death) were used to estimate HRs.
Publication 2023
Alleles Biological Markers Diagnosis Europeans Genes, vif Genetic Diversity Genetic Profile Genome Genome-Wide Association Study Genotype Lipid A Lipid Metabolism Lipids Metabolism Patients Polygenic Traits Reproduction Single Nucleotide Polymorphism Therapeutics
Polygenic risk scores (PRS) were calculated for EUR subjects. Genotype imputation was performed on the Michigan Imputation Server [69 (link)] using the TOPMed reference panel. Quality control included filtering for genotyping rate > 0.99, sample missingness < 0.01, Hardy-Weinberg Equilibrium P > 1 × 10−6, minor allele frequency > 1%, and imputation score > 0.8. GCTA v1.93.31 [70 (link)] was used to compute the genetic relationship matrix; a relatedness cutoff of 0.05 was applied. Genetic variants in the major histocompatibility complex (MHC) locus were excluded prior to PRS calculation. PRS were generated using summary statistics from the largest available genome-wide association study of schizophrenia [10 (link)] (40,675 cases, 64,643 controls) and adult brain surface area [71 ] (33,992 subjects) in PRS-CS [72 ], applying the LD reference panel from the 1000 Genomes Project phase 3 samples and the recommended global shrinkage parameter for highly polygenic traits (phi = 1e−2). Analyses assessing the interaction between C4 expression and PRS were restricted to youth with common C4 structural haplotypes (AL, AL-AL, AL-BL, AL-BS, BS), resulting in a final sample of 3730 EUR youth (female N = 1737; male N = 1993) and 7,715,663 genetic variants.
Publication 2023
Adult Brain Females Genetic Diversity Genome Genome-Wide Association Study Haplotypes Major Histocompatibility Complex Males Polygenic Traits Reproduction Schizophrenia Strains Youth

Protocol full text hidden due to copyright restrictions

Open the protocol to access the free full text link

Publication 2023
Genes Genome Genome-Wide Association Study Phenotype Polygenic Traits Single Nucleotide Polymorphism
To facilitate investigations of nominated risk variants, members of IPDGC have created a PD GWAS locus browser (https://pdgenetics.shinyapps.io/GWASBrowser/) that makes relevant statistics and datasets available to the public30 (link). Throughout the hackathon, our team continued the development of this browser through the addition of new datasets and features. To identify secondary association signals at each locus from the Nalls et al. 2019 study, we performed conditional analysis using the Genome-wide Complex Trait Analysis (GCTA) tool31 (link),32 (link). Locus zoom plots were added to display the results of this conditional analysis (Fig. 4b)33 . Power calculations were done for each risk variant by Nalls et al. 2019 to determine if the findings were sufficiently powered. To do so, we followed methods used by the Genetic Association Study Power Calculator tool (https://csg.sph.umich.edu/abecasis/gas_power_calculator/), using summary statistics from Nalls et al. 2019, a disease prevalence of 0.01, and a significance level of 0.05 as input. We queried blood gene expression data included in the AMP PD version 2.5 release to measure expression levels in PD cases and controls. We obtained TPM expression at baseline for samples that had case or control status and no PD mutations in whole-genome sequencing data, leaving a total of 1710 samples. Expression data for each gene was displayed in a violin plot and added to the expression section of the browser (Fig. 4b). The literature section of the browser was updated to display a description, PubMed hit count, and word cloud plot for each gene within 1 MB of a PD risk variant. Our last addition to the browser was a display of user statistics. We used the googleAnalyticsR package34 to record and visualize the number of visits for the browser and each risk variant within a period specified by the user (Fig. 4b).
Publication 2023
BLOOD Gene Expression Genes Genetic Association Studies Genome Genome-Wide Association Study Mutation Polygenic Traits

Top products related to «Polygenic Traits»

Sourced in United States, Canada
The BovineSNP50 BeadChip is a lab equipment product designed for genetic analysis of bovine samples. It provides a comprehensive set of single nucleotide polymorphisms (SNPs) for bovine genome-wide association studies and other genetic research applications.
Sourced in United States
The Human610-Quad BeadChip is a laboratory equipment product designed for genome-wide association studies. It features a high-density array of probes that can be used to genotype up to 610,000 single nucleotide polymorphisms (SNPs) in a single experiment.
The Infinium HD Ultra protocol is a laboratory equipment product designed for high-throughput genotyping. It provides a streamlined workflow for DNA sample preparation and analysis. The core function of this protocol is to enable efficient and accurate genotyping of genetic variants across the genome.
Sourced in United States, Austria, Japan, Belgium, United Kingdom, Cameroon, China, Denmark, Canada, Israel, New Caledonia, Germany, Poland, India, France, Ireland, Australia
SAS 9.4 is an integrated software suite for advanced analytics, data management, and business intelligence. It provides a comprehensive platform for data analysis, modeling, and reporting. SAS 9.4 offers a wide range of capabilities, including data manipulation, statistical analysis, predictive modeling, and visual data exploration.
Sourced in United States
The Human1M is a laboratory instrument designed for genetic analysis. It is capable of processing and analyzing large genomic datasets. The core function of the Human1M is to perform high-throughput genotyping and sequencing of human DNA samples.
Sourced in United States, United Kingdom, Germany, Canada, Japan, Sweden, Austria, Morocco, Switzerland, Australia, Belgium, Italy, Netherlands, China, France, Denmark, Norway, Hungary, Malaysia, Israel, Finland, Spain
MATLAB is a high-performance programming language and numerical computing environment used for scientific and engineering calculations, data analysis, and visualization. It provides a comprehensive set of tools for solving complex mathematical and computational problems.
Sourced in United States
The HumanExome BeadChip-12v1_A is a high-throughput genotyping array designed for comprehensive coverage of human exonic regions. It targets over 240,000 genetic variants found primarily in protein-coding regions of the genome. This product is intended for research use only.
Sourced in United States, United Kingdom
GenomeStudio software is a data analysis tool designed for Illumina microarray and sequencing platforms. It provides a unified interface to visualize, analyze, and interpret genomic data generated from Illumina instruments.
Sourced in United States, Germany, Iceland
The Infinium Global Screening Array is a high-throughput genotyping platform designed by Illumina. It provides comprehensive coverage of genetic variation across diverse global populations. The array is optimized for genome-wide association studies and population genetics research.
Sourced in United States
The Genome-Wide Human SNP Array 6.0 platform is a high-density genotyping microarray developed by Thermo Fisher Scientific. It is designed to analyze genetic variations, specifically single nucleotide polymorphisms (SNPs), across the human genome. The array contains probes for over 906,600 unique SNP markers, providing a comprehensive coverage of the genome.

More about "Polygenic Traits"

Polygenic traits are complex, multifactorial characteristics that are influenced by the combined effects of multiple genes, rather than a single gene.
These traits, which include common diseases and quantitative phenotypes, are affected by both genetic and environmental factors.
Researchers can leverage cutting-edge AI-driven research protocol optimization tools like PubCompare.ai to easily locate the best protocols and products across literature, preprints, and patents, enhancing reproducibility and driving their polygenic trait research forward.
Techniques like the BovineSNP50 BeadChip, Human610-Quad BeadChip, and Infinium HD ultra protocol can be used to genotype and analyze genetic variants associated with polygenic traits.
Statistical software such as SAS 9.4 and MATLAB can be employed to perform complex data analysis and modelling.
The Human1M and HumanExome BeadChip-12v1_A platforms provide high-density genome-wide coverage, while the GenomeStudio software and Infinium Global Screening Array enable comprehensive data visualization and interpretation.
Genome-Wide Human SNP Array 6.0 is another powerful tool that can be utilized to investigate the genetic architecture of polygenic traits.
By combining these cutting-edge technologies with PubCompare.ai's AI-driven research optimization, scientists can enhance the reproducibility and efficiency of their polygenic trait studies, ultimately accelerating our understanding of these complex, multifactorial characteristics.