The largest database of trusted experimental protocols

R is a free and open-source software environment for statistical computing and graphics. R version 2.14.1 is the latest stable release, providing a wide range of statistical and graphical techniques, including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, and clustering.

Automatically generated - may contain errors

10 protocols using r software v2

1

Weighted Genetic Risk Score for PBC

Check if the same lab product or an alternative is used in the 5 most similar protocols
We calculated the wGRS as the weighted sum of risk allele counts, where the weight for each SNP was the natural log of the OR. The OR for each SNP was derived from the discovery dataset. We generated a wGRS_all that included all 26 SNPs, a wGRS_noHLA that included 25 SNPs without the HLA region SNP rs7774434, and a wGRS_HLA that only included rs7774434. The difference in the distribution of wGRS between PBC cases and controls was tested using non-parametric Mann-Whitney test. To examine the discriminant ability of the wGRS, we plotted receiving operating characteristics (ROC) curves and calculated the area under the curve (AUC). An AUC = 1.0 represents a perfect model, while an AUC of 0.5 represents a random model. In addition, we divided the combined cases and control subjects into quartiles based on the wGRS with 26 SNPs. Logistic regression test was carried out to compare the individuals in the second to fourth quartiles to the individuals in the first quartile. Logistic regression, OR estimation, Mann-Whitney test and AUC calculation of the wGRS were conducted in R software v2.15 (http://www.r-project.org/)38 (link). Single SNP association tests in discovery dataset were performed using logistic regression analysis implemented in PLINK v1.07 (http://pngu.mgh.harvard.edu/~purcell/plink/)39 (link). All the association tests were adjusted by the principal components and sex.
+ Open protocol
+ Expand
2

Weighted Genetic Risk Score for PBC

Check if the same lab product or an alternative is used in the 5 most similar protocols
We calculated the wGRS as the weighted sum of risk allele counts, where the weight for each SNP was the natural log of the OR. The OR for each SNP was derived from the discovery dataset. We generated a wGRS_all that included all 26 SNPs, a wGRS_noHLA that included 25 SNPs without the HLA region SNP rs7774434, and a wGRS_HLA that only included rs7774434. The difference in the distribution of wGRS between PBC cases and controls was tested using non-parametric Mann-Whitney test. To examine the discriminant ability of the wGRS, we plotted receiving operating characteristics (ROC) curves and calculated the area under the curve (AUC). An AUC = 1.0 represents a perfect model, while an AUC of 0.5 represents a random model. In addition, we divided the combined cases and control subjects into quartiles based on the wGRS with 26 SNPs. Logistic regression test was carried out to compare the individuals in the second to fourth quartiles to the individuals in the first quartile. Logistic regression, OR estimation, Mann-Whitney test and AUC calculation of the wGRS were conducted in R software v2.15 (http://www.r-project.org/)38 (link). Single SNP association tests in discovery dataset were performed using logistic regression analysis implemented in PLINK v1.07 (http://pngu.mgh.harvard.edu/~purcell/plink/)39 (link). All the association tests were adjusted by the principal components and sex.
+ Open protocol
+ Expand
3

Plasma Carotenoid Profiling and Transcriptome Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
Total plasma carotenoid (micromoles per liter of plasma) concentrations were calculated as the sum of α-carotene, β-carotene, β-cryptoxanthin, lutein, lycopene, and zeaxanthin concentrations. Concentrations of plasma carotenoids are available in Additional file 1: Table S1. R software v2.14.1 (R Foundation for Statistical Computing; http://www.r-project.org) [66 ] was used to compute regressions between normalized gene expression levels of all 18,160 probes and total carotenoids adjusted for the family ID. Weighted gene co-expression network analysis (WGCNA) was performed with gene expression levels of 533 probes showing a significant association (p value ≤ 0.05, obtained from the linear model function) with total carotenoids.
+ Open protocol
+ Expand
4

Exploring Carotenoid-Methylation Associations

Check if the same lab product or an alternative is used in the 5 most similar protocols
Concentrations of plasma carotenoids are available in Supplementary Table S1. Plasma total carotenoid concentrations (µmol/L of plasma) were defined as the sum of α-carotene, β-carotene, β-cryptoxanthin, lutein, lycopene, and zeaxanthin concentrations. Regressions between normalized DNA methylation levels of all 472,245 CpG sites and total carotenoid concentrations adjusted for the family ID were computed using R software v2.14.1 (R Foundation for Statistical Computing; http://www.r-project.org) [36 ]. Plasma total carotenoid concentrations (independent variable) were used to predict DNA methylation levels (dependent variable). A p-value ≤ 0.05 was used to identify significant associations. Regressions were adjusted for family ID (fixed effect) to account for the familial structure. In order to maintain a more exploratory approach, the regressions were not adjusted for other confounding factors. Choices of linear model and confounding factors were made for comparison purposes with similar study by our group [37 (link)]. Weighted gene correlation network analysis (WGCNA) was performed with methylation levels of 20,687 CpG sites showing a significant association (p-value ≤ 0.05) with total carotenoid concentrations. This allowed evaluating co-methylation similarities only in CpG sites that are associated with carotenoids.
+ Open protocol
+ Expand
5

Plasma Carotenoid Levels and Transcriptome Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
Total plasma carotenoid (µmol/L of plasma) concentrations were calculated as the sum of α-carotene, β-carotene, βcryptoxanthin, lutein, lycopene, and zeaxanthin concentrations. Concentrations of plasma carotenoids are available in a previous study [17] . As previously described, R software v2.14.1 (R Foundation for Statistical Computing; http://www.r-project.org) [26] was used to compute linear regressions between gene expression levels of all 18,160 transcripts, DNA methylation levels of all 472,245 CpG sites and total carotenoids adjusted for the family ID [27, 28] .
+ Open protocol
+ Expand
6

Differential miRNA Expression Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
Continuous and categorical variables were compared using Mann-Whitney U and Fisher’s exact tests, respectively. Unsupervised clustering of normalized miRNA levels was performed using classical multidimensional scale (cmdscale) function. Differential analysis of miRNA expression levels was performed using the limma package [29 (link)]. MiRNA levels with fold changes of ≥ 1 (or ≤ -1) and p-values adjusted for false discovery rate (FDR) of ≤ 0.05 were considered significant. The log2(fold change)|>1|is a widely used threshold used to restrict the analysis to miRNA that are at least two fold up or down regulated to identify meaningful and functionally relevant differences in miRNA expressions. FDR-adjusted p-values were calculated using the Benjamini and Hochberg procedure for multiple testing corrections. Heatmaps were generated using the ‘heatmap’ function, where samples and genes were clustered by the hclust function that uses Euclidian distance. Statistical analysis and data visualization were performed using R software V2.15.0 (http://www.R-project.org) and Bioconductor packages.
+ Open protocol
+ Expand
7

Comprehensive Genomic Analysis of Hepatocellular Carcinoma

Check if the same lab product or an alternative is used in the 5 most similar protocols
R software v2.15.0 (http://www.R-project.org) and Bioconductor packages were used for statistical analysis and data visualization. Tests of independence were performed using Chi-square and Fisher’s exact tests. P values were adjusted by Monte Carlo simulation according to Hope et al.40 . The strength of association among gene mutation events was modeled using a binomial logistic regression. We used chi-square tests for trend in proportions to identify genes associated with HCC progression and the Jonckheere-Terpstra test to assess the increase of mutation and CNA numbers along tumor stages. Only genes mutated in ≥3% of cases were included.
Variables associated with overall survival at 60 months were identified using univariate and multivariate Cox proportional hazards regression models (Wald test), using the survival package. Only patients with curative (R0) resection were included in survival analysis (n=216, exclusion of non-curative resections and liver transplantations). Kaplan-Meier plots were used to describe survival rates among all cases.
All reported P values were two-tailed and differences were considered significant when the P value was under 0.05.
+ Open protocol
+ Expand
8

Competing Risk Analysis of Recurrence and Death

Check if the same lab product or an alternative is used in the 5 most similar protocols
Continuous variables were initially explored for normal distribution using the Shapiro-Wilk test. As a normal distribution could not be confirmed for most variables, the variables were expressed as median and IQR. Differences between the groups were compared by the t-test or Mann-Whitney U test. Categorical variables were compared using the χ2 test, Yates’ correction test or Fisher's exact test. The collinearity was assessed by examining the variance inflation factors, and no collinearity was found between the variables. Survival curves were estimated by the Kaplan–Meier method.
Cumulative incidences of recurrence and ICC-related death were estimated while taking into account the competing risk of non-ICC-related death using a competing risk analysis as defined by Fine and Gray [31 ]. Predictive analysis of variables associated with the cause-specific hazard of recurrence and ICC-related death was done using the univariate and multivariate Cox proportional hazard model [31 ].
All statistical analyses were performed using SPSS 19.0 for Windows (SPSS, Chicago, IL) and R software v. 2.15.3 (R Foundation for Statistical Computing, Vienna, Austria; www.r-project.org). All reported p values were two-sided, and p < 0.05 was considered statistically significant.
+ Open protocol
+ Expand
9

Meta-analysis of MGMT Promoter and Outcomes

Check if the same lab product or an alternative is used in the 5 most similar protocols
Time-to-event data (e.g., OS, PFS) were analyzed using the hazard ratios (HRs) with a value less than one indicating favorable outcomes in elderly patients with methylated MGMT promoter or those with TMZ-containing therapies. If the HR was not directly reported, the value was calculated using the Kaplan-Meier survival curves or the methods reported by Tierney et al [17] (link). For proportions, data were computed using the logit transformation formula.[18] (link) The inverse-variance method was used, and the application of either fixed- or random-effect model was based on between-study heterogeneity. The heterogeneity was estimated using Chi2 test and I2 statistic (the percentage of the variability in effect estimates that is due to heterogeneity rather than random error), with Pheterogeneity<0.1 or I2>40% considered to be statistically significant.
Publication bias was assessed by visual impression, and was confirmed by analytic methods such as Egger’s test [19] (link).
Both a subgroups analysis of studies with similar treatment and an interaction analysis between treatment and MGMT promoter status were implemented. A sensitivity analysis based on the assessment of risk of bias was also performed.
All analyses were done in R software v2.15.3 (R Foundation for Statistical Computing, Vienna, Austria) and Review Manager v5.2 (the Cochrane Collaboration, Software Update, Oxford, UK).
+ Open protocol
+ Expand
10

Microbiome and Metabolome Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
All statistical analyses were performed in R software (V 2.15.3) (R Foundation for Statistical Computing, Vienna, Austria) and IBM SPSS 20.0 (IBM Corporation, USA). Analysis of variance (ANOVA) and multiple comparison analysis was applied to calculate the mean and standard deviation and for statistical tests. The differences of parameter variance were estimated by ANOVA and the Kruskal Wallis rank-sum test based on the distribution of parameter statistics. For post-hoc comparison, Tukey’s honest significant differences tests and Wilcoxon matched-pairs signed-rank test were applied. R (V 3.2.2) was employed to perform heat map, PCoA, and hierarchical clustering. The R language did a comparison of microorganisms through the ‘adonis’ function from R’s ‘vegan’ package. The multiple null hypothesis testing of the prior group was achieved using the ‘anosim’ or ‘adonis’ function from R’s ‘vegan’ package. According to the method of Franzosa et al. [52 (link)], we clustered the differentially abundant features via a custom approach. Then, the correlation analysis of the differentially abundant chemical components with the differentially abundant microorganisms was undertaken (For all clustering analyses, we applied Spearman’s rank correlation as a similarity measure with a threshold of r = 0.7). Unless otherwise stated, the significance level was set at p < 0.05.
+ Open protocol
+ Expand

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required

Sign up now

Revolutionizing how scientists
search and build protocols!