The largest database of trusted experimental protocols

52 protocols using statistics and machine learning toolbox

1

Statistical Analysis of Direction Recognition Index

Check if the same lab product or an alternative is used in the 5 most similar protocols
To determine which changes in the value of the direction recognition index between particular measurement situations should be considered significant, we completed a statistical analysis of the obtained data. For this purpose, a Wilcoxon test (equivalent to the Mann-Whitney U test) was used. The calculations were performed using MATLAB R2017b (version 9.3) with the Statistics and Machine Learning Toolbox (MathWorks Inc., Natick, MA, USA).
+ Open protocol
+ Expand
2

Ensemble of Decision Trees for Stimulus Labeling

Check if the same lab product or an alternative is used in the 5 most similar protocols
We trained an ensemble of 100 decision trees using the fitcensemble function from the Statistics and Machine Learning Toolbox from MathWorks. Decision tree models are simple, highly interpretable, and can be displayed graphically (James et al., 2013 ). Consequently, the decision process of the classifier can be easily interrogated. However, decision tree models have two key disadvantages: (1) mediocre prediction performance (Caruana and Niculescu-Mizil, 2006 ) and (2) high variance due to overfitting (James et al., 2013 ). Both disadvantages can be mitigated by aggregating an ensemble of decision trees. Empirical comparison of classification models shows that ensembles of decision trees outperform other classification algorithms across a variety of problem sets (Caruana and Niculescu-Mizil, 2006 ).
We used a bootstrap aggregation (bag) method for constructing the ensembles. Each tree in the ensemble is trained on a boot-strapped replica of the data—each replica is a random selection of the data with replacement. The predictions from the ensemble model are determined by a majority vote from each individual tree prediction. We trained the ensemble to learn the stimulus labels (TNF, Pam3CSK4, CpG, LPS, and poly(I:C)) from either the entire set of predictors (all 918 metrics, Table S6A) or a subset of predictors termed “signaling codons” (Table S6B).
+ Open protocol
+ Expand
3

Cognitive Test Reliability and Validity

Check if the same lab product or an alternative is used in the 5 most similar protocols
Within the manuscript, convergent validity, and test-retest reliability for the CGN_ICA test is shown with Pearson’s Correlation. P-values for Pearson’s correlation are based on a Student’s t distribution. Calculations are done using MathWorks’ statistics and machine learning toolbox (https://www.mathworks.com/help/stats/index.html).
To measure dependency of the cognitive tests with level of education, we used explained variance, defined as the square of Pearson’s Correlation between participants’ cognitive score and their level of education (i.e. number of years). Here the statistical significance was obtained by a permutation test (10,000 permutations of participants). To formally assess statistical independence, we used a non-parametric independence test, proposed by Gretton and Gyorfi62 , based on 10,000 bootstrap resampling of participants.
Finally, we used a single factor analysis of variance (ANOVA) to compare average CGN_ICA scores for participants who had taken the CGN_ICA test every other day for two weeks. The goal was to see if the mean CGN_ICA scores are significantly different at any given day.
+ Open protocol
+ Expand
4

Quantifying Thermal Ablation Outcomes

Check if the same lab product or an alternative is used in the 5 most similar protocols
All results were statistically analyzed using MATLAB 2016a and its Statistics and Machine Learning Toolbox (Mathworks, Natick, MA, USA). Linear regression was employed to assess the relationship between predicted IRE areas and ablated areas obtained by H&E staining. The slope and coefficient of determination (R2) were calculated for different probabilities of cell death. Bland-Altman analysis52 (link) was applied to assess the agreement between predicted IRE areas and ablated areas within limits of agreement determined as the mean differences between predicted IRE areas and ablated areas ±1.96 standard deviation (95% confidence interval). A p value of less than 0.05 was considered to indicate statistical significance. The number of animals used for experiments was defined using Pearson correlation guidelines53 (link).
+ Open protocol
+ Expand
5

Machine Learning Algorithms for Segmented Audio Classification

Check if the same lab product or an alternative is used in the 5 most similar protocols
Three machine learning algorithms were trained and tested to classify 30 s-segments into one of two class labels (Good, Bad): Support Vector Machine, K-nearest neighbors and Decision tree. The choice of these algorithms was motivated by their robustness and generalization power in high-dimensional classification problems [16 ], [26 (link)]–[29 (link)]. All the algorithms were implemented in Matlab 2017a using the Statistics and Machine Learning Toolbox (Mathworks Inc, USA).
+ Open protocol
+ Expand
6

Analyzing Visual Cortex BOLD Responses

Check if the same lab product or an alternative is used in the 5 most similar protocols
To analyze the overall slope within V1, we conducted ROI analyses using Matlab 2019a with Statistics and Machine Learning Toolbox (Mathworks Inc.). Vertices within the ROIs satisfied the following two conditions: 1) the offset contrast was significant and 2) within V1 defined by vcAtlas for each participant. We averaged the effect size of the slope across the ROI for each participant (S2 File). The averaged effect size of the slope and the offset within the ROI were tested for the difference from zero across the participants using a one-sample t-test. The significance level was set at P < 0.05.
To investigate the individual difference of the V1 BOLD response to the visual stimulus frequency, we tested Pearson’s product-moment correlation coefficient between the effect size of the slope or the offset within the ROI and the individual characteristics of age, MMSE, or UFOV performance. The significance level was set at P < 0.05 for the correlation test.
+ Open protocol
+ Expand
7

Noise Exposure Analysis of Shooter Exercises

Check if the same lab product or an alternative is used in the 5 most similar protocols
Statistical analysis of the obtained measurement data was carried out in order to assess whether there were significant differences in exposure to noise resulting from the two variants of the number of shooters simultaneously participating in the exercises and to assess the impact of distance on the values of noise parameters. Data were analyzed using MATLAB R2017b (version 9.3) software with the Statistics and Machine Learning Toolbox (MathWorks Inc., Natick, MA, USA). The analysis used the Wilcoxon test, equivalent to the Mann-Whitney U test.
+ Open protocol
+ Expand
8

Protein Sequence Network Analysis

Check if the same lab product or an alternative is used in the 5 most similar protocols
The distributions of the number of direct neighbours for each sequence (degree, n) and of the number of sequences forming connected networks (cluster size, s) were derived from protein sequence networks at different thresholds of pairwise sequence identity. The number of nodes N(n) having a degree of n was fitted by a power law N(n) ~ n, and the scaling exponent γ was derived from a log-log plot [64 (link)]. The number of clusters N(s) with size s was fitted by a power law N(s) ~ s, and the Fisher exponent τ was derived from a log-log plot, too [65 (link)]. Logarithmic histograms for the cluster sizes s were obtained for subsequent intervals (2 ≤ s ≤ 10, 11 ≤ s ≤ 100, 101 ≤ s ≤1000, and 1001 ≤ s ≤ 10,000). The slopes τh of these histograms were determined for sequence networks at different thresholds of sequence identity. The Fisher exponent τ was derived by fitting τh against model distributions as described previously [65 (link)].
The distributions of degrees and cluster sizes were analysed by linear fitting using the fitlm function from the Statistics and Machine Learning Toolbox (version 11.7) in MATLAB (version R2020a, The MathWorks, Natick, MA, USA).
+ Open protocol
+ Expand
9

Mass Spectrometry Data Analysis Protocol

Check if the same lab product or an alternative is used in the 5 most similar protocols
For mass spectral data acquisition and mass spectra comparison, the MSD ChemStation Software (Agilent Technologies) was used. Data export and Savitzky-Golay smoothing of the IMS data were performed with the LAV Software version 2.2.1 (Gesellschaft für Analytische Sensorsysteme mbH). Further data pre-processing of IMS and MS data, model building, validation, and calculation of figures of merit were implemented in own MATLAB routines and carried out in MATLAB (The MathWorks Inc., Natick, MA, USA) using the Statistics and Machine Learning Toolbox (MathWorks).
+ Open protocol
+ Expand
10

Optimal Model Selection Decision Boundary

Check if the same lab product or an alternative is used in the 5 most similar protocols
To generate a decision boundary for the optimal choice of either AICGMM or BICRSS based on the properties of the time series, we used MATLAB’s fitcsvm function from the Statistics and Machine Learning Toolbox (The MathWorks). For each data series from each tested model, we computed a preference score for AICGMM vs. BICRSS based on their individual F1 scores (Eq. 17). PreferenceforAICGMM=F1GMMF1GMM+F1RSS
For each model, these scores were binned in two dimensions according to the effective SNR and log10(number of samples) of their corresponding series. Each bin was labeled as either “AICGMM optimal” or “BICRSS optimal” according to its average preference score (AICGMM optimal for preference scores ≥0.5). These bin labels were used to train a support vector machine (SVM) classifier to determine a linear decision boundary in the two-dimensional space of effective SNR and number of samples.
+ Open protocol
+ Expand

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required

Sign up now

Revolutionizing how scientists
search and build protocols!