We assessed the performance, stability, and reproducibility of the PSO-enhanced thromboSeq platform using multiple training, evaluation, and independent validation cohorts. All classification experiments were performed with the PSO-enhanced thromboSeq algorithm, using parameters optimized by particle swarm intelligence. We assigned for the matched cohort 133 samples for training-evaluation, of which 93 were used for RUV-correction, gene panel selection, and SVM training (training cohort), and 40 were used for gene panel optimization (evaluation cohort). The full cohort contained 208 samples for training-evaluation, of which 120 were used for RUV-correction, gene panel selection, and SMV training (training cohort), and 88 were used for gene panel optimization (evaluation cohort). All random selection procedures were performed using the sample-function as implemented in R. For assignment of samples per cohort to the training and evaluation cohorts, only the number of samples per clinical group was balanced, whereas other potentially contributing variables were not stratified at this stage (assuming random distribution among the groups). Following, an SVM model was trained using the training samples, and the samples assigned to the independent validation cohort were predicted. The late-stage NSCLC samples and early-stage locally advanced NSCLC samples were validated separately resulting in two ROC curves. The 53 locally advanced NSCLC samples were age-matched with 53 non-cancer individuals selected from the non-cancer samples of the independent validation cohort. Performance of the training cohort was assessed by a leave-one-out cross validation approach (LOOCV, see also (Best et al., 2015 (link))). During a LOOCV procedure, all samples minus one (‘left-out sample’) are used for training of the algorithm. Each sample is predicted once, resulting in the same number of predictions as samples in the training cohort. The list of stable genes among the initial training cohort, determined RUV-factors for removal, and final gene panel determined by swarm-optimization of the training-evaluation cohort were used as input for the LOOCV procedure. As a control for internal reproducibility, we randomly sampled training and evaluation cohorts, while maintaining the validation cohorts and the swarm-guided gene panel of the original classifier, and perform 1000 (matched and full cohort NSCLC/non-cancer) training and classification procedures. As a control for random classification, class labels of the samples used by the SVM-algorithm for training of the support vectors were randomly permutated, while maintaining the swarm-guided gene list of the original classifier. This process was performed 1000 times for the matched and full NSCLC/non-cancer cohort classifiers. P values were calculated accordingly, as described previously (Best et al., 2015 (link)). Results were presented in receiver operating characteristics (ROC) curves, and summarized using area under the curve (AUC)-values, as determined by the ROCR-package in R. AUC 95% confidence intervals were calculated according to the method of Delonge using the ci.auc-function of the pROC-package in R.
Best M.G., Sol N., In ‘t Veld S.G., Vancura A., Muller M., Niemeijer A.L., Fejes A.V., Tjon Kon Fat L.A., Huis In ‘t Veld A.E., Leurs C., Le Large T.Y., Meijer L.L., Kooi I.E., Rustenburg F., Schellen P., Verschueren H., Post E., Wedekind L.E., Bracht J., Esenkbrink M., Wils L., Favaro F., Schoonhoven J.D., Tannous J., Meijers-Heijboer H., Kazemier G., Giovannetti E., Reijneveld J.C., Idema S., Killestein J., Heger M., de Jager S.C., Urbanus R.T., Hoefer I.E., Pasterkamp G., Mannhalter C., Gomez-Arroyo J., Bogaard H.J., Noske D.P., Vandertop W.P., van den Broek D., Ylstra B., Nilsson R.J., Wesseling P., Karachaliou N., Rosell R., Lee-Lewandrowski E., Lewandrowski K.B., Tannous B.A., de Langen A.J., Smit E.F., van den Heuvel M.M, & Wurdinger T. (2017). Swarm Intelligence-Enhanced Detection of Non-Small-Cell Lung Cancer Using Tumor-Educated Platelets. Cancer Cell, 32(2), 238-252.e9.
Corresponding Organization : Massachusetts General Hospital
Other organizations :
The Netherlands Cancer Institute, Medical University of Vienna, Umeå University, University Medical Center Utrecht, Heidelberg University, University Hospital Heidelberg, Utrecht University, Princess Máxima Center, Instituto Oncológico Dr. Rosell, Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia, Radboud University Nijmegen, Radboud University Medical Center
Parameters optimized by particle swarm intelligence
dependent variables
Performance
Stability
Reproducibility
control variables
Swarm-guided gene panel of the original classifier
Class labels of the samples used by the SVM-algorithm for training of the support vectors
Annotations
Based on most similar protocols
Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to
get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required