Benchmarking Immune Cell Signatures

We considered four external expression data sets from enriched/purified immune cells: two microarray data sets (GEO accession: GSE28490 and GSE2849) [27], an RNA-seq data set [28], and a microarray compendium that was used to build the CIBERSORT LM22 signature matrix [17]. All data sets were preprocessed and normalized as explained in the previous paragraphs. For each gene g specific for a cell type c in the signature matrix, we computed the ratio R_gd between the median expression across all libraries in data set d belonging to cell type c and the median expression across all libraries in data set d not belonging to cell type c. For each cell type, the top 30 ranked signature genes (or fewer, when not available) with median_d(R_gd) ≥ 2 were selected for the final signature matrix. When processing the T_reg signature genes, the data sets belonging to CD4⁺ T cells were not considered. T_reg signature genes were further filtered with a similar approach, but considering RNA-seq data of circulating CD4⁺ T and T_reg cells and selecting only the genes with median_d(R_gd) ≥ 1.
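The ratio-based filter described above can be sketched in a few lines of Python. This is an illustrative sketch, not the quanTIseq implementation: the function names, the dictionary-based data layout, and the handling of a zero denominator are assumptions. The text does not state how the candidate genes are ranked, so ranking by median_d(R_gd) is also an assumption.

```python
from statistics import median


def specificity_ratio(expr, labels, gene, cell_type):
    """Compute R_gd for one gene in one data set.

    expr:   dict mapping gene -> list of expression values (one per library)
    labels: list of cell-type labels, aligned with the libraries
    Returns None when the ratio cannot be computed (assumption).
    """
    inside = [x for x, lab in zip(expr[gene], labels) if lab == cell_type]
    outside = [x for x, lab in zip(expr[gene], labels) if lab != cell_type]
    if not inside or not outside:
        return None
    denominator = median(outside)
    if denominator == 0:
        return None  # assumption: skip data sets with a zero background median
    return median(inside) / denominator


def select_signature_genes(datasets, candidates, cell_type,
                           min_ratio=2.0, top_n=30):
    """Keep up to top_n candidate genes with median_d(R_gd) >= min_ratio.

    datasets: list of (expr, labels) pairs, one per external data set
    """
    scored = []
    for gene in candidates:
        ratios = [specificity_ratio(expr, labels, gene, cell_type)
                  for expr, labels in datasets if gene in expr]
        ratios = [r for r in ratios if r is not None]
        if ratios and median(ratios) >= min_ratio:
            scored.append((median(ratios), gene))
    # Assumption: rank by the median ratio, highest first.
    scored.sort(key=lambda pair: -pair[0])
    return [gene for _, gene in scored[:top_n]]
```

For the additional T_reg filter, the same function could be called with `min_ratio=1.0` on a data set containing only circulating CD4⁺ T and T_reg libraries.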
The final signature matrix TIL10 (Additional file 1) was built considering the 170 genes satisfying all the criteria reported above. The expression profile of each cell type c was computed as the median of the expression values x_gl over all libraries belonging to that cell type:

x_{gc} = \operatorname{median}_{l \in c}\left(x_{gl}\right)
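As a minimal sketch of this definition, the per-cell-type profile x_gc can be computed by grouping libraries by their label and taking the median for each gene. The function name `signature_profile` and the dictionary layout are hypothetical and chosen only for illustration.

```python
from statistics import median


def signature_profile(expr, labels, genes):
    """Return {gene: {cell_type: x_gc}}, where x_gc is the median of x_gl
    over all libraries l labelled with cell type c.

    expr:   dict mapping gene -> list of expression values (one per library)
    labels: list of cell-type labels, aligned with the libraries
    """
    cell_types = sorted(set(labels))
    profile = {}
    for gene in genes:
        profile[gene] = {}
        for cell_type in cell_types:
            values = [x for x, lab in zip(expr[gene], labels)
                      if lab == cell_type]
            profile[gene][cell_type] = median(values)
    return profile
```

Using the median rather than the mean keeps a single outlier library from dominating the profile of a cell type.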

For the analysis of RNA-seq data, quanTIseq further reduces this signature matrix by removing a manually curated list of genes that showed variable expression in the considered data sets: CD36, CSTA, NRGN, C5AR2, CEP19, CYP4F3, DOCK5, HAL, LRRK2, LY96, NINJ2, PPP1R3B, TECPR2, TLR1, TLR4, TMEM154, and CD248. This default signature, which quanTIseq uses for the analysis of RNA-seq data, consists of 153 genes and has a lower condition number than the full TIL10 signature (6.73 compared to 7.45), confirming its higher cell specificity. We advise using the full TIL10 matrix (--rmgenes="none") for the analysis of microarray data, as they often lack some signature genes, and the reduced matrix (--rmgenes="default") for RNA-seq data. Alternatively, the "rmgenes" option allows specifying a custom list of signature genes to be disregarded (see quanTIseq manual).
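The effect of removing genes on the condition number can be checked with a short NumPy sketch. The helper names `drop_genes` and `condition_number` are hypothetical, and the sketch assumes a genes-by-cell-types matrix and the 2-norm condition number (the default of `numpy.linalg.cond`); the source does not state which norm was used for the reported values.

```python
import numpy as np


def drop_genes(matrix, genes, remove):
    """Remove the rows of a genes x cell-types matrix whose gene is in `remove`.

    Returns the reduced matrix and the list of retained gene names.
    """
    matrix = np.asarray(matrix, dtype=float)
    remove = set(remove)
    keep = [i for i, gene in enumerate(genes) if gene not in remove]
    return matrix[keep, :], [genes[i] for i in keep]


def condition_number(matrix):
    """2-norm condition number: ratio of largest to smallest singular value."""
    return float(np.linalg.cond(np.asarray(matrix, dtype=float)))


# Toy example (made-up values): the third gene is expressed in both cell
# types, so dropping it makes the two columns easier to tell apart.
toy_genes = ["g1", "g2", "g3"]
toy_matrix = [[10.0, 0.0],
              [0.0, 10.0],
              [5.0, 5.0]]
reduced, kept = drop_genes(toy_matrix, toy_genes, ["g3"])
print(condition_number(toy_matrix), condition_number(reduced))
```

A lower condition number means the cell-type columns are less collinear, which makes the deconvolution less sensitive to noise in the mixture.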

Finotello F., Mayer C., Plattner C., Laschober G., Rieder D., Hackl H., Krogsdam A., Loncova Z., Posch W., Wilflingseder D., Sopper S., Ijsselsteijn M., Brouwer T.P., Johnson D., Xu Y., Wang Y., Sanders M.E., Estrada M.V., Ericsson-Gonzalez P., Charoentong P., Balko J., de Miranda N.F, & Trajanoski Z. (2019). Molecular and pharmacological modulators of the tumor immune contexture revealed by deconvolution of RNA-seq data. Genome Medicine, 11, 34.

Publication 2019

Corresponding Organization : Austrian Drug Screening Institute (Austria)

Other organizations : Universität Innsbruck, Innsbruck Medical University, Leiden University, Vanderbilt University, Vanderbilt University Medical Center, National Center for Tumor Diseases, University Hospital Heidelberg

Protocol cited in 30 other protocols

Variable analysis

independent variables

Four external expression data sets from enriched/purified immune cells: two microarray data sets (GEO accession: GSE28490 and GSE2849), an RNA-seq data set, and a microarray compendium that was used to build the CIBERSORT LM22 signature matrix
The ratio R_gd between the median expression across all libraries in data set d belonging to cell type c and the median expression across all libraries in data set d not belonging to cell type c

dependent variables

The top 30 ranked signature genes (or fewer, when not available) with median_d(R_gd) ≥ 2 for each cell type
The expression profile of each cell type c, computed as the median of the expression values x_gl over all libraries belonging to that cell type

control variables

When processing the T_reg signature genes, the data sets belonging to CD4⁺ T cells were not considered
For the analysis of RNA-seq data, quanTIseq further reduces this signature matrix by removing a manually curated list of genes that showed variable expression in the considered data sets