The 450k array was used to obtain genome-wide DNA methylation profiles for tumour samples and normal control tissues, according to the manufacturer’s instructions (Illumina, San Diego, USA). DNA methylation data was generated at the Genomics and Proteomics Core Facility of the DKFZ (Heidelberg, Germany) and the NYU Langone Medical Center (New York, USA). Data was generated from both fresh-frozen and formalin-fixed paraffin-embedded (FFPE) tissue samples. For most fresh-frozen samples, >500 ng of DNA was used as input material. 250 ng of DNA was used for most FFPE tissues. On-chip quality metrics of all samples were carefully controlled. Copy-number variation (CNV) analysis from 450k methylation array data was performed using the conumee Bioconductor package version 1.3.0. Two sets of 50 control samples displaying a balanced copy-number profile from both male and female donors were used for normalization.
Raw signal intensities were obtained from IDAT-files using the minfi Bioconductor package version 1.14.0 36 . Each sample was individually normalized by performing a background correction (shifting of the 5 % percentile of negative control probe intensities to 0) and a dye-bias correction (scaling of the mean of normalization control probe intensities to 10,000) for both colour channels. Subsequently, a correction for the type of material tissue (FFPE/frozen) was performed by fitting univariate, linear models to the log2-transformed intensity values (removeBatchEffect function, limma package version 3.24.15). The methylated and unmethylated signals were corrected individually. Estimated batch effects were also used to adjust diagnostic samples or test samples within the cross-validation. Beta-values were calculated from the retransformed intensities using an offset of 100 (as recommended by Illumina). To analyse for possible confounding batch effects within our pre-processed reference cohort dataset (after adjusting for FFPE versus frozen material) we applied the sva algorithm 37 ,38 . We found no significant surrogate variable (data not shown).
The following filtering criteria were applied: Removal of probes targeting the X and Y chromosomes (n=11,551), removal of probes containing a single-nucleotide polymorphism (dbSNP132 Common) within five base pairs of and including the targeted CpG site (n=7,998), probes not mapping uniquely to the human reference genome (hg19) allowing for one mismatch (n=3,965), and probes not included on the Illumina EPIC array (n=32,260). In total, 428,799 probes targeting CpG sites were kept for further analysis.
Capper D., Jones D.T., Sill M., Hovestadt V., Schrimpf D., Sturm D., Koelsche C., Sahm F., Chavez L., Reuss D.E., Kratz A., Wefers A.K., Huang K., Pajtler K.W., Schweizer L., Stichel D., Olar A., Engel N.W., Lindenberg K., Harter P.N., Braczynski A., Plate K.H., Dohmen H., Garvalov B.K., Coras R., Hölsken A., Hewer E., Bewerunge-Hudler M., Schick M., Fischer R., Beschorner R., Schittenhelm J., Staszewski O., Wani K., Varlet P., Pages M., Temming P., Lohmann D., Selt F., Witt H., Milde T., Witt O., Aronica E., Giangaspero F., Rushing E., Scheurlen W., Geisenberger C., Rodriguez F.J., Becker A., Preusser M., Haberler C., Bjerkvig R., Cryan J., Farrell M., Deckert M., Hench J., Frank S., Serrano J., Kannan K., Tsirigos A., Brück W., Hofer S., Brehmer S., Seiz-Rosenhagen M., Hänggi D., Hans V., Rozsnoki S., Hansford J.R., Kohlhof P., Kristensen B.W., Lechner M., Lopes B., Mawrin C., Ketter R., Kulozik A., Khatib Z., Heppner F., Koch A., Jouvet A., Keohane C., Mühleisen H., Mueller W., Pohl U., Prinz M., Benner A., Zapatka M., Gottardo N.G., Driever P.H., Kramm C.M., Müller H.L., Rutkowski S., von Hoff K., Frühwald M.C., Gnekow A., Fleischhack G., Tippelt S., Calaminus G., Monoranu C.M., Perry A., Jones C., Jacques T.S., Radlwimmer B., Gessi M., Pietsch T., Schramm J., Schackert G., Westphal M., Reifenberger G., Wesseling P., Weller M., Collins V.P., Blümcke I., Bendszus M., Debus J., Huang A., Jabado N., Northcott P.A., Paulus W., Gajjar A., Robinson G., Taylor M.D., Jaunmuktane Z., Ryzhova M., Platten M., Unterberg A., Wick W., Karajannis M.A., Mittelbronn M., Acker T., Hartmann C., Aldape K., Schüller U., Buslei R., Lichter P., Kool M., Herold-Mende C., Ellison D.W., Hasselblatt M., Snuderl M., Brandner S., Korshunov A., von Deimling A, & Pfister S.M. (2018). DNA methylation-based classification of central nervous system tumours. Nature, 555(7697), 469-474.