DNA methylation profiling was performed as described previously6 (link),7 (link). DNA was extracted in the same manner as described for WGS using 500ng as input material for fresh frozen tissue and 250ng input material for FFPE tissue. Array data was created using the Infinium HumanMethylation450 BeadChip array (450k array) according to the manufacturer’s instructions (Illumina, San Diego, USA) at the DKFZ Genomics and Proteomics Core Facility (Heidelberg, Germany). For a subset of samples (described in Supplementary Table 1) Methylation BeadChip (EPIC) arrays were used. CpG probes that were used for the analysis were filtered based on presence of a common SNP within five bases of the probe, reads not mapping uniquely to the reference genome, probes mapping to the X and Y chromosome and reads not overlapping between 450K arrays and EPIC arrays.
Clustering was performed after correction of samples for the origin of the DNA (FFPE or Fresh frozen) using Surrogate Variable Analysis (sva)40 (link) and only the 10000 most variable probes based on the full dataset after correction were selected for clustering. Distance was calculated using 1-Pearson correlation and linkage was calculated using average as measure. Subsequently, t-stochastic neighbourhood embedding (t-SNE) analysis using RTSNE (v0.13) was applied to generate the figures41 .