We included all 24 of the established breast tumour samples for proteomic characterization using iTRAQ. Tumour tissue samples were maintained in cryovials at −80 °C until cryopulverization using a CP02 Cryprep Pulverizer (Covaris, Woburn, MA). 90 mg aliquots of cryofractured material were prepared for proteomic processing in aluminum weighing dishes on dry ice using spatulas kept cold in liquid nitrogen, with remaining material reserved for other applications. The 90 mg target was designed to include 40 mg for each of the collaborating research teams, with an anticipated yield for each team of 1.5–2 mg protein based on 4–5% recovery. To avoid systematic bias, sample processing was block randomized, with each intrinsic subtype proportionally represented in each processing tranche.
The reproducibility of the iTRAQ4-plex global proteome and phosphoproteome analysis workflow used in this study has been extensively tested for quantitative reproducibility both within and across laboratories in the CPTAC program14 (
link)49 (
link). Over a period of several months 5 iTRAQ4-plex replicates were measured at each of the 3 CPTAC proteome analysis centres. Each of these iTRAQ4-plexes contained duplicate measurements for both a basal WHIM2 and a luminal WHIM16 PDX samples that are also part of this study. A high degree of consistency in the number of proteins identified and correlation in the protein expression was obtained49 (
link). Pearson correlations for replicate proteome and phosphoproteome measurements were very high with a
R=0.9 in our previous study14 (
link) and very similar to the correlation observed here for the WHIM13 replicate measurement. These data show that our platform provides highly reproducible quantitative measurements for global proteomes and phosphoproteomes.
Protein extraction, digestion and iTRAQ labelling of peptides from breast cancer tumours. Cryopulverized breast cancer tumour samples tissues (∼2 combined aliquots of 90 mg tissue weight each) were homogenized in 1,000 μl lysis buffer containing 8M urea, 75 mM NaCl, 1 mM EDTA in 50 mM Tris HCl (pH 8), 10 mM NaF, phosphatase inhibitor cocktail 2 (1:100; Sigma, P5726) and cocktail 3 (1:100; Sigma, P0044), 2 μg ml
−1 aprotinin (Sigma, A6103), 10 μg ml
−1 Leupeptin (Roche, #11017101001), and 1 mM PMSF (Sigma, 78830). Lysates were centrifuged at 20,000
g for 10 min before measuring protein concentration of the clarified lysates by BCA assay (Pierce). Protein lysates were subsequently reduced with 5 mM dithiothreitol (Thermo Scientific, 20291) for 45 min at room temperature, and alkylated with 10 mM iodoacetamide (Sigma, A3221) for 45 min in the dark. Samples were diluted 4-fold with 50 mM Tris HCl (pH 8) prior to digesting them with LysC (Wako, 129-02541) for 4 h and trypsin (Promega, V511X) overnight at a 1:50 enzyme-to-protein ratio at room temperature overnight on a shaker.
Digested samples were acidified with formic acid (FA; Fluka, 56302) to a final volumetric concentration of 1% or final pH of ∼3–5, and centrifuged at 2,000 g for 5 min to clear precipitated urea from peptide lysates. Samples were desalted on C18 SepPak columns (Waters, 100 mg, WAT036820) and 1 mg peptide aliquots were dried down using a SpeedVac apparatus.
Construction of the Common internal Reference Pool. The proteomic and phosphoproteomic analyses of xenograft samples were performed as iTRAQ 4-plex experiments. Quantitative comparison between all samples analyzed was facilitated by the use of iTRAQ reporter ion ratios between each individual sample and a common internal reference sample present in each 4-plex. The reference sample was comprised of 16 of the 24 WHIM tumours analyzed in this study with equal contribution for each tumour (WHIM numbers 2, 4, 6, 8, 11, 12, 13, 14, 16, 18, 20, 21, 24, 25, 30 and 46). The 24 tumour samples were analyzed in nine independent 4-plex experiments, with three individual samples occupying the first three channels of each experiment and the 4th channel being reserved for the reference sample. While eight iTRAQ 4-plex experiments were used to analyze the 24 individual WHIM tumour samples, an additional 4-plex experiment was designed to include the WHIM13 sample for process replicate analysis and also internal reference samples from our human primary breast cancer study14 (
link) and a taxol drug response study (unpublished) to allow cross-referencing of the different datasets.
iTRAQ labelling, high pH reversed-phase separation and phosphopeptide enrichment of peptide samples. Desalted peptides were labelled with 4-plex iTRAQ reagents according to the manufacturer's instructions (AB Sciex, Foster City, CA). For each 1 mg peptide from each breast tumour sample, 10 units of labelling reagent were used. Peptides were dissolved in 300 μl of 0.5 M triethylammonium bicarbonate (TEAB) (pH 8.5) solution and labelling reagent was added in 700 μl of ethanol. After 1 h incubation, 1.5 ml of 0.05% TFA was added to stop the reaction. Differentially labelled peptides were mixed and subsequently desalted on 500 mg tC18 SepPak columns. The combined 4 mg iTRAQ samples per experiment were separated into 24 proteome fractions and 12 phosphoproteome fractions using a 4.6 mm × 250 mm column RP Zorbax 300 A ExtendC18 column (Agilent, 3.5 μm bead size) on an Agilent 1100 Series HPLC instrument by basic reversed-phase chromatography as described previously46 (
link). Peptides were separated according to their hydrophobicity using solvent A (2% acetonitrile, 5 mM ammonium formate, pH 10) and a nonlinear increasing concentration of solvent B (90% acetonitrile, 5 mM ammonium formate, pH 10). Phosphopeptides were enriched using Ni-NTA superflow agarose beads (Qiagen, #1018611) that were stripped of nickel with 100 mM EDTA and incubated in an aqueous solution of 10 mM FeCl
3 (Sigma, 451649) as described previously50 (
link). For phosphopeptide enrichment a 80% acetonitrile/0.1% trifluoroacetic acid binding buffer and a 500 mM dibasic sodium phosphate, pH 7.0, (Sigma, S9763) elution buffer were used. Enriched samples were desalted on StageTips as described50 (
link) before analysis by LC–MS/MS.
Analysis of tumour samples by high performance liquid chromatography tandem mass spectrometry (LC–MS/MS). All peptides were separated with an online nanoflow Proxeon EASY-nLC 1000 UHPLC system (Thermo Fisher Scientific) and analyzed on a benchtop Orbitrap Q Exactive mass spectrometer (Thermo Fisher Scientific) equipped with a nanoflow ionization source (James A. Hill Instrument Services, Arlington, MA, USA). The LC system, column, and platinum wire to deliver electrospray source voltage were connected via a stainless-steel cross (360 μm, IDEX Health & Science, UH-906x). The column was heated to 50 °C using a column heater sleeve (Phoenix-ST) to prevent overpressurizing of columns during UHPLC separation. 10% of each global proteome sample in a 2 μl injection volume, or 50% of each phosphoproteome sample in a 4 ul injection volume was injected onto an in-house packed 20 cm × 75 um diameter C18 silica picofrit capillary column (1.9 μm ReproSil-Pur C18-AQ beads, Dr Maisch GmbH, r119.aq; Picofrit 10 um tip opening, New Objective, PF360-75-10-N-5). Mobile phase flow rate was 200 nl min
−1, comprised of 3% acetonitrile/0.1% formic acid (Solvent A) and 90% acetonitrile /0.1% formic acid (Solvent B), and the 110-minute LC-MS/MS method consisted of a 10-min column-equilibration procedure, a 20-min sample-loading procedure, and the following gradient profile: (min:%B) 0:2; 1:6; 85:30; 94:60; 95;90; 100:90; 101:50; 110:50 (last two steps at 500 nl min
−1 flowrate). Data-dependent acquisition was performed using Xcalibur QExactive v2.1 software in positive ion mode at a spray voltage of 2.00 kV. MS1 Spectra were measured with a resolution of 70,000, an AGC target of 3e6 and a mass range from 300 to 1,800
m/
z. Up to 12 MS2 spectra per duty cycle were triggered at a resolution of 17,500, an AGC target of 5e4, an isolation window of 2.5
m/
z, a maximum ion time of 120 msec, and a normalized collision energy of 28. Peptides that triggered MS2 scans were dynamically excluded from further MS2 scans for 20 s. Charge state screening was enabled to reject precursor charge states that were unassigned, 1, or >6. Peptide match was enabled for monoisotopic precursor mass assignment.
Protein-peptide identification, phosphosite localization, and quantitation. All MS data were interpreted using the Spectrum Mill software package v5.1 (for comparison with proteomes of human breast tumours from our previous study14 (
link)) and v6.0 pre-release (Agilent Technologies, Santa Clara, CA, USA) co-developed by the authors. Similar MS/MS spectra acquired on the same precursor
m/
z within ± 45 s were merged. MS/MS spectra were excluded from searching if they failed the quality filter by not having a sequence tag length>0 (that is, minimum of two masses separated by the in-chain mass of an amino acid) or did not have a precursor MH+ in the range of 750–6,000. MS/MS spectra from were searched against a database consisting of RefSeq release 60 containing 31,767 human proteins, 24,821 mouse proteins, and an appended set of 85 common laboratory contaminant proteins (RefSeq.20130727-Human.20130730-MouseNR.mm13.contams). Scoring parameters were ESI-QEXACTIVE-HCD-v2, for whole proteome datasets, and ESI-QEXACTIVE-HCD-v3 parameters were for phosphoproteome datasets. All spectra were allowed ±20 p.p.m. mass tolerance for precursor and product ions, 40% minimum matched peak intensity, and trypsin allow P enzyme specificity with up to four missed cleavages. Fixed modifications were carbamidomethylation at cysteine. iTRAQ labelling was required at lysine, but peptide N-termini were allowed to be either labelled or unlabelled. Allowed variable modifications for whole proteome datasets were acetylation of protein N-termini, oxidized methionine, deamidation of asparagine, pyro-glutamic acid at peptide N-terminal glutamine, and pyro-carbamidomethylation at peptide N-terminal cysteine with a precursor MH+ shift range of −18 to 64 Da. Allowed variable modifications for phosphoproteome dataset were revised to disallow deamidation and allow phosphorylation of serine, threonine, and tyrosine with a precursor MH+ shift range of 0 to 272 Da.
Identities interpreted for individual spectra were automatically designated as confidently assigned using the Spectrum Mill autovalidation module to use target-decoy based FDR estimates to apply score threshold criteria via two-step strategies. For the whole proteome datasets thresholding was done at the spectral and protein levels. For the phosphoproteome datasets thresholding was done at the spectral level. In step 1, peptide autovalidation was done first and separately for each iTRAQ 4-plex experiment consisting of either 25 LC–MS/MS runs (whole proteome) or 13 LC–MS/MS runs (phosphoproteome) using an auto thresholds strategy with a minimum sequence length of 7(whole proteome) or 8 (phosphoproteome), automatic variable range precursor mass filtering, and score and delta Rank1–Rank2 score thresholds optimized to yield a spectral level FDR estimate for precursor charges 2 through 4 of <0.6% for each precursor charge state in each LC–MS/MS run. For precursor charges 5–6, thresholds were optimized to yield a spectral level FDR estimate of <0.3% across all runs per iTRAQ 4-plex experiment (instead of each run), to achieve reasonable statistics, since many fewer spectra are generated for the higher charge states.
In step 2 for the whole proteome datasets, protein polishing autovalidation was applied separately to each iTRAQ 4-plex experiment to further filter the PSM's using a target protein-level FDR threshold of zero. The primary goal of this step was to eliminate peptides identified with low scoring peptide spectrum matches (PSM's) that represent proteins identified by a single peptide, so-called ‘one-hit wonders'. After assembling protein groups from the autovalidated PSM's, protein polishing determined the maximum protein level score of a protein group that consisted entirely of distinct peptides estimated to be false-positive identifications (PSM's with negative delta forward-reverse scores). PSM's were removed from the set obtained in the initial peptide-level autovalidation step if they contributed to protein groups that have protein scores below the max false-positive protein score. In the filtered results each identified protein detected in an iTRAQ 4-plex experiment was comprised of multiple peptides unless a single excellent scoring peptide was the sole match. For the whole proteome datasets the above criteria yielded FDR of <0.5% at the peptide-spectrum match level and <0.8% at the distinct peptide level for each iTRAQ 4-plex experiment. After assembling proteins with all the PSMs from all the iTRAQ 4-plex experiments together the aggregate FDR estimates were 0.42% at the at the peptide-spectrum match level, 1.5% at the distinct peptide level, and <0.01% (1/11,372) at the protein group level. Since the protein level FDR estimate neither explicitly required a minimum number of distinct peptides per protein nor adjusted for the number of possible tryptic peptides per protein, it may underestimate false positive protein identifications for large proteins observed only on the basis of multiple low scoring PSMs.
In calculating scores at the protein level and reporting the identified proteins, redundancy was addressed in the following manner: the protein score was the sum of the scores of distinct peptides. A distinct peptide was the single highest scoring instance of a peptide detected through an MS/MS spectrum. MS/MS spectra for a particular peptide may have been recorded multiple times, (that is, as different precursor charge states, in adjacent bRP fractions, modified by deamidation at Asn or oxidation of Met, or different phosphosite localization) but were still counted as a single distinct peptide. When a peptide sequence >8 residues long was contained in multiple protein entries in the sequence database, the proteins were grouped together and the highest scoring one and its accession number were reported. In some cases when the protein sequences were grouped in this manner there were distinct peptides that uniquely represented a lower scoring member of the group (isoforms, family members, and different species). Each of these instances spawned a subgroup. Multiple subgroups were reported and counted towards the total number of proteins, and were given related protein subgroup numbers (for example, 3.1 and 3.2: group 3, subgroups 1 and 2). To better dissect the tumour/stroma (human/mouse) origin of orthologous proteins in this xenograft experiment, the inclusion of peptides contributing to each subgroup was restricted by enabling the subgroup-specific (SGS) option in Spectrum Mill. Only subgroup-specific peptide sequences were counted toward each subgroup's count of distinct peptides and protein level TMT quantitation. The SGS option omits peptides that are shared between subgroups. If evidence for BOTH human and mouse peptides from an orthologous protein were observed, then peptides that cannot distinguish the two (shared) were ignored. However, the peptides shared between species were retained if there was specific evidence for only one of the species, thus yielding a single subgroup attributed to only the single species consistent with the specific peptides. Furthermore, if all peptides observed for a protein group were shared between species, thus yielding a single subgroup composed of indistinguishable species, then all peptides were retained (the column in
Supplementary Data 3 numSpeciesPresentR1 will have a value of 2 in such cases). Assembly of confidently identified PSM's yielded 20,480 total protein subgroups from 11,372 protein groups. Human and mouse ortholog proteins were typically arranged into individual subgroups.
In step 2 for the phosphoproteome datasets a phosphosite table were assembled with columns for individual iTRAQ 4-plex experiments and rows for individual phosphosites. PSM's were combined into a single row for all non-conflicting observations of a particular phosphosite. (that is, different missed cleavage forms, different precursor charges, confident and ambiguous localizations, different sample handling modifications). For related peptides neither observations with a different number of phosphosites nor different confident localizations were allowed to be combined. Selecting the representative peptide from the combined observations was done such that once confident phosphosite localization was established, higher identification scores and longer peptide lengths are preferable. After assembling the phosphosite table a polishing step was applied to further filter the phosphosites with the primary goal of eliminating phosphosites with representative peptides identified through low scoring peptide spectrum matches (PSM's) that were observed in only a few experiments. The initial table of representative peptides for 82,030 phosphosites had an aggregate FDR of 3.3% at phosphosite-level. The table was sorted by identification score and then by number of iTRAQ 4-plex experiments in which the phosphosite was observed. The cumulative FDR trend showed inflection points at an identification score of ∼8. Phosphosites with an identification score<8.0 observed in <3/9 experiments were therefore removed, yielding 68,385 phosphosites with an aggregate FDR of 0.34% at the phosphosite level. While the Spectrum Mill identification score is based on the number of matching peaks, their ion type assignment, and the relative height of unmatched peaks, the phosphosite localization score is the difference in identification score between the top two localizations. The score threshold for confident localization (>1.1) essentially corresponds to at least 1 b or y ion located between two candidate sites that has a peak height 10% of the tallest fragment ion (neutral losses of phosphate from the precursor and related ions as well as immonium and iTRAQ reporter ions are excluded from the relative height calculation). The ion type scores for b-H3PO4, y-H3PO4, b-H2O, and y-H2O ion types are all set to 0.5. This prevents inappropriate confident localization assignment when a spectrum lacks primary b or y ions between two possible sites but contains ions that can be assigned as either phosphate loss ions for one localization or water loss ions for another localization. In aggregate, 66.3% of the reported phosphosites were fully localized to a particular serine, threonine, or tyrosine residue.
Relative abundances of proteins and phosphosites were determined in Spectrum Mill using iTRAQ reporter ion intensity ratios from each PSM. A protein-level or phosphosite-level iTRAQ ratio was calculated as the median of all PSM level ratios contributing to a protein subgroup or phosphosite remaining after excluding those PSM's lacking an iTRAQ label, having a negative delta forward-reverse score (half of all false-positive identifications), or having a precursor ion purity<50% (MS/MS has significant precursor isolation contamination from co-eluting peptides). Unless stated otherwise for a particular analysis, the following considerations apply to the tumour/stroma (human/mouse) origin of a protein in this xenograft experiment. For the proteome dataset, only PSM's from subgroup-specific peptide sequences contributed to the protein level quantitation (see protein subgrouping description above). A protein detected with all contributing PSM's shared between human and mouse was considered to be human. For the phosphoproteome dataset, a phosphosite was considered to be mouse if the contributing PSM's were distinctly mouse and human if they were either distinctly human or shared between human and mouse. A 2-component Gaussian mixture model-based normalization approach was used to centre the distribution of iTRAQ log-ratios around zero to nullify the effect of differential protein loading and/or systematic MS variation14 (
link). Downstream analyses presented in the main figures were restricted to proteins/phosphosites quantified in at least 10 out of the 24 samples with non-missing values (
Supplementary Fig. 3), with the exception of the previously described mRNA-protein correlation analysis13 (
link) requiring quantification in 30%, or 8 out of 24, PDX samples. Specific filtering procedures are noted in descriptions of the relevant methods.
Huang K.L., Li S., Mertins P., Cao S., Gunawardena H.P., Ruggles K.V., Mani D.R., Clauser K.R., Tanioka M., Usary J., Kavuri S.M., Xie L., Yoon C., Qiao J.W., Wrobel J., Wyczalkowski M.A., Erdmann-Gilmore P., Snider J.E., Hoog J., Singh P., Niu B., Guo Z., Sun S.Q., Sanati S., Kawaler E., Wang X., Scott A., Ye K., McLellan M.D., Wendl M.C., Malovannaya A., Held J.M., Gillette M.A., Fenyö D., Kinsinger C.R., Mesri M., Rodriguez H., Davies S.R., Perou C.M., Ma C., Reid Townsend R., Chen X., Carr S.A., Ellis M.J, & Ding L. (2017). Proteogenomic integration reveals therapeutic targets in breast cancer xenografts. Nature Communications, 8, 14864.