The genomic regions were defined as chromosome arms and centers based on previous works
12 (link),83 (link),84 (link). Briefly, recombination rates and H3K9 methylation enrichment were used to estimate the physical boundaries between the arms and central regions of each chromosome. Chromosome arms were characterized by high levels of recombination rate and H3K9 methylation. Coordinates of arms and center regions of each chromosome were shown in Supplementary Table
4.
Heterochromatin was defined by occupancy of H3K9me2, H3K9me3, and H3K27me3, and euchromatin was determined by covering of H3K4me3, H3K36me3, and H3K79me3
12 (link),85 (link)–88 (link). Association between chromodomain proteins and histone modifications was detected by correlation, heatmap, and peak overlapping analysis.
For correlation analysis, combined peaks were determined by peaks called from any of the chromodomain protein or histone modification ChIP-seq datasets. The overlapping peaks were merged (
n = 42,496). Then the merged peaks were divided into chromosome arm (
n = 24,284) and center peaks (
n = 18,212). The 95th percentile values were extracted from bigWig files using the Python package pyBigWig over each chromosome arm and center peak. The values were normalized to input signals, logarithm-transformed, and then standardized to Z scores. Correlation coefficients were calculated by the cor [Pearson] function in R using the treated values. A two-sided
t test was performed.
For heatmap analysis in Fig.
2b–e, Supplementary Figs.
8d–g,
9a, the called peaks of each chromodomain proteins were used. Deeptools subcommand computeMatrix (version 3.4.3) was used to calculate score matrix for heatmap with defined parameters (reference-point --referencePoint center -b 3000 -a 3000 --skipZeros). The heatmap was plotted with Deeptools subcommand plotHeatmap (version 3.5.0). To identify heterogeneity of chromatin states of chromodomain protein targets, heatmap of each chromodomain protein was clustered. Parameter --kmeans was used to define the number of clusters. For each chromodomain protein, --kmeans 1, 2, and 3 were all applied. Plots displaying no heterogeneity of chromatin states within each cluster were selected. A plot with minimum kmeans number for each chromodomain protein from the selected plots was shown.
Significant overlapped peaks of chromodomain proteins and histone modifications were defined by using IntervalStats software package
89 (link). The method compared each single peak region from a ‘query’ experiment to the set of peak regions in a ‘reference’ experiment. P-values represents the significance of query peak proximity to the reference peak. Pairs of peaks were considered significantly overlapped with
P < 0.05. Both chromodomain proteins and histone modifications were used as queries and references (Fig.
2a, Supplementary Fig.
8a–c). The percentage of overlap was listed in Supplementary Table
5. We used the percentage of PTM-covered peaks of each chromodomain protein for “graded histone occupied” evaluation. For many chromodomain proteins, PTM covered more than 30% of their peaks. However, for PTMs, chromodomain proteins rarely covered more than 30% (Supplementary Fig.
8b, c, Supplementary Table
5). We speculated that for a given PTM, there are a number of additional readers besides chromodomain proteins, such as Tudor and MBT, etc., may function redundantly to recognize the PTM.
“Graded histone modification occupied” for a pair of chromodomain protein and histone modification fell into 3 categories: grade 3 = “prominent”, grade 2 = “detectable”, and grade 1 = “weak”. Grade 3 was defined as pairs meeting all of the 3 criteria: 1. “Pearson correlation coefficient (
r) ≥ 0.300”. 2. “ ≥30% of significant peaks occupied by a certain histone modification (using IntervalStats software package with
P < 0.05)”. 3. “Overlap detected by heatmap analysis”. Grade 2 meets 2 out of the 3 criteria and grade 1 meets any of the 3. For example, on chromosome arms, Pearson correlation coefficient of CEC-5 GFP(A) and H3K9me2,
r = 0.333 (Supplementary Fig.
10); 44.3% peaks of CEC-5 were significantly occupied by H3K9me2 (Fig.
2a and Supplementary Table
5); Heatmap analysis found 1585 out of 1854 targets of CEC-5 were covered by H3K9me2 (Fig.
2b). Therefore, the association of CEC-5 and H3K9me2 on chromosome arms was classified as “grade 3 = prominent” (Fig.
2f and Supplementary Table
6).
Hou X., Xu M., Zhu C., Gao J., Li M., Chen X., Sun C., Nashan B., Zang J., Zhou Y., Guang S, & Feng X. (2023). Systematic characterization of chromodomain proteins reveals an H3K9me1/2 reader regulating aging in C. elegans. Nature Communications, 14, 1254.