To analyze short-read NChIP-seq and CUT&RUN data, two parallel methods were developed: (i) marker-assisted mapping to the T2T-CHM13 reference and (ii) reference-free region-specific marker enrichment. For marker-assisted mapping, reads were aligned to the reference then filtered to include only alignments that overlap precomputed nucleotide oligomers of length k (k-mers) that occur in only one distinct position in the reference. For reference-free enrichment analysis, a set of k-mers that are enriched in CENP-A–targeted sequencing reads (relative to reads from input or immunoglobulin G controls) were first identified. Next, these enriched k-mers were compared with precomputed k-mers in the reference that occur exclusively within a single window of a given size (“region-specific markers”). Windows with multiple matches to enriched k-mers were reported as enriched for CENP-A. We performed a similar analysis using HOR-hap–specific markers on chrX, to reveal the broad enrichment of CENP-A on each HOR-hap across multiple individuals (
Comprehensive Repeat Annotation of CHM13
Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Corresponding Organization : University of California, Santa Cruz
Other organizations : University of California, Berkeley, University of Washington, Berkeley College, University of California, San Diego, Stanford University, University of Connecticut, Sirius University of Science and Technology, Vavilov Institute of General Genetics, Moscow Polytechnic University, University of California, Davis, Johns Hopkins University, Stowers Institute for Medical Research, St Petersburg University, National Human Genome Research Institute, National Institutes of Health, University of Montana, Howard Hughes Medical Institute, Lomonosov Moscow State University, University of Massachusetts Chan Medical School, Chan Zuckerberg Initiative (United States), Lawrence Berkeley National Laboratory, QB3, Duke University, The University of Kansas Cancer Center, National Institute of Standards and Technology, Russian Academy of Sciences
Protocol cited in 26 other protocols
Variable analysis
- Repeats in the T2T-CHM13 assembly were annotated by parsing and combining output from RepeatMasker along with custom-built pipelines for annotating αSat and HSat2,3
- Phylogenetic analyses of αSat sequences were performed with MEGA5
- Dotplots colored by percent identity were produced with StainedGlass
- Regions identified as "SAR" by RepeatMasker were annotated as HSat1A
- Regions annotated as "HSATI" by RepeatMasker were annotated as HSat1B
- αSat HOR-haps were identified by (i) generating multiple alignments of all HOR units (or subregions of HOR units) from an array, (ii) deriving a consensus sequence, (iii) recoding the individual sequences into binary vectors based on matches to the consensus, and (iv) clustering these binary vectors by use of k-means clustering
- Marker-assisted mapping to the T2T-CHM13 reference and reference-free region-specific marker enrichment were used to analyze short-read NChIP-seq and CUT&RUN data
- Control variables not explicitly mentioned
- None specified
- None specified
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!