Integrative Genomic Analysis of Human Islet Cells

We split the genome into 5 kb windows and removed windows overlapping blacklisted regions (v2) from ENCODE^{86 (link),87 (link)}. For each experiment, we created a sparse m x n matrix containing read depth for m cells passing read depth thresholds at n windows. Using scanpy^{88 (link)} (v.1.4.4.post1), we extracted highly variable windows using mean read depth and normalized dispersion (‘min_mean=0.01, min_disp=0.25’). After normalization to uniform read depth and log-transformation, for each experiment, we regressed out the log-transformed read depth within highly variable windows for each cell. We then performed principal component analysis (PCA) and extracted the top 50 principal components. We used Harmony^{24 (link)} to correct the principal components and remove batch effects across experiments, using donor-of-origin as a covariate. We used Harmony-corrected components to calculate the nearest 30 neighbors using the cosine metric, which were subsequently used for UMAP dimensionality reduction (‘min_dist=0.3’) and Leiden clustering^{89 (link)} (‘resolution=1.5’).
We performed iterative clustering to identify and remove cells with abnormal features prior to the final clustering results (see Supplementary Note). After removing these cells, we ended up with 15,298 cells mapping to 12 clusters. We used chromatin accessibility at windows overlapping promoters for marker hormones to assign cell types for the endocrine islet cell types and chromatin accessibility at windows around marker genes from scRNA-seq to assign cluster labels for non-endocrine islet clusters.

Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link: Access Free Full Text.

Chiou J., Zeng C., Cheng Z., Han J.Y., Schlichting M., Miller M., Mendez R., Huang S., Wang J., Sui Y., Deogaygay A., Okino M.L., Qiu Y., Sun Y., Kudtarkar P., Fang R., Preissl S., Sander M., Gorkin D.U, & Gaulton K.J. (2021). Single cell chromatin accessibility identifies pancreatic islet cell type- and state-specific regulatory programs of diabetes risk. Nature genetics, 53(4), 455-466.

Publication 2021

Cells Chromatin Donor Endocrine Endocrine cell types Genes marker Genome Hormones Islet cell M cells Scrna seq

Corresponding Organization : Emory University

Top 5 similar protocols

Protocol cited in 5 other protocols

Variable analysis

independent variables

Genome window size (5 kb)
Removal of windows overlapping blacklisted regions
Highly variable window selection based on mean read depth and normalized dispersion
Normalization to uniform read depth and log-transformation
Regression of log-transformed read depth within highly variable windows
Principal component analysis (PCA)
Harmony batch effect correction using donor-of-origin as a covariate
Nearest 30 neighbor calculation using cosine metric
UMAP dimensionality reduction
Leiden clustering
Iterative clustering to remove cells with abnormal features

dependent variables

Read depth at genome windows
Highly variable windows
Principal components
Harmony-corrected principal components
Cell clusters

control variables

Blacklisted regions from ENCODE (v2)
Minimum mean read depth (0.01) and minimum normalized dispersion (0.25) thresholds for highly variable window selection
UMAP 'min_dist' parameter set to 0.3
Leiden clustering 'resolution' parameter set to 1.5

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!