Reconstructing Developmental Trajectories from scRNA-seq

We check the scree plot to choose ten dimension as the intrinsic dimensions to reconstruct the developmental trajectory for the Paul dataset (cells used in Figure 1 of the original study^{9 (link)}). Five branch points and six terminal lineages (monocytes, neutrophils or eosinophil, basophils, dendritic cells, megakaryocytes, and erythrocytes) are revealed. We ordered the cells using genes Paul et al. used to cluster their data rather than the genes from dpFeature, for the sake of consistency with their clusetering analysis. Similarly, we reconstruct Olsson datasets in four dimensions. The major bifurcation between the granulocyte and monocyte branch (GMP) as well as the intricate branch between GMP and megakaryocyte/erythrocyte (Ery/Meg) are revealed. Top 1, 000 genes from dpFeature based on WT cells are used in both of the WT and full datasets. The distribution (related to confusion matrix) of percentages of cells in each cluster from the original papers over each segment (state in Monocle 2) of the principal graph are calculated and visualized in the heatmap.
We applied BEAM analysis to identify genes significantly bifurcating between Ery/Meg and GMP branch on the Olsson wildtype dataset. We then calculate the instant log ratios (ILRs) of gene expression between Ery/Meg and GMP branch and find genes have mean ILR larger than 0.5. The ILRs are defined as:

{ILR}_{t} = \log (\frac{Y_{1}^{t}}{Y_{2}^{t}})

{ILR}_{t}

is calculated as the log ratio of fitted value at interpolated pseudotime point

t

for the Ery/Meg lineage and that for the GMP lineage. Those genes are used to calculate the lineage score (simply calculated as average expression of those genes in each cell, same as stemness score below) for both of the Olsson and the Paul dataset which is used to color the cells in a tree plot transformed from the high dimensional principal graph (see Supplementary Notes). The same genes are used to create the multi-way heatmap for both of the Paul and Olsson dataset (see plot multiple_branches_heatmap function). Critical functional genes from this procedure are identified. Car1, Car2 (important erythroid functional genes for reversible hydration of carbon dioxide) as well as Elane, Prtn3 (important proteases hydrolyze proteins within specialized neutrophil lysosomes as well as proteins of the extracellular matrix) are randomly chosen as example for creating multi-lineage kinetic curves in both of the Olsson and Paul dataset (see plot_multiple_branches_pseudotime function).
In addition, pseudotime dependent genes for the Ery/Meg and GMP branch are identified in the Olsson wildtype dataset. All genes that always have lower expression from both lineages than the average in the progenitor cells are selected. Those genes are used to calculate the stemness score for both of the Olsson and the Paul dataset which is used to color the cells in the tree plot.

Partial Protocol Preview
This section provides a glimpse into the protocol.
The remaining content is hidden due to licensing restrictions, but the full text is available at the following link: Access Free Full Text.

Qiu X., Mao Q., Tang Y., Wang L., Chawla R., Pliner H.A, & Trapnell C. (2017). Reversed graph embedding resolves complex single-cell trajectories. Nature methods, 14(10), 979-982.

Publication 2017

Basophils Carbon dioxide Dendritic cells Eosinophil Erythrocyte Expression genes Extracellular matrix proteins Genes Genes procedure Granulocyte Kinetic Lysosomes proteins Megakaryocyte Monocyte Neutrophil Progenitor cells Proteases Tree

Corresponding Organization :

Other organizations : University of Washington, Shanghai Jiao Tong University, University of Illinois at Chicago

Top 5 similar protocols

Protocol cited in 603 other protocols

Variable analysis

independent variables

Number of dimensions used to reconstruct the developmental trajectory (10 dimensions for Paul dataset, 4 dimensions for Olsson dataset)

dependent variables

Intrinsic dimensions of the developmental trajectory
Number of branch points and terminal lineages revealed
Distribution of percentages of cells in each cluster from the original papers over each segment of the principal graph
Genes significantly bifurcating between Ery/Meg and GMP branch on the Olsson wildtype dataset
Instant log ratios (ILRs) of gene expression between Ery/Meg and GMP branch
Lineage score calculated from genes with mean ILR larger than 0.5
Stemness score calculated from genes that always have lower expression from both lineages than the average in the progenitor cells

control variables

Genes used to order the cells (Paul et al. used genes rather than dpFeature genes)
Top 1,000 genes from dpFeature based on WT cells used in both WT and full datasets

Annotations

Based on most similar protocols

Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.

As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.

About PubCompare

Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.

We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.

However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.

Ready to get started?

Revolutionizing how scientists
search and build protocols!