Systematic Evaluation of Trajectory Inference Algorithms
We undertook a systematic evaluation of the performance of Palantir in comparison to widely-used trajectory inference algorithms such as Monocle2, Diffusion Pseudotime (DPT), Partition based Graph Abstraction (PAGA- based on DPT), Slingshot, FateID, and Monocle 2. We first compared the algorithms by evaluating their setup: the prior biology knowledge required as input and the diversity of outputs provided by each algorithm using the following criteria: Supplementary Fig. 17a summarizes the characteristics of the different algorithms according to the criteria outline above: Thus, Palantir uses minimal a priori biological information to (a) automatically determine the different terminal states, (b) generate a unified pseudo-time ordering to compare gene expression trends across lineages and (c) identify continuous branch probabilities and differentiation potential for each cell. We next used the CD34+ human bone marrow data (replicate 1) as a benchmark to compare the results of the different algorithms. Due to the varied nature of the different outputs, we evaluated the ability of the algorithm to determine known and well established features of human hematopoiesis such as (a) identification of the different lineages represented in the data, with emphasis on less frequent populations such as megakaryocytes, cDCs and pDCs, which are more subtle and challenging to infer (b) recovering known expression trends of key genes across multiple lineages. We choose well-studied canonical genes across the different lineages, whose expression dynamics are known and can thus serve as ground truth. The following canonical genes, representing a broad spectrum of gene expression dynamics, were chosen for this evaluation: Supplementary Fig. 17b shows the results of this comparison for the different algorithms. Palantir and DPT were able to identify the megakaryocyte lineages, whereas PAGA and Slingshot included these cells to be part of the erythroid lineage. Palantir was the only algorithm able to recover the distinction between the two DC lineages. Comparing the expression trends, all algorithms except Monocle 2 recovered the downregulation of CD34 across all lineages. Palantir recovers the known gene expression trends across all lineages (Fig. 2). While PAGA, DPT and Slingshot identify the trends in the larger lineages, PAGA (and DPT) suffer from a loss in resolution in gene expression trends and Slingshot does not provide a unified ordering of cells to compare gene expression trends across lineages. FateID with the default clustering using RaceID failed to identify any correct lineages and gene expression trends, whereas FateID with a preprocessing procedure and clustering followed in Palantir identifies correct expression trends in only the monocyte and CLP lineages. Monocle 2 could not recover the key hematopoietic lineages or expression trends from the CD34+ bone marrow data. See Supplementary Note 6 for a detailed description of the different algorithms and their performance.
Partial Protocol Preview
This section provides a glimpse into the protocol. The remaining content is hidden due to licensing restrictions, but the full text is available at the following link:
Access Free Full Text.
Setty M., Kiseliovas V., Levine J., Gayoso A., Mazutis L, & Pe’er D. (2019). Characterization of cell fate probabilities in single-cell data with Palantir. Nature biotechnology, 37(4), 451-460.
Partition based Graph Abstraction (PAGA-based on DPT)
Slingshot
FateID
Monocle 2
dependent variables
Ability to determine known and well-established features of human hematopoiesis
Identification of different lineages represented in the data, with emphasis on less frequent populations such as megakaryocytes, cDCs and pDCs
Recovery of known expression trends of key genes across multiple lineages
control variables
Control variables not explicitly mentioned.
positive controls
None specified.
negative controls
None specified.
Annotations
Based on most similar protocols
The protocols use a variety of trajectory inference algorithms to analyze single-cell RNA-seq data, including Palantir, Monocle2, Diffusion Pseudotime (DPT), Partition based Graph Abstraction (PAGA), Slingshot, and FateID (Protocols 1, 2, 3).
Palantir is highlighted as a particularly robust algorithm that can automatically determine terminal states, generate a unified pseudotime ordering, and identify continuous branch probabilities and differentiation potential for each cell, using minimal a priori biological information (Protocol 1).
The protocols use the CD34+ human bone marrow data (replicate 1) as a benchmark to compare the performance of the different algorithms, evaluating their ability to identify known lineages, including less frequent populations, and recover known gene expression trends (Protocol 1).
For the Palantir algorithm, the protocols describe the use of batch-corrected matrices, PCA, and diffusion components to infer trajectories in young and elderly conditions, as well as the selection of initial states and terminal states (Protocol 2).
The protocols also describe the use of the diffusion pseudotime algorithm for trajectory inference, with the selection of 20 diffusion map components and a k-nearest neighbor graph (k=10) to determine edge weights based on pairwise Euclidean distances between cells (Protocol 3).
Etiam vel ipsum. Morbi facilisis vestibulum nisl. Praesent cursus laoreet felis. Integer adipiscing pretium orci. Nulla facilisi. Quisque posuere bibendum purus. Nulla quam mauris, cursus eget, convallis ac, molestie non, enim. Aliquam congue. Quisque sagittis nonummy sapien. Proin molestie sem vitae urna. Maecenas lorem.
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to
get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required