Proteins models used in the present work.
Protein | AA1 | MSA2 | pLDDT3 | IUPRED23 | RMSF (Å)3 | PCC6 | Slope6 | Int.6 |
---|---|---|---|---|---|---|---|---|
a. LanM | 133 | 1832 | 83.9 ± 19.1 | 0.39 ± 0.16 | 5.8 ± 3.3 | − 0.84 | − 4.9 | 113 |
b. DeHa4 | 300 | 1890 | 96.3 ± 6.9 | 0.25 ± 0.14 | 0.9 ± 0.9 | − 0.94 | − 7.2 | 103 |
c. PAS-A Domain | 108 | 1138 | 81.4 ± 16.3 | 0.20 ± 0.09 | 1.0 ± 0.7 | − 0.65 | − 15.5 | 97 |
d. AFP Type III | 66 | 1080 | 96.4 ± 5.7 | 0.20 ± 0.07 | 0.7 ± 0.7 | − 0.97 | − 8.4 | 103 |
e. GNE | 722 | 5273 | 93.2 ± 11.4 | 0.21 ± 0.12 | 3.0 ± 1.1 | − 0.75 | − 9.6 | 105 |
f. PAS-Kinase | 1323 | 8644 | 52.9 ± 27.5 | 0.43 ± 0.25 | 5.0 ± 3.9 | − 0.63 | − 4.0 | 77 |
g. inaZ | 1200 | 2050 | 88.6 ± 16.5 | 0.41 ± 0.07 | 3.8 ± 3.2 | − 0.65 | − 3.3 | 101 |
h. Heterodimer4: PAS-A, kinase | 108 287 | 1138 1908 | 89.5 ± 13.0 | 0.14 ± 0.10 | 1.3 ± 0.7 | − 0.65 | − 11.7 | 110 |
i. Homodimer5: MtMerR | 146 146 | 1825 1825 | 89.3 ± 13.9 | 0.36 ± 0.13 | 3.8 ± 2.5 | − 0.66 | − 3.7 | 103 |
j. NVJP-1 | 388 | 0 | 43.2 ± 5.3 | 0.84 ± 0.13 | 10.2 ± 2.4 | − 0.03 | − 0.1 | 44 |
k. Randomized | 237 | 0 | 32.4 ± 6.2 | 0.28 ± 0.19 | 2.1 ± 1.1 | − 0.12 | − 0.7 | 34 |
1Number of amino acid residues.
2The MSA hits from the BFD3 (Big Fantastic Database). The MSA hits include those that match the protein partial segments.
3Mean ± SD for per-residue pLDDT, IUPRED2 and RMSF values.
4Two chains of the heterodimer are PAS-A (108 AA) and kinase (287 AA) domain sequences, respectively.
5Both chains of the homodimer have the same sequence of 146 AA.
6The Pearson’s correlation coefficient (PCC) between pLDDT and RMSF scores, the slope and intercepts of the linear fitting between them are also listed; note that as pLDDT and the AF2 scores in this work are anticorrelated, and the PCC values are the negative of those shown in the Figures.