The sensitivity and specificity for both conventionally acquired and SyMRI-generated T1 and T2-weighted contrasts were calculated separately based on the evaluation of experienced and less experienced raters. Furthermore, total values for sensitivity and specificity were calculated on the basis of the assessment of all raters.
An interrater reliability was calculated using the Cohen coefficient (Cκ) and the Fleiss coefficient (Fκ). The Cκ was used to detect concordances between the assessment of raters 1 and 2 (experienced) and raters 3 and 4 (less experienced). The Fκ was used to detect the concordances of the assessment of all raters. According to Landis and Koch, κ was interpreted as follows: κ ≤ 0: poor agreement; 0 < κ ≤ 0.2: slight agreement; 0.2 < κ ≤ 0.4: fair agreement; 0.4 < κ ≤ 0.6: moderate agreement; 0.6 < κ ≤ 0.8: substantial agreement; 0.8 < κ ≤ 1: (almost) perfect agreement [30 (link)].
The determined values were complemented by the corresponding 95% confidence interval (CI).