The standard MobileNetV2 model extracts 62,720 numerical values with shape 7 × 7 × 1,280 of the CT scan image; these features are used as an input to the classification models. MobileNetV2 model with the fine-tuned model on top of it extracts 128 numerical values of the CT scan image for the classification process, as listed in
Several metrics can evaluate the performance of a classifier. In the following, we explain the utilized metrics for evaluating the proposed classifier. Besides, we mean by True Positive (TP) the outcome where the model correctly predicts the positive class, e.g., positive COVID-19 patients diagnosed as COVID-19 (+). A True Negative (TN) is a results where the model predicts the negative class flawlessly, e.g., a CT scan image of COVID-19 patient is diagnosed as a negative COVID-19 (-) case. A False Positive (FP) case where the model incorrectly forecasts the positive class, e.g., patients suffering from other lung diseases and incorrectly classified as COVID-19 (+). False Negative is a case where the model incorrectly forecasts the negative class, e.g., patients infected by COVID-19 (+) and incorrectly classified as COVID-19 (-). Accuracy is defined as the fraction of correct predictions (both True Positive and True Negative).
The first evaluation metric is precision; where can it can be defined as the rate of TP outcomes to the total number of positive outcomes (TP + FP), as shown in
The second evaluation metric is the recall; it is the ratio of TP outcomes to the actual number of positive samples (TP + FN), as shown in
The third evaluation metric is the F1-score; it is the harmonic mean of precision and recall, as shown in
The fourth evaluation metric is the specificity; It is the ratio of TN outcomes to the total negative outcomes (TP + FN) of the model, as shown in
The fifth evaluation metric is the accuracy; It is the ratio of correct outcomes (TP + TN) to the total number of outcomes, as shown in