2 Accuracy measures the proportion of correctly classified instances among all instances. Precision focuses on the correctness of positive predictions, while recall measures the ability to identify actual positive cases. Both F1-score and Area Under the Receiver Operating Characteristic Curve (AUROC) are composite metrics that combine aspects of precision and recall to evaluate the performance of models. A detailed explanation of these metrics is provided in Section 3.7