Log in to download the datasets



Overview model performances

Description
  • Training and model configuration
    The ECPICK, HIT-EC, and CLEAN models were all trained using the same curated dataset. For the three models, we adopted the hyperparameters exactly as specified in their original publication. It is important to note that no separate validation or test sets were used during the training phase.
  • Performance evaluation
    To evaluate the models, we calculated the micro-averaged F1 score and macro-averaged F1 score. Although the evaluation data was included in the training process, the performance metrics were strictly based on samples from the overlapping classes between the original databases (Swiss-Prot, PDB, KEGG) and the curated dataset.