Overview of model performance
- Training and model configuration
The ECPICK, HIT-EC, and CLEAN models were all trained on the same curated dataset. For each of the three models, we adopted the hyperparameters exactly as specified in its original publication. Note that no separate validation or test set was held out during training.
- Performance evaluation
To evaluate the models, we calculated the micro-averaged and macro-averaged F1 scores. Although the evaluation data were included in the training process, the performance metrics were computed only on samples belonging to classes shared between the original databases (Swiss-Prot, PDB, KEGG) and the curated dataset.
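
The two averages can differ substantially when classes are imbalanced, so a small sketch may help make the distinction concrete. The code below is an illustration only, not the evaluation script used for the reported numbers. It assumes each sample carries a set of EC-number labels, that labels outside the shared classes are ignored, and that samples with no true label in the shared classes are skipped. The function name `micro_macro_f1` and the toy EC numbers are hypothetical.

```python
from collections import Counter


def micro_macro_f1(y_true, y_pred, classes):
    """Return (micro_f1, macro_f1) restricted to `classes`.

    y_true, y_pred: lists of label sets, one set per sample.
    classes: labels to score; all other labels are ignored (assumption).
    """
    classes = set(classes)
    tp, fp, fn = Counter(), Counter(), Counter()
    for true, pred in zip(y_true, y_pred):
        true, pred = set(true) & classes, set(pred) & classes
        if not true:
            continue  # assumption: skip samples with no label in the shared classes
        for c in true & pred:
            tp[c] += 1
        for c in pred - true:
            fp[c] += 1
        for c in true - pred:
            fn[c] += 1

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    # Micro: pool the counts over all classes, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in classes) / len(classes) if classes else 0.0
    return micro, macro


# Toy data (hypothetical): only two classes appear in both sources.
curated_classes = {"1.1.1.1", "2.7.11.1", "3.4.21.4"}
original_classes = {"1.1.1.1", "2.7.11.1", "4.2.1.1"}
shared = curated_classes & original_classes

y_true = [{"1.1.1.1"}, {"1.1.1.1"}, {"1.1.1.1"}, {"2.7.11.1"}, {"3.4.21.4"}]
y_pred = [{"1.1.1.1"}, {"1.1.1.1"}, {"1.1.1.1"}, {"1.1.1.1"}, {"3.4.21.4"}]

micro, macro = micro_macro_f1(y_true, y_pred, shared)
print(f"micro-F1 = {micro:.3f}, macro-F1 = {macro:.3f}")
```

On this toy input the micro-averaged F1 is 0.75 while the macro-averaged F1 is about 0.43, because the single misclassified sample belongs to a rare class that counts as much as the frequent one in the macro average. The last sample is excluded entirely, since its only label is not among the shared classes.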