EnzymeX - DataX Lab

Models & Date	Swiss-Prot	PDB	KEGG
07-01-2026	SwissProt_07-01-2026 0	PDB_07-01-2026 0	KEGG_07-01-2026 0
04-01-2026	SwissProt_04-01-2026 7	PDB_04-01-2026 6	KEGG_04-01-2026 5
01-03-2026	SwissProt_01-03-2026 11	PDB_01-03-2026 4	KEGG_01-03-2026 5
10-01-2025	SwissProt_10-01-2025 8	PDB_10-01-2025 3	KEGG_10-01-2025 5
07-02-2025	SwissProt_07-02-2025 7	PDB_07-02-2025 4	KEGG_07-02-2025 4
04-01-2025	SwissProt_04-01-2025 4	PDB_04-01-2025 1	KEGG_04-01-2025 1
01-01-2025	SwissProt_01-01-2025 4	PDB_01-01-2025 3	KEGG_01-01-2025 2

Overview of model training performance

Training and model configuration
The ECPICK, HIT-EC, and CLEAN models were all trained on the same curated dataset. For each model, we adopted the hyperparameters exactly as specified in its original publication. Note that no separate validation or test set was held out during training.
Performance evaluation
To evaluate the models, we calculated the micro-averaged and macro-averaged F1 scores. The evaluation data was also included in the training process, so these scores describe performance on data the models have already seen. The metrics were computed only on samples belonging to classes that appear in both the original database (Swiss-Prot, PDB, or KEGG) and the curated dataset.
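
The following is a minimal sketch of how such an evaluation could be computed. It is not the lab's actual pipeline. It assumes multi-label predictions represented as sets of EC numbers, and it interprets "overlapping classes" as the intersection of the class sets of one database and the curated dataset. The function name `f1_scores` and all example labels are hypothetical. Micro-averaging pools true positives, false positives, and false negatives over all classes before computing F1, while macro-averaging computes F1 per class and takes the unweighted mean, so rare classes weigh as much as common ones.

```python
from collections import Counter


def f1_scores(y_true, y_pred, classes):
    """Return (micro_f1, macro_f1) restricted to `classes`.

    y_true, y_pred: lists of label sets, one set per sample.
    Labels outside `classes` are ignored (assumed handling of
    non-overlapping classes; the original method may differ).
    """
    classes = set(classes)
    tp, fp, fn = Counter(), Counter(), Counter()
    for true, pred in zip(y_true, y_pred):
        true, pred = set(true) & classes, set(pred) & classes
        for c in true & pred:
            tp[c] += 1
        for c in pred - true:
            fp[c] += 1
        for c in true - pred:
            fn[c] += 1

    def f1(t, p, n):
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        denom = 2 * t + p + n
        return 2 * t / denom if denom else 0.0

    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in classes) / len(classes) if classes else 0.0
    return micro, macro


# Hypothetical class sets for one database and the curated dataset.
database_classes = {"1.1.1.1", "2.7.11.1", "3.4.21.4"}
curated_classes = {"1.1.1.1", "2.7.11.1", "4.2.1.1"}
overlap = database_classes & curated_classes

# Hypothetical ground truth and predictions, one label set per sample.
y_true = [{"1.1.1.1"}, {"1.1.1.1"}, {"2.7.11.1"}, {"2.7.11.1"}, {"3.4.21.4"}]
y_pred = [{"1.1.1.1"}, {"1.1.1.1"}, {"1.1.1.1"}, {"2.7.11.1"}, {"3.4.21.4"}]

micro, macro = f1_scores(y_true, y_pred, overlap)
print(f"micro F1 = {micro:.3f}, macro F1 = {macro:.3f}")
# prints: micro F1 = 0.750, macro F1 = 0.733
```

In this toy example the last sample contributes nothing because its class is absent from the curated dataset. The two averages differ because class 2.7.11.1 has a lower per-class F1 (2/3) than class 1.1.1.1 (0.8).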