| Dataset | Dataset 1 | Dataset 2 |
| Samples | 7034 | 71,047 |
| Features | 20 | 57 |
| Classes | 2 | 2 |
| Missing values % | 0.0% | 0.7% |
| negative samples | 1869 (73.46%) | 20,609 (29.01%) |
| Data sources | IBM Watson [14] [15] | Cell2cell [16] |