RDDL: A systematic ensemble pipeline tool that streamlines balancing training schemes to reduce the effects of data imbalance in rare-disease-related deep-learning applications

Citation DataComputational Biology and Chemistry, ISSN: 1476-9271, Vol: 106, Page: 107929

Publication Year2023

2
Citations
0
Usage
8
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
2
- Citation Indexes
  2
Captures
8
- Readers
  8

Article Description

Identifying lowly prevalent diseases, or rare diseases, in their early stages is key to disease treatment in the medical field. Deep learning techniques now provide promising tools for this purpose. Nevertheless, the low prevalence of rare diseases entangles the proper application of deep networks for disease identification due to the severe class-imbalance issue. In the past decades, some balancing methods have been studied to handle the data-imbalance issue. The bad news is that it is verified that none of these methods guarantees superior performance to others. This performance variation causes the need to formulate a systematic pipeline with a comprehensive software tool for enhancing deep-learning applications in rare disease identification. We reviewed the existing balancing schemes and summarized a systematic deep ensemble pipeline with a constructed tool called RDDL for handling the data imbalance issue. Through two real case studies, we showed that rare disease identification could be boosted with this systematic RDDL pipeline tool by lessening the data imbalance problem during model training. The RDDL pipeline tool is available at https://github.com/cobisLab/RDDL/.

Bibliographic Details

DOI10.1016/j.compbiolchem.2023.107929

PMID37517206

URL IDhttp://www.sciencedirect.com/science/article/pii/S1476927123001202; http://dx.doi.org/10.1016/j.compbiolchem.2023.107929; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85166289704&origin=inward; http://www.ncbi.nlm.nih.gov/pubmed/37517206; https://linkinghub.elsevier.com/retrieve/pii/S1476927123001202; https://dx.doi.org/10.1016/j.compbiolchem.2023.107929

AUTHOR(S)

Yang, Tzu-Hsien; Liao, Zhan-Yi; Yu, Yu-Huai; Hsia, Min

PUBLISHER(S)

Elsevier BV

TAG(S)

Biochemistry, Genetics and Molecular Biology; Chemistry; Mathematics

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know