PlumX Metrics

A novel two-way rebalancing strategy for identifying carbonylation sites

BMC Bioinformatics, ISSN: 1471-2105, Vol: 24, Issue: 1, Page: 429
2023
  • Citations: 1
  • Usage: 0
  • Captures: 4
  • Mentions: 1
  • Social Media: 0

Metrics Details

  • Citations: 1
  • Captures: 4
  • Mentions: 1
    • News Mentions: 1
      • News: 1

Most Recent News

Wuhan University Reports Findings in Bioinformatics (A novel two-way rebalancing strategy for identifying carbonylation sites)

2023 NOV 27 (NewsRx) -- By a News Reporter-Staff News Editor at Information Technology Daily -- New research on Biotechnology - Bioinformatics is the subject

Article Description

Background: As an irreversible post-translational modification, protein carbonylation is closely related to many diseases and to aging. Predicting protein carbonylation for affected patients is important because it can help clinicians choose appropriate treatment plans. Because carbonylation sites can indicate a change or loss of protein function, integrating protein carbonylation site data is a promising basis for prediction, and several prediction methods built on these data have been proposed. However, most of the data are highly class-imbalanced: un-carbonylation sites greatly outnumber carbonylation sites. Existing methods have not addressed this issue adequately.

Results: In this work, we propose a novel two-way rebalancing strategy, Carsite_AGan, based on the attention technique and a generative adversarial network, for identifying protein carbonylation sites. Specifically, Carsite_AGan introduces an attention-based undersampling method: the attention technique assigns an importance value to each sample, and the un-carbonylation sites with high importance values are selected. Meanwhile, Carsite_AGan uses a generative adversarial network-based oversampling method, in which the generator and discriminator together produce high-feasibility carbonylation sites. Finally, a classifier such as a nonlinear support vector machine identifies protein carbonylation sites.

Conclusions: Experimental results demonstrate that our approach significantly outperforms other resampling methods. Resampling carbonylation data with our approach can significantly improve the identification of protein carbonylation sites.
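The two-way rebalancing idea in the abstract (shrink the majority class by importance, grow the minority class with synthetic samples) can be illustrated with a small sketch. This is not the authors' Carsite_AGan implementation, and every name in it is hypothetical. Two stand-ins are assumed: a softmax over scaled dot products against the minority-class centroid replaces the learned attention scores, and noisy interpolation between minority samples replaces the generative adversarial network's generator.

```python
import math
import random


def attention_scores(majority, query):
    """Softmax over scaled dot products between each majority sample and a query.

    Assumption: a fixed query vector stands in for a learned attention module.
    """
    scale = math.sqrt(len(query))
    logits = [sum(x * q for x, q in zip(s, query)) / scale for s in majority]
    top = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def undersample(majority, query, keep):
    """Keep the `keep` majority samples with the highest importance scores."""
    scores = attention_scores(majority, query)
    ranked = sorted(range(len(majority)), key=lambda i: scores[i], reverse=True)
    return [majority[i] for i in ranked[:keep]]


def oversample(minority, target, rng, noise=0.05):
    """Grow the minority class to `target` samples.

    Assumption: noisy interpolation between two random minority samples is a
    placeholder for a trained GAN generator, not a GAN itself.
    """
    synthetic = []
    while len(minority) + len(synthetic) < target:
        a, b = rng.choice(minority), rng.choice(minority)
        t = rng.random()
        synthetic.append(
            [ai + t * (bi - ai) + rng.gauss(0, noise) for ai, bi in zip(a, b)]
        )
    return minority + synthetic


def rebalance(majority, minority, size, seed=0):
    """Return features X and labels y with up to `size` samples per class."""
    rng = random.Random(seed)
    dim = len(minority[0])
    # Assumption: the minority centroid serves as the attention query.
    query = [sum(s[j] for s in minority) / len(minority) for j in range(dim)]
    negatives = undersample(majority, query, size)
    positives = oversample(minority, size, rng)
    X = negatives + positives
    y = [0] * len(negatives) + [1] * len(positives)
    return X, y


# Toy data: 200 majority samples and 20 minority samples in two dimensions.
data_rng = random.Random(1)
majority = [[data_rng.gauss(0, 1), data_rng.gauss(0, 1)] for _ in range(200)]
minority = [[data_rng.gauss(2, 0.5), data_rng.gauss(2, 0.5)] for _ in range(20)]

X, y = rebalance(majority, minority, size=50)
print(y.count(0), y.count(1))  # prints "50 50"
```

The rebalanced `X` and `y` would then be passed to a classifier; the abstract names a nonlinear support vector machine as one option, which is omitted here to keep the sketch dependency-free.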

Bibliographic Details

Chen, Linjun; Jing, Xiao-Yuan; Hao, Yaru; Liu, Wei; Zhu, Xiaoke; Han, Wei

Springer Science and Business Media LLC

Biochemistry, Genetics and Molecular Biology; Computer Science; Mathematics
