PlumX Metrics
Embed PlumX Metrics

Domain-adaptive entity recognition: unveiling the potential of CSER in cybersecurity and beyond

International Journal of Machine Learning and Cybernetics, ISSN: 1868-808X
2024
  • 0
    Citations
  • 0
    Usage
  • 2
    Captures
  • 1
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

  • Captures
    2
  • Mentions
    1
    • News Mentions
      1
      • 1

Most Recent News

Investigators from University of Tsukuba Report New Data on Engineering (Domain-adaptive Entity Recognition: Unveiling the Potential of Cser In Cybersecurity and Beyond)

2024 DEC 06 (NewsRx) -- By a News Reporter-Staff News Editor at Japan Daily Report -- New research on Engineering is the subject of a

Article Description

In the dynamic fields of cybersecurity, precise recognition and identification of cybersecurity-related entities in textual data have become crucial. Existing studies on Named Entity Recognition (NER) in the cybersecurity domain often overlook challenges posed by data sparsity and the substantial presence of Out-of-Vocabulary (OOV) tokens in Cyber Treat Intelligence (CTI) reports. To tackle these challenges, we introduce the Cybersecurity Entity Recognition (CSER) model—a comprehensive approach crafted to handle CTI data complexities and similar intricacies across other domains. The CSER model integrates output from contextual, semantic, and morphological encoders to form a robust feature vector, capturing nuanced patterns, buzzwords, and structural attributes specific to cybersecurity entities. In particular, we employ various deep-learning approaches to capture morphological and contextual features, while pre-trained embeddings are utilized to capture semantic features. Additionally, Conditional Random Field (CRF) is employed as a sequential decoder, enhancing the effectiveness of cybersecurity entity identification. Extensive experiments on genuine cybersecurity datasets reveal that the proposed CSER model surpasses contemporary state-of-the-art methods, demonstrating superior predictive performance. To validate the effectiveness of this model, experiments are extended to datasets from biomedical and material science domains, providing comprehensive insights into the model’s adaptability across diverse domains. Our research demonstrates that the CSER model excels in domains with frequent OOV tokens, particularly cybersecurity, addressing data sparsity effectively. Its capability to manage a substantial volume of OOV tokens enhances performance where traditional models struggle.

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know