Research on Text Classification and Topic Extraction Model of Medical English Corpus Based on Natural Language Processing
2024 IEEE 6th International Conference on Power, Intelligent Computing and Systems, ICPICS 2024, Page: 727-731
2024
Metric Options: CountsSelecting the 1-year or 3-year option will change the metrics count to percentiles, illustrating how an article or review compares to other articles or reviews within the selected time period in the same journal. Selecting the 1-year option compares the metrics against other articles/reviews that were also published in the same calendar year. Selecting the 3-year option compares the metrics against other articles/reviews that were also published in the same calendar year plus the two years prior.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Conference Paper Description
In the medical field, a large amount of professional literature stores a wealth of knowledge, and effectively organizing and classifying this information is of great significance for medical research and practice. This paper proposes a deep learning-based text classification and topic extraction model for processing medical English corpora. We first obtained a corpus of 1000 medical texts, and then used natural language processing techniques to preprocess these texts. We propose a convolutional neural network (CNN) based text classification model that is capable of automatically learning and extracting features from text. Meanwhile, we utilize Latent Dirichlet Allocation (LDA) model for topic extraction. On the test set, our model outperforms other common text classification models such as Naive Bayes, Support Vector Machines, and Logistic Regression in both accuracy and F1-score. The research in this paper provides an effective method for the automatic classification and topic extraction of medical English texts, which has wide practicability and application value.
Bibliographic Details
Institute of Electrical and Electronics Engineers (IEEE)
Provide Feedback
Have ideas for a new metric? Would you like to see something else here?Let us know