PlumX Metrics
Embed PlumX Metrics

Evaluation on ChatGPT for Chinese Language Understanding

Data Intelligence, ISSN: 2641-435X, Vol: 5, Issue: 4, Page: 885-903
2023
  • 14
    Citations
  • 0
    Usage
  • 43
    Captures
  • 0
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

  • Citations
    14
    • Citation Indexes
      14
  • Captures
    43

Article Description

ChatGPT has attracted extension attention of academia and industry. This paper aims to evaluate ChatGPT in Chinese language understanding capability on 6 tasks using 11 datasets. Experiments indicate that ChatGPT achieved competitive results in sentiment analysis, summary, and reading comprehension in Chinese, while it is prone to factual errors in closed-book QA. Further, on two more difficult Chinese understanding tasks, that is, idiom fill-in-the-blank and cants understanding, we found that a simple chain-of-thought prompt can improve the accuracy of ChatGPT in complex reasoning. This paper further analyses the possible risks of using ChatGPT based on the results. Finally, we briefly describe the research and development progress of our ChatBIT.

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know