Skip to content
@CLUEbenchmark

CLUE benchmark

Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard

Pinned Loading

  1. CLUE CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 4.1k 546

  2. SuperCLUE SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    3.1k 103

  3. SuperCLUE-Safety SuperCLUE-Safety Public

    SC-Safety: 中文大模型多轮对抗安全基准

    127 9

  4. SuperCLUE-Auto SuperCLUE-Auto Public

    汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测

    33 3

  5. SuperCLUE-Agent SuperCLUE-Agent Public

    SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准

    83 3

  6. SuperCLUE-RAG SuperCLUE-RAG Public

    中文原生检索增强生成测评基准

    112 3

Repositories

Showing 10 of 51 repositories
  • Math24o Public

    Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark

    CLUEbenchmark/Math24o’s past year of commit activity
    Python 7 0 0 0 Updated Mar 20, 2025
  • 2024h1 Public

    中文大模型基准测评2024上半年度报告,Report of LLMs in Chinese, First Half of 2024

    CLUEbenchmark/2024h1’s past year of commit activity
    1 0 1 0 Updated Jul 9, 2024
  • SuperCLUE-Video Public

    中文原生多层次文生视频测评基准

    CLUEbenchmark/SuperCLUE-Video’s past year of commit activity
    17 1 0 0 Updated Jul 8, 2024
  • SuperCLUE-V Public

    中文原生多模态理解测评基准(测评方案)

    CLUEbenchmark/SuperCLUE-V’s past year of commit activity
    3 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Long Public

    中文原生长文本测评基准

    CLUEbenchmark/SuperCLUE-Long’s past year of commit activity
    5 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Image Public

    中文原生文生图测评基准

    CLUEbenchmark/SuperCLUE-Image’s past year of commit activity
    8 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Coder Public

    中文原生代码助手测评基准,产品级

    CLUEbenchmark/SuperCLUE-Coder’s past year of commit activity
    0 0 0 0 Updated Jul 8, 2024
  • SuperCLUElyb Public

    SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准

    CLUEbenchmark/SuperCLUElyb’s past year of commit activity
    145 6 3 1 Updated Jun 19, 2024
  • SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    CLUEbenchmark/SuperCLUE’s past year of commit activity
    3,127 103 36 0 Updated May 23, 2024
  • CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    CLUEbenchmark/CLUE’s past year of commit activity
    Python 4,090 546 78 2 Updated May 23, 2024

Most used topics

Loading…