logo

Blog

  • Blog

  • Dataset

  • About

  • Resources

    • Paper

Contact Us
Contact Us
logo
  • Blog

  • Dataset

  • About

  • Resources

    Paper

Blog/Featured

Featured Content

Blog cover

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines - Exploring the Real Proficiency Boundaries of LLM

Blog cover

Proudly Announce Abaka AI Join 2077AI Community as Core Founding Contributor

Blog cover

FormalMATH Benchmark: A Formal Mathematics Benchmark for Pushing the Limits of AI

Blog cover

Breaking Traditional Knowledge Dependency: KOR-Bench for Evaluating Intrinsic Reasoning Abilities of Models

Blog cover

A Novel Paradigm for Model Evaluation: The Innovative Multi-source Document Parsing Evaluation Framework OmniDocBench

Latest
Content

 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines - Exploring the Real Proficiency Boundaries of LLM

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines - Exploring the Real Proficiency Boundaries of LLM

Proudly Announce Abaka AI Join 2077AI Community as Core Founding Contributor

Proudly Announce Abaka AI Join 2077AI Community as Core Founding Contributor

FormalMATH Benchmark: A Formal Mathematics Benchmark for Pushing the Limits of AI

FormalMATH Benchmark: A Formal Mathematics Benchmark for Pushing the Limits of AI

Breaking Traditional Knowledge Dependency: KOR-Bench for Evaluating Intrinsic Reasoning Abilities of Models

Breaking Traditional Knowledge Dependency: KOR-Bench for Evaluating Intrinsic Reasoning Abilities of Models

A Novel Paradigm for Model Evaluation: The Innovative Multi-source Document Parsing Evaluation Framework OmniDocBench

A Novel Paradigm for Model Evaluation: The Innovative Multi-source Document Parsing Evaluation Framework OmniDocBench

2077AI

Join Us In Shaping The Future Of AI
Contact Us
Contact Us
2077AI ©2025Join Us in Shaping the Future of AI