Power of Chinese AI Models

Introduction

After the turmoil that DeepSeek R1 caused in the market, attention has shifted towards China. The West is now looking towards the East, and even those in the East are turning their gaze northward.

I have been tracking these models for some time, so I thought I would summarize them in one place for my readers.

Open source: 🚀

Partially or fully closed source: 🔒

List of Chinese Models

| Developer | Model Series | Models | Features of this Model |
| --- | --- | --- | --- |
| Tsinghua & Fudan University | OpenChineseGPT | OpenChineseGPT 🚀 | Dialogue, instruction-following |
| Tsinghua & Fudan University | OpenBuddy | OpenBuddy 🚀 | Dialogue, instruction-following |
| Tsinghua & Fudan University | OpenChineseLLaMA | OpenChineseLLaMA 🚀 | Dialogue, instruction-following |
| Shanghai AI Lab | Fengshenbang Series | Fengshenbang-13B 🚀, Fengshenbang-7B 🚀 | General-purpose, multilingual |
| IDEA Research | Ziya Series | Ziya-LLaMA 🚀, Ziya-13B 🚀 | Dialogue, instruction-following |
| Tsinghua University | CPM Series | CPM-1 🚀, CPM-2 🚀, CPM-3 🚀 | Early Chinese LLMs |
| Huawei | PanGu | PanGu 🔒 | Large-scale, multilingual |
| Tsinghua & Fudan University | Chinese LLaMA & Alpaca | Chinese LLaMA 🚀, Chinese Alpaca 🚀 | Dialogue, instruction-following |
| Fudan University | MOSS | MOSS 🚀 | Dialogue, general-purpose |
| Zhipu AI | ChatGLM Series | ChatGLM3 🚀, ChatGLM2 🚀, ChatGLM 🚀, GLM-4 🚀 | Chinese dialogue, multi-turn, long-context |
| Alibaba Cloud | Qwen Series | Qwen-1.8B 🚀, Qwen-7B 🚀, Qwen-14B 🚀, Qwen-72B 🚀, Qwen-2.5-1M 🚀 | Multimodal, multilingual, 32K-token context, strong performance on benchmarks |
| Baichuan Intelligent Tech | Baichuan Series | Baichuan-7B 🚀, Baichuan-13B 🚀, Baichuan2 🚀 | High performance, quantized versions available |
| Shanghai AI Lab | InternLM Series | InternLM 🚀, InternLM-Chat 🚀 | General-purpose, long-context |
| 01.AI | Yi Series | Yi-1.0 🚀, Yi-6B 🚀, Yi-34B 🚀 | Multilingual, long-context |
| DeepSeek AI | DeepSeek Series | DeepSeek-V2 🚀, DeepSeek-LLM-67B 🚀, DeepSeek-R1 🚀 | High performance, Chinese & English, advanced reasoning for math and coding |
| Shenzhen Yuanxiang AI | XVERSE Series | XVERSE-7B 🚀, XVERSE-13B 🚀, XVERSE-65B 🚀 | Multilingual, 256K-token context |
| Peking University | YuLan Series | YuLan-Base-126B 🚀, YuLan-Chat-3-126B 🚀 | Multilingual, large-scale pretraining |
| Sichuan AI University | gLAW | LAW 🚀, LAWMiner 🚀, LLAMA 🚀, Fuzz 🚀, Mingcha 🚀 | Specialized for legal tasks |
| Baidu | ERNIE | ERNIE 3.0 Titan 🔒 | Knowledge-enhanced, 260 billion parameters, supports multiple industries |
| ByteDance | Doubao | Doubao 1.5 Pro 🔒 | Claimed to outperform GPT-4o in knowledge retention, coding, and reasoning; optimized for lower hardware costs |
| Tencent | Hunyuan | Hunyuan 🔒 | Supports image and text generation and logical reasoning; aimed at enterprise use |
| Moonshot AI | Kimi | Kimi k1.5 🔒 | Claimed to match or outperform OpenAI o1; focused on solving complex problems |
| SenseTime | SenseNova | SenseNova 🔒 | Includes models for natural language processing, content generation, and data annotation |
| MiniMax | MiniMax-Text | MiniMax-Text-01 🔒 | Large parameter count (456 billion), leads on some benchmarks, large context window |
| Kuaishou | Kling | Kling 🔒 | Text-to-video model, free to the public, simulates real-world motion and physics |
| iFlytek | iFlytek Spark | iFlytek Spark V4.0 🔒 | Improved core capabilities; ranks high in international tests compared with GPT-4 Turbo |
