AI Builders Digest

← Back to home

Chinese AI

25 items tagged with this topic

Recent

Chinese ModelsfromXJun 24

btw Zai IPO'ed in Jan at HK$120 a share. when I first met @louszbd nobody really knew anyone us…

Custom AI Chinese AI

Chinese ModelsfromTogether AI BlogJun 17

Kimi K2.7 Code vs Claude Fable 5: Landing pages that cost 94% less

We generated 12 landing pages with Kimi K2.7 Code and Claude Fable 5. Kimi cost 94% less and scored within a few points on every page. Here's what actually moved the needle.

Coding Chinese AI

Chinese ModelsfromXJun 18

Bernie Sanders introduced a bill to seize 50% of any AI startup that crosses $200M in revenue.…

Older

Chinese Modelsfrom MiniMax News

MiniMax Hailuo 02, World-Class Quality, Record-Breaking Cost Efficiency - MiniMax News | MiniMax

MiniMax Hailuo 02 launches with NCR architecture innovation. Native 1080p generation, SOTA instruction following, extreme physics mastery. 370M videos generated, ranked #2 globally on Artificial Analy

Chinese Modelsfrom XMay 24, 2026

Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model…

Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model this afternoon. Fast usable multimodal is also going to enable very mind-blowing personal AI. https://t.co/mm3laZb766

Chinese Modelsfrom Together AI BlogMay 11, 2026

Serving DeepSeek-V4: why million-token context is an inference systems problem

DeepSeek-V4 makes million-token context a serving-systems problem. Together AI explores the inference work behind V4 on NVIDIA HGX B200, including compressed KV layouts, prefix caching, kernel maturity, and endpoint profiles for long-conte…

Chinese Modelsfrom DeepSeek NewsApr 24, 2026

DeepSeek V4 Preview Release | DeepSeek API Docs

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

Chinese Modelsfrom Latent Space NewsletterApr 25, 2026

[AINews] DeepSeek V4 Pro (1.6T-A49B) and Flash (284B-A13B), Base and Instruct — runnable on Huawei Ascend chips

The prodigal Tiger returns... but is no longer the benchmarks leader.

Chinese Modelsfrom Latent Space NewsletterApr 21, 2026

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Yay Kimi!!!

Chinese Modelsfrom MiniMax NewsApr 23, 2026

A Deep Dive into the MiniMax-M2-her - MiniMax News | MiniMax

Chinese Modelsfrom MiniMax NewsApr 23, 2026

MiniMax M2.1: Significantly Enhanced Multi-Language Programming, Built for Real-World Complex Tasks - MiniMax News | MiniMax

Chinese Modelsfrom MiniMax NewsApr 23, 2026

MiniMax M2 & Agent: Ingenious in Simplicity - MiniMax News | MiniMax

Chinese Modelsfrom MiniMax NewsApr 23, 2026

MiniMax Speech 2.8: Breathing life into AI voice - MiniMax News | MiniMax

Chinese Modelsfrom MiniMax NewsApr 23, 2026

MiniMax Speech 2.6: The Ultimate Voice Agent Has Arrived - MiniMax News | MiniMax

Chinese Modelsfrom MiniMax NewsApr 18, 2026

MiniMax Speech 2.5 Launches: Enhanced Multilingual Expressiveness Exceptional Voice Cloning Fidelity - MiniMax News | MiniMax

Chinese Modelsfrom Qwen BlogSep 22, 2025

Qwen3Guard: Real-time Safety for Your Token Stream

Tech Report GitHub Hugging Face ModelScope DISCORD Introduction We are excited to introduce Qwen3Guard, the first safety guardrail model in the Qwen family. Built upon the powerful Qwen3 foundation models and fine-tuned specifically for sa…

Chinese Modelsfrom Qwen BlogAug 18, 2025

Qwen-Image-Edit: Image Editing with Higher Quality and Efficiency

QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD We are excited to introduce Qwen-Image-Edit, the image editing version of Qwen-Image. Built upon our 20B Qwen-Image model, Qwen-Image-Edit successfully extends Qwen-Image’s unique text…

Chinese Modelsfrom Qwen BlogAug 4, 2025

Qwen-Image: Crafting with Native Text Rendering

GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD We are thrilled to release Qwen-Image, a 20B MMDiT image foundation model that achieves significant advances in complex text rendering and precise image editing. To try the latest model, feel fre…

Chinese Modelsfrom Qwen BlogJul 27, 2025

GSPO: Towards Scalable Reinforcement Learning for Language Models

PAPER DISCORD Introduction Reinforcement Learning (RL) has emerged as a pivotal paradigm for scaling language models and enhancing their deep reasoning and problem-solving capabilities. To scale RL, the foremost prerequisite is maintaining…

Chinese Modelsfrom Qwen BlogJul 24, 2025

Qwen-MT: Where Speed Meets Smart Translation

DEMO API DISCORD Introduction Here we introduce the latest update of Qwen-MT (qwen-mt-turbo) via Qwen API. This update builds upon the powerful Qwen3, leveraging trillions multilingual and translation tokens to comprehensively enhance the…

Chinese Modelsfrom DeepSeek News

DeepSeek-V3.2 Release | DeepSeek API Docs

🚀 Launching DeepSeek-V3.2 & DeepSeek-V3.2-Speciale — Reasoning-first models built for agents!

Chinese Modelsfrom DeepSeek News

Introducing DeepSeek-V3.2-Exp | DeepSeek API Docs

🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!

Chinese Modelsfrom DeepSeek News

DeepSeek-V3.1-Terminus | DeepSeek API Docs

🚀 DeepSeek-V3.1 → DeepSeek-V3.1-Terminus

Chinese Modelsfrom DeepSeek News

DeepSeek-V3.1 Release | DeepSeek API Docs

Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀

Chinese Modelsfrom DeepSeek News

DeepSeek-R1-0528 Release | DeepSeek API Docs

🚀 DeepSeek-R1-0528 is here!