Meituan: LongCat Flash Chat

By Meituan

Released: 2025-09-09

API

LLM

RAG

Meituan

Paid

New

Meituan LongCat Flash Chat (LongCat-Flash-Chat) is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce communication overhead and achieve high throughput, and it maintains training stability through scaling strategies such as hyperparameter transfer, deterministic computation, and multi-stage optimization. This release is a non-thinking foundation model optimised for conversational and agentic tasks. It supports context windows of up to 128K tokens (131,072 tokens, listed on this page as 131k) and shows competitive performance across reasoning, coding, instruction following, and domain benchmarks, with particular strengths in tool use and complex multi-step interactions. Pricing starts at $0.20 per 1M tokens.
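To put the parameter counts in the description into proportion, the short sketch below works out what share of the 560B total is active for a given input. The figures come from the description; the constant and function names are illustrative only and are not part of any Meituan tooling.

```python
# Figures quoted in the description, in billions of parameters.
TOTAL_PARAMS_B = 560.0
ACTIVE_MIN_B = 18.6
ACTIVE_AVG_B = 27.0
ACTIVE_MAX_B = 31.3


def active_fraction(active_b: float, total_b: float = TOTAL_PARAMS_B) -> float:
    """Return the share of total parameters that is active (0.0 to 1.0)."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


if __name__ == "__main__":
    for label, value in [
        ("minimum", ACTIVE_MIN_B),
        ("average", ACTIVE_AVG_B),
        ("maximum", ACTIVE_MAX_B),
    ]:
        share = active_fraction(value)
        print(f"{label}: {value}B of {TOTAL_PARAMS_B:.0f}B = {share:.1%}")

    # 128K tokens in binary units is the "131k" shown elsewhere on the page.
    print(f"128K tokens = {128 * 1024:,} tokens")
```

On these numbers only about 3% to 6% of the model's parameters take part in any single forward pass, which is the usual motivation for an MoE design.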

AI-Powered

Leverages advanced AI technology to deliver cutting-edge capabilities and results.

Fast & Efficient

Optimized performance ensures quick results without compromising on quality.

Purpose-Built

Specifically designed for LLM tasks and workflows.

Meituan Model Timeline

9 Sept 2025

Meituan: LongCat Flash Chat (Current)

131k tokens context

Specifications

Pricing: $0.20 / $0.80 (per 1M tokens)

Context window: 131k tokens
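The listed prices can be turned into a rough per-request cost estimate. The sketch below assumes that the first figure ($0.20) is the input-token price and the second ($0.80) is the output-token price; the page does not label the two numbers, so that reading is an assumption. It also treats the 131k context window as 131,072 tokens. All names are hypothetical and not part of any official SDK.

```python
# Assumption: "$0.20 / $0.80 (per 1M)" means input price / output price.
INPUT_PRICE_PER_M_USD = 0.20
OUTPUT_PRICE_PER_M_USD = 0.80

# Assumption: the "131k" context window is 128K = 131,072 tokens.
CONTEXT_WINDOW_TOKENS = 128 * 1024


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens * INPUT_PRICE_PER_M_USD
        + output_tokens * OUTPUT_PRICE_PER_M_USD
    ) / 1_000_000


def fits_in_context(prompt_tokens: int) -> bool:
    """Check whether a prompt alone fits inside the context window."""
    return 0 <= prompt_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    cost = estimate_cost_usd(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")  # prints "Estimated cost: $0.0216"
    print(f"Fits in context: {fits_in_context(100_000)}")
```

Actual billing depends on the provider's tokenizer and on how it counts output tokens against the context window, so treat the result as an estimate only.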

AI Evaluation

4.8

Expert Rating

Text: 4.9/5

Coding: 3.5/5

Suited to programming tasks such as code generation, debugging, and software engineering workflows, although its coding rating (3.5/5) trails its text rating (4.9/5).

Pros

Competitive pricing (from $0.20 per 1M tokens)
131k token context window
Large-scale 560B architecture
Strong code generation and debugging

Cons

Requires substantial compute
May lack creative flair
Speed/quality trade-off

Related Tools

FLUX

FLUX, from Black Forest Labs, is a family of high-quality open and commercial image generation models prized for photorealism and prompt adherence. Widely integrated across third-party tools and APIs, it has become a default backbone for image generation.

Claude Opus 4.8

Claude Opus 4.8 is Anthropic's June 2026 flagship model, succeeding Opus 4.7. It posts a headline score of 81 on the hardest agentic coding and reasoning suites, holds long-horizon tool-use plans together across far more steps, and is notably more candid about its own uncertainty - refusing to fabricate rather than confidently pressing on. It is the default choice for serious agentic and software-engineering workloads.

Anthropic Mythos 1

Mythos 1 is the full release of Anthropic's specialised, security-focused system, graduating from the guarded Claude Mythos Preview. Built around Project Glasswing, it pairs frontier reasoning with defensive security tooling - vulnerability analysis, triage and incident response - deployed with heavy oversight and auditability. It runs alongside the general-purpose Claude family rather than replacing it.