AI Tools Review

Meituan: LongCat Flash Chat

By Meituan

Released: 2025-09-09

API
LLM
RAG
Meituan
Paid
New

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model from Meituan with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design that reduces communication overhead and achieves high throughput, and it maintains training stability through scaling strategies such as hyperparameter transfer, deterministic computation, and multi-stage optimization.

This release is a non-thinking foundation model optimized for conversational and agentic tasks. It supports context windows of up to 128K tokens (listed on this page as 131k) and shows competitive performance across reasoning, coding, instruction following, and domain benchmarks, with particular strengths in tool use and complex multi-step interactions. Pricing starts at $0.20 per 1M tokens.
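To put the parameter figures in the description in proportion, the following minimal Python sketch computes what share of the 560B total is active for a single input. The names `TOTAL_PARAMS_B`, `ACTIVE_PARAMS_B`, `active_fraction`, and `CONTEXT_TOKENS` are illustrative and not part of any Meituan API. Reading "128K" as 128 × 1024 tokens is an assumption, made because it matches the 131k figure shown on this page.

```python
# Figures quoted in the description (billions of parameters).
TOTAL_PARAMS_B = 560.0
ACTIVE_PARAMS_B = {"min": 18.6, "average": 27.0, "max": 31.3}


def active_fraction(active_b: float, total_b: float = TOTAL_PARAMS_B) -> float:
    """Return the share of total parameters that is active for one input."""
    return active_b / total_b


# Share of the model that is active at the low, average, and high end.
fractions = {
    label: active_fraction(value) for label, value in ACTIVE_PARAMS_B.items()
}

# Assumption: "128K" means 128 * 1024 tokens, which rounds to "131k".
CONTEXT_TOKENS = 128 * 1024

if __name__ == "__main__":
    for label, share in fractions.items():
        print(f"{label}: {share:.1%} of parameters active")
    print(f"context window: {CONTEXT_TOKENS:,} tokens")
```

Under these figures, roughly 3% to 6% of the parameters are active for any one input, which is the basis for the throughput claims usually made for sparse MoE models.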

AI-Powered

Leverages advanced AI technology to deliver cutting-edge capabilities and results.

Fast & Efficient

Optimized performance ensures quick results without compromising on quality.

Purpose-Built

Specifically designed for LLM tasks and workflows.

Meituan Model Timeline

Meituan: LongCat Flash Chat (Current)

131k tokens context

Specifications

Pricing: $0.20 / $0.80 (per 1M tokens)
Context window: 131k tokens
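As a rough illustration of how these specifications translate into a per-request cost, here is a small Python sketch. It rests on three assumptions the page does not confirm: that the first price ($0.20) applies to input tokens and the second ($0.80) to output tokens, that "131k" means 131,072 tokens, and that input and output together must fit in the context window. The function name `estimate_cost` is hypothetical.

```python
# Prices from the Specifications list, in USD per 1M tokens.
# Assumption: the first figure is the input price, the second the output price.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.80

# Assumption: "131k tokens" means 131,072 (128 * 1024) tokens.
CONTEXT_WINDOW = 131_072


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under the assumptions above."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: input and output share the same context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the assumed context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # A long-document request: 100,000 tokens in, 2,000 tokens out.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Under these assumptions, a 100,000-token prompt with a 2,000-token reply costs a little over two cents, so input volume dominates the bill for long-context use.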

AI Evaluation

Expert Rating: 4.8
Text: 4.9/5
Coding: 3.5/5

Alongside conversational use, the model handles programming tasks such as code generation, debugging, and software engineering workflows, though its coding rating (3.5/5) trails its text rating (4.9/5).

Pros

  • Competitive pricing (from $0.20 per 1M tokens)
  • 131k token context window
  • Large-scale 560B architecture
  • Strong code generation and debugging

Cons

  • Requires substantial compute
  • May lack creative flair
  • Speed/quality trade-off