inclusionai

Access 3 inclusionai models through the OpenRouter unified API including Ring-2.6-1T, Ling-2.6-1T, and Ling-2.6-flash. Compare pricing, context windows, benchmarks, and capabilities between different inclusionai models.

inclusionai tokens processed on OpenRouter

inclusionAI: Ring-2.6-1TRing-2.6-1T
16.5B tokens
Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool use, and long-horizon task execution, delivering leading results on benchmarks including PinchBench, ClawEval, TAU2-Bench, and GAIA2-search. With adaptive reasoning effort across high and xhigh modes, Ring-2.6-1T dynamically allocates reasoning budget based on task complexity. This enables stronger performance with lower token overhead, especially in tool-heavy and multi-turn agent workflows. Ring-2.6-1T is designed for advanced coding agents, complex reasoning pipelines, and large-scale autonomous systems where execution quality, latency, and cost efficiency all matter.
by inclusionaiMay 8, 2026262K context$0.075/M input tokens$0.625/M output tokens

inclusionai

Access 3 inclusionai models through the OpenRouter unified API including Ring-2.6-1T, Ling-2.6-1T, and Ling-2.6-flash. Compare pricing, context windows, benchmarks, and capabilities between different inclusionai models.

inclusionai tokens processed on OpenRouter

inclusionAI: Ring-2.6-1TRing-2.6-1T
16.5B tokens
Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool use, and long-horizon task execution, delivering leading results on benchmarks including PinchBench, ClawEval, TAU2-Bench, and GAIA2-search. With adaptive reasoning effort across high and xhigh modes, Ring-2.6-1T dynamically allocates reasoning budget based on task complexity. This enables stronger performance with lower token overhead, especially in tool-heavy and multi-turn agent workflows. Ring-2.6-1T is designed for advanced coding agents, complex reasoning pipelines, and large-scale autonomous systems where execution quality, latency, and cost efficiency all matter.
by inclusionaiMay 8, 2026262K context$0.075/M input tokens$0.625/M output tokens

inclusionAI: Ling-2.6-1TLing-2.6-1T

1.64B tokens

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast thinking” approach to reduce costs to roughly a quarter of comparable models while maintaining top-tier performance. The model achieves state-of-the-art results on benchmarks such as AIME26 and SWE-bench Verified, and is well suited for advanced coding, complex reasoning, and large-scale agent workflows where both capability and efficiency are critical.

by inclusionaiApr 23, 2026262K context$0.075/M input tokens$0.625/M output tokens

inclusionAI: Ling-2.6-flashLing-2.6-flash

110B tokens

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency. It delivers performance comparable to state-of-the-art models at a similar scale while significantly reducing token usage across coding, document processing, and lightweight agent workflows.

by inclusionaiApr 21, 2026262K context$0.01/M input tokens$0.03/M output tokens