Skip to main content
Tool Chatbots freemium active 8-8.9
8/10 Strong
Active

Free open-weight downloads / hosted API priced per model

Best plan

Free open-weight downloads / hosted API priced per model

Watch out: Do not generalize from one Qwen checkpoint to the whole family. Benchmark the exact model ID, snapshot, modality route, quantization, serving stack, tool fees, rate limits, data-retention path, promo price, and language mix you plan to use

Try Qwen free

Editorial · no paid placements

The call

Qwen is Alibaba Cloud's fast-moving model family. Pick it for Qwen Studio tests, hosted Qwen3.7-Max or qwen3.7-plus, open-weight Qwen3 deployments, Qwen Cloud / Model Studio inference, multilingual products, and developer control. Skip it if you need a polished consumer assistant like ChatGPT.

  • Buy if Multilingual products across 119 languages
  • Pick Free open-weight downloads / hosted API priced per model
  • Skip if Users wanting a polished consumer chat app

Evidence rail

Why this recommendation is trusted

Source
Registered source
Freshness
Current
Confidence
Medium confidence
Verified
Review
Volatility
Volatile

High-volatility evidence needs frequent review.

Build comparison
Watch out
Do not generalize from one Qwen checkpoint to the whole family. Benchmark the exact model ID, snapshot, modality route, quantization, serving stack, tool fees, rate limits, data-retention path, promo price, and language mix you plan to use.

Editorial score

Unweighted average of 4 axes · confidence high

  • Utility 9/10

    How much real work it can do for a competent operator, end to end.

  • Value 10/10

    What you get for the dollar relative to the closest alternative.

  • Moat 5/10

    How hard it would be for a competitor to replicate the underlying advantage.

  • Longevity 8/10

    How likely the product is to still be best-in-class 24 months out.

Key facts

  1. Best For Developers who want Qwen Studio tests, strong open-weight Qwen3 models, and Alibaba Cloud hosted inference options, especially for multilingual, coding, agentic, multimodal/GUI-agent, and robotics-research evaluation.
    high Volatile 2026-06-22 Qwen official site
  2. Pricing Anchor Hosted API pricing is published through Qwen Cloud / Alibaba Cloud docs and model marketplace pages; docs are representative, qwen3.7-max still shows a 50% promo ending June 22, 2026, qwen3.7-plus shows 20% off display pricing without an official expiry found, and tool calls can add fees.
    high Volatile 2026-06-22 Qwen Cloud pricing docs
  3. Watch Out For Do not generalize from one Qwen checkpoint to the whole family. Benchmark the exact model ID, snapshot, modality route, quantization, serving stack, tool fees, rate limits, data-retention path, promo price, and language mix you plan to use.
    high Volatile 2026-06-22 Qwen3 blog
  4. Model Surface Qwen should be evaluated as a model family with Qwen Studio, Qwen Cloud/API Platform, Apache 2.0 Qwen3 open weights, the June 8 qwen3.7-max snapshot, qwen3.7-plus multimodal/GUI-agent work, specialty Qwen image/audio/video branches, and the June 16 Qwen-Robot Suite research branch rather than a single chatbot product.
    high Volatile 2026-06-22 Qwen Cloud model releases changelog
  5. Latest Hosted Model Qwen Cloud's latest Max changelog entry is qwen3.7-max-2026-06-08, described as adding visual-modal understanding compared with the May 20 snapshot; the live qwen3.7-max marketplace page still presents the public model as text input/output with 1M context, 991.80K max input, 65.53K max output, and built-in tools.
    high Volatile 2026-06-22 Qwen Cloud model releases changelog
  6. Latest Changelog Release Qwen Cloud's latest official model-release changelog entry found on June 22 is qwen3.7-max-2026-06-08, with visual-modal understanding added versus the May 20 Max snapshot; qwen3.7-plus / qwen3.7-plus-2026-05-26 remains the current Plus multimodal interactive hybrid-agent release.
    high Volatile 2026-06-22 Qwen Cloud model releases changelog
  7. Deployment Surface Choose Qwen when Qwen Studio testing, open-weight deployment, regional availability, or Alibaba Cloud integration matters; standard Qwen Cloud models are called directly through the chat completions API, while deployments are needed for fine-tuned custom models.
    medium Volatile 2026-06-22 Qwen official site

Alibaba Cloud’s Qwen family spans Qwen Studio, hosted API access through Qwen Cloud / Alibaba Cloud Model Studio, the hosted Qwen3.7-Max flagship lane, qwen3.7-plus for multimodal/GUI-agent work, and open-weight model releases on Hugging Face and ModelScope.

The practical buyer question is whether your team needs a controllable model family for building, hosting, tuning, or routing AI systems.

The official Qwen3 release includes two open-weight MoE models, Qwen3-235B-A22B and Qwen3-30B-A3B, plus dense models from 0.6B through 32B, all under Apache 2.0. Qwen says Qwen3 supports 119 languages and dialects, hybrid thinking/non-thinking modes, and agentic/coding improvements. The newer Qwen3.7-Max route is a hosted Qwen Cloud model, not part of the Apache 2.0 open-weight Qwen3 release. As of the June 22 source check, Qwen Cloud’s changelog says the June 8 Max snapshot adds visual-modal understanding, while the public qwen3.7-max marketplace page still describes a pure text interface. Hosted inference pricing is published through Qwen Cloud / Alibaba Cloud docs and varies by exact model, context, mode, discount, tool use, and token volume.

Recent developments

  • June 22, 2026: Qwen Cloud model releases, Qwen3.7-Max, Qwen3.7-Plus, Qwen Cloud pricing, Qwen Studio, Qwen3 sources, and Hugging Face Qwen were rechecked. The newest official Qwen Cloud changelog entry remains qwen3.7-max-2026-06-08, listed on June 10 with visual-modal understanding added versus the May 20 Max snapshot, while the live qwen3.7-max marketplace page still describes public experimentation as text-only. Verify modality on the exact endpoint before building around visual input.
  • June 22, 2026: The public qwen.ai surface is now best described as Qwen Studio, not just Qwen Chat. Qwen’s official research page also added the Qwen-Robot Suite on June 16, including Qwen-RobotNav, Qwen-RobotManip, and Qwen-RobotWorld.
  • June 22, 2026: Pricing docs still list qwen3.7-max at $2.50/M input and $7.50/M output, with the 50% promo page expiring June 22. qwen3.7-plus list rates remain $0.40/$1.60 up to 256K and $1.20/$4.80 from 256K-1M, while its model page shows 20% off display pricing without an official expiry found.
  • June 15, 2026: Qwen Cloud model releases, Qwen3.7-Max, Qwen3.7-Plus, Qwen Cloud pricing, the Qwen3.7-Max promo page, Qwen3 sources, and Hugging Face Qwen were rechecked again. No material change was found versus the June 14 refresh.
  • June 6, 2026: Qwen Cloud model releases, Qwen3.7-Max, Qwen3.7-Plus, Qwen Cloud pricing, and Qwen3 sources were rechecked while refreshing Mistral AI vs Qwen. The buyer split is now explicit: Mistral is the EU/open-model/vendor-platform lane; Qwen is the Alibaba/Qwen Cloud, multilingual, qwen3.7-max, qwen3.7-plus, and Qwen3 open-weight lane.
  • May 27, 2026: Alibaba Cloud used its first international Qwen Conference to push Qwen as an agent-cloud platform. The buyer signal is that Qwen is moving beyond model-family benchmarking into Qwen Cloud, Skills, infrastructure upgrades, and enterprise agent tooling.
  • May 27, 2026: Qwen Cloud’s model-release changelog lists qwen3.7-plus / qwen3.7-plus-2026-05-26 as a multimodal interactive hybrid-agent model for screen/GUI perception, code generation from visual references, tool use, productivity workflows, and end-to-end mobile-app navigation.
  • May 22, 2026: Qwen Cloud’s model-release changelog added qwen3.5-livetranslate-flash-realtime and its 2026-05-19 snapshot. It is the newest specialty Qwen release AiPedia found in official sources, aimed at real-time multilingual audio/video translation.
  • May 21, 2026: Qwen Cloud listed qwen3.7-max / qwen3.7-max-2026-05-20 as the next-generation flagship in the Qwen Max series. The official model page shows text input/output, thinking enabled by default, a 1M context window, 991K max input, 65K max output, and list pricing of $2.50/M input and $7.50/M output.
  • May 13, 2026: AiPedia refreshed this page against official Qwen, Alibaba Cloud Model Studio, Hugging Face, and the latest Qwen ecosystem coverage. Model Studio International pricing now lists qwen-max at $1.20/M input (0-32K) and $6.00/M output (down from the May 10 list of $1.60/$6.40), with a new Qwen-Flash tier at $0.10/M input and $0.40/M output. Qwen-Turbo is no longer receiving updates; Qwen-Flash is the recommended replacement.
  • May 11, 2026: Alibaba Qwen and Taobao launched a co-built agentic shopping experience. The integration is the highest-profile production deployment of Qwen agentic capabilities to date, pushing the family from developer-facing model lineup into a consumer-scale commerce surface that touches hundreds of millions of users.
  • April 10, 2026: Vidu Shengshu, the Alibaba-affiliated video-model studio, raised fresh funding. Reinforces that Alibaba’s AI bet now spans Qwen text/code, Qwen-VL, image, video, and embodied stacks, not just the chat model family.
  • April 16, 2026: Third-party coverage reported a Qwen3.6-35B-A3B sparse MoE release. AiPedia is tracking it as a market signal, but this evergreen page keeps the official Qwen3 open-weight line as the buyer-facing baseline until primary source support is clear.
  • April 30, 2026: Alibaba-linked Metis showed an 8B Qwen3-VL-based agent can improve by calling tools less. The HDPO-trained model reduces blind tool calls from 98% to 2% in the project reports, making tool abstention a useful Qwen ecosystem signal.
  • April 19, 2026: Alibaba Amap debuts first embodied robot at Beijing Humanoid Robot Half Marathon. Quadruped from Amap’s new embodied-intelligence division, powered by Alibaba’s ABot-World model (leads AGIbot World Challenge and World Arena benchmarks). Moves Alibaba from Qwen-as-foundation into first-party robotics alongside the model family.

System Verdict

Pick Qwen if you need open-weight models with multilingual reach. Apache 2.0 Qwen3 releases give real commercial flexibility. The official Qwen3 release lists 119-language coverage and model sizes from 0.6B to 235B MoE, making Qwen a strong candidate for multilingual products, local experiments, and custom hosted deployments.

Skip it if you want a polished consumer chat product or strict Western data residency. Qwen Studio is useful for testing, but it is not ChatGPT-grade as a general consumer workspace. Alibaba Cloud is a Chinese provider, which matters for regulated enterprise buyers. Competing open-weight families like DeepSeek may be stronger on specific reasoning or cost benchmarks.

Who uses which surface: Qwen Studio for quick tests, Hugging Face or ModelScope downloads for self-hosters, Alibaba Cloud Model Studio for hosted API use, and third-party gateways only after checking their separate pricing and model availability.

Key Facts

Official open-weight lineQwen3 series under Apache 2.0, from 0.6B dense to 235B MoE
Latest Qwen Cloud Max changelog entryqwen3.7-max-2026-06-08: Max snapshot with visual-modal understanding added versus the May 20 snapshot
Live qwen3.7-max marketplace pagePublic qwen3.7-max page still describes text input/output, thinking enabled by default, 1M context, 991.80K max input, 65.53K max output, built-in tools, 600 RPM, and 1M TPM
Current Plus multimodal/GUI laneqwen3.7-plus-2026-05-26, a multimodal interactive hybrid-agent model for screen/GUI, coding, tool use, productivity, and app-navigation workflows
Latest specialty audio/video releaseqwen3.5-livetranslate-flash-realtime-2026-05-19 for real-time multilingual audio/video translation
Newest official research branchQwen-Robot Suite, Qwen-RobotNav, Qwen-RobotManip, and Qwen-RobotWorld were added to Qwen’s research surface on June 16, 2026
Largest Qwen3 open MoEQwen3-235B-A22B: 235B total parameters, 22B activated
Smaller Qwen3 open MoEQwen3-30B-A3B: 30B total parameters, 3B activated
Dense Qwen3 sizes0.6B, 1.7B, 4B, 8B, 14B, and 32B
Language coverage119 languages, pre-trained on ~36T tokens
ArchitectureHybrid thinking / non-thinking mode switchable
Qwen3 context examples32K on smaller dense models; 128K on Qwen3-8B and larger official Qwen3 models
Hosted API pricingPublished by Alibaba Cloud Model Studio and varies by model/mode/context
Example hosted rateQwen3.7-Max list: $2.50/M input and $7.50/M output; Qwen Cloud page shows a 50% promo rate at $1.25/$3.75 through June 22, 2026
Plus display discountqwen3.7-plus list rates remain in docs, but the model page displays 20% off visible <=256K rates at $0.32/M input and $1.28/M output with no official expiry found
Batch invocation50% off real-time pricing on supported models
Production agent surfaceQwen and Taobao co-built agentic shopping launched May 11, 2026
Agent-cloud pushFirst international Qwen Conference promoted Qwen Cloud, Skills, infrastructure upgrades, and JVS Agent Suite

Qwen3.7-Max, Qwen3.7-Plus, Qwen Cloud pricing, and model-release rows above were verified on 2026-06-22. Older qwen-max examples retain their own source dates in price history. See Sources.

What it actually is

A multi-pronged model family covering several surfaces: Qwen Studio for direct testing, hosted API access through Qwen Cloud / Alibaba Cloud Model Studio, open-weight downloads on Hugging Face and ModelScope, and third-party gateway access where providers choose to carry specific Qwen models.

The family splits into specialists. Core Qwen models handle general chat and reasoning. Qwen3.7-Max is the latest hosted Max lane in Qwen Cloud’s official changelog, while qwen3.7-plus, Qwen-Coder, Qwen-VL, Qwen-Audio, Qwen-Image, LiveTranslate, and QwQ-style reasoning branches appear across the broader ecosystem. Production buyers should verify the exact checkpoint, modality support, license, context window, tool fees, and hosting path before choosing a model.

pricing. Thin-margin cloud pricing combined with open weights gives teams a self-host escape valve most closed-model providers cannot offer.

When to pick Qwen

  • Multilingual products. 119-language training covers Chinese, Japanese, Korean, Arabic, and European languages at higher quality than English-centric families.
  • Self-hosted deployment. Apache 2.0 weights run from single-CPU (0.6B) to 4x A100 (72B dense) to MoE clusters (235B, 480B Coder). No licensing fees.
  • Cost-sensitive API tests. Model Studio publishes per-model token pricing and batch discounts for supported models.
  • Hosted flagship Qwen tests. Qwen3.7-Max gives teams a 1M-context hosted Qwen option for agentic coding, office workflows, and long-horizon execution before deciding whether open-weight Qwen is enough.
  • Balanced hosted multimodal work. Qwen Cloud docs currently recommend qwen3.7-plus as the balanced route and its model page presents multimodal input for image/text/video to text output.
  • Agentic and coding experiments. Qwen3 includes hybrid thinking/non-thinking controls, MCP-oriented examples, and deployment guidance through SGLang and vLLM.
  • Model-family breadth. The Qwen ecosystem spans text, code, vision-language, image, audio, and reasoning branches.
  • IDE and agent backends. Use an OpenAI-compatible local or hosted endpoint after benchmarking the exact model.

When to pick something else

  • Polished consumer chat product: ChatGPT or Claude. qwen.ai is developer-first.
  • Strongest open-weight reasoning: DeepSeek R1 still leads on specific reasoning benchmarks.
  • Strongest English writing: Claude Opus 4.8. Qwen handles English well but trails Claude on nuance.
  • Google Workspace integration: Gemini. Qwen has no Workspace hooks.
  • Open-weight with Huawei Ascend training stack: GLM GLM-5.1 is the closest alternative with domestic-silicon provenance.
  • Broadest plugin marketplace: ChatGPT. No Qwen equivalent to the GPT Store.

Pricing

Hosted pricing via Qwen Cloud pricing docs, Qwen Cloud model pages, and Alibaba Cloud Model Studio. Self-host for free under Apache 2.0 via Hugging Face.

Plan / ModelPriceNotes
Open weights (Hugging Face/ModelScope)Free to downloadApache 2.0 across the official Qwen3 open-weight line; hosting costs are separate
Qwen3 open-weight self-hostingInfrastructure costCost depends on model size, quantization, hardware, throughput, and context length
Alibaba Cloud Model StudioModel-specific token pricingOfficial page lists model, mode, input/output token rates, and free quota where applicable
Qwen3.7-MaxList: $2.50/M input, $7.50/M output; promo page displays $1.25/$3.75 through June 22, 2026Latest Max changelog entry is the June 8 snapshot; live marketplace page still shows text input/output, 1M context, 991.80K max input, 65.53K max output
Qwen3.7-PlusList: $0.40/M input and $1.60/M output up to 256K, $1.20/M input and $4.80/M output from 256K-1M; model page displays 20% off visible <=256K rates at $0.32/$1.28 with no official expiry foundQwen Cloud’s May 27 multimodal/GUI hybrid-agent release
qwen-max example$1.20/M input (0-32K), $6.00/M outputListed on Model Studio’s Qwen-Max International pricing as of May 13, 2026; tiered to $2.40/$12 (32K-128K) and $3/$15 (128K-252K)
qwen-plus$0.40/M input (0-256K), $1.20/M outputLong-context tier: $1.20/M input and $3.60/M output for 256K-1M
Qwen-Flash$0.10/M input, $0.40/M outputNew entry tier; Qwen-Turbo no longer receiving updates
Batch invocation50% off real-timeSupported models only

Qwen3.7-Max and Qwen3.7-Plus pricing verified 2026-06-22 via Qwen Cloud pricing docs, the Qwen3.7-Max model page, and the Qwen3.7-Max promotion page. Qwen Cloud pricing docs list representative models only and point buyers to marketplace model pages for complete current pricing. Built-in tools can add fees: Web Search is listed at $10 per 1,000 calls and Image Search at $8 per 1,000 calls, while Web Extractor and Code Interpreter are marked free for a limited time. Older qwen-max examples were verified 2026-05-13 via Alibaba Cloud Model Studio pricing. Chinese Mainland deployment rates can differ from International tiers. Third-party gateways can be useful, but their rates and model availability are separate from Alibaba’s official pricing.

Against the alternatives

Qwen3 open lineDeepSeekClaudeGLM
Open weightsApache 2.0 Qwen3 checkpointsStrong open-model ecosystemClosed frontier assistant/APIOpen-model Chinese/English ecosystem
Language coverageQwen3 lists 119 languages and dialectsChinese + English focusBroad, English-strong writingChinese + English focus
Hosted APIAlibaba Cloud Model Studio plus gatewaysVendor/gateway dependentAnthropic API and app surfacesVendor/gateway dependent
Consumer polishDeveloper-firstDeveloper-firstStrong Claude appDeveloper-first
Best viewed asOpen-weight multilingual model familyLow-cost reasoning/API rivalWriting/reasoning assistantChinese open-model rival

Failure modes

  • Consumer chat product is minimal. qwen.ai is functional for testing but lacks ChatGPT-grade onboarding, memory, or ecosystem.
  • Data residency on Alibaba Cloud. Enterprise buyers in regulated industries need to evaluate the Chinese-cloud posture. Self-hosting the Apache 2.0 weights is the workaround.
  • Thin moat on open-weight leaderboard. DeepSeek, Kimi, GLM, and Qwen all iterate monthly. Leadership positions shift fast.
  • English documentation lag. Official docs translate from Chinese first. Some resources trail the Chinese original by weeks.
  • Vision models lag the strongest closed models. Qwen-VL and Qwen3.5-Omni are capable but trail the strongest closed vision models on independent evaluations.
  • Hosted API rate limits vary by region. Alibaba Cloud tier and regional load affect throughput. Production deployments should load-test.
  • Pricing is model-specific. Alibaba Cloud Model Studio tables change by model, mode, free quota, context, and batch eligibility.
  • Changelog and marketplace wording can diverge. The June 10 changelog says the June 8 Max snapshot adds visual-modal understanding, while the live qwen3.7-max marketplace page still describes a text-only public interface. Verify the exact route before promising visual input.
  • Latest does not mean open weight. Qwen3.7-Max is a hosted Qwen Cloud flagship route. The Apache 2.0 open-weight buyer case still rests on the official Qwen3 checkpoints.
  • Promos can distort cost comparisons. Qwen Cloud showed a 50% Qwen3.7-Max promotional rate during this refresh; compare on list price unless you are buying during the promo window.
  • Pricing pages can disagree. The representative pricing docs and model pages are both official, but model pages can show temporary display discounts. Recheck the exact page before publishing a cost comparison.
  • Responses API has separate retention behavior. Qwen Cloud says normal API inputs and outputs are not used for training, while linked conversation context for the Responses API is stored for 7 days.

Methodology

This page was produced by the aipedia.wiki editorial pipeline, an automated system that ingests vendor documentation, verifies pricing and model details against primary sources, and generates the editorial analysis you are reading. No individual human wrote this review. Scoring follows the four-dimension rubric at /about/scoring/ (Utility, Value, Moat, Longevity; unweighted average).

Last verified 2026-06-22 against Qwen Cloud model releases, the Qwen3.7-Max model page, Qwen Cloud pricing docs, the Qwen3.7-Max promotion page, Qwen official site, Qwen3 blog, Hugging Face Qwen, Qwen Studio and Qwen research pages, current Qwen Conference coverage, Qwen-Taobao coverage, and tracked Qwen3.6-35B-A3B coverage.

FAQ

Is Qwen open source? Partly. The official Qwen3 open-weight line ships under Apache 2.0 on Hugging Face and ModelScope, covering sizes from 0.6B to 235B MoE. Download, self-host, fine-tune, and deploy commercially under that license, but verify the exact model because not every Qwen-branded surface is open.

What is the main Qwen3 open-weight release? The official Qwen3 release includes two MoE models, Qwen3-235B-A22B and Qwen3-30B-A3B, plus six dense models from 0.6B through 32B. Qwen says the line supports hybrid thinking modes, 119 languages and dialects, and agentic/coding improvements.

What is the latest Qwen model? As of this refresh on June 22, 2026, the latest official Qwen Cloud changelog entry AiPedia found is qwen3.7-max-2026-06-08, listed on June 10 as a Max snapshot with visual-modal understanding added versus the May 20 snapshot. The live qwen3.7-max marketplace page still describes the public model page as text input/output, so buyers should verify the exact route before assuming visual input. The current Plus multimodal/GUI agent lane remains qwen3.7-plus-2026-05-26. None of this changes the buyer-facing fact that the main open-weight line is still Qwen3.

How does Qwen3 compare to Claude? Qwen is more compelling when you need open weights and self-hosting. Claude is usually stronger when you want a polished paid assistant or API for English writing, long-document work, and managed enterprise workflows.

Can I run Qwen locally? Yes. Official Qwen3 sizes start at 0.6B and scale up to 235B MoE. Practical hardware depends on model size, quantization, context length, throughput targets, and serving stack.

Sources

Qwen comparisons

See all →

Reader reviews

Loading…
Share LinkedIn
Was this review helpful?
Embed this score on your site Free. Links back.
Qwen editorial score badge
<a href="https://aipedia.wiki/tools/qwen/" target="_blank" rel="noopener"><img src="https://aipedia.wiki/badges/qwen.svg" alt="Qwen on aipedia.wiki" width="260" height="72" /></a>
[![Qwen on aipedia.wiki](https://aipedia.wiki/badges/qwen.svg)](https://aipedia.wiki/tools/qwen/)

Badge value auto-updates if the editorial score changes. Attribution via the link is required.

Cite this page For journalists, researchers, and bloggers
According to aipedia.wiki Editorial at aipedia.wiki (https://aipedia.wiki/tools/qwen/)
aipedia.wiki Editorial. (2026). Qwen: Editorial Review. aipedia.wiki. Retrieved June 22, 2026, from https://aipedia.wiki/tools/qwen/
aipedia.wiki Editorial. "Qwen: Editorial Review." aipedia.wiki, 2026, https://aipedia.wiki/tools/qwen/. Accessed June 22, 2026.
aipedia.wiki Editorial. 2026. "Qwen: Editorial Review." aipedia.wiki. https://aipedia.wiki/tools/qwen/.
@misc{qwen-editorial-review-2026, author = {{aipedia.wiki Editorial}}, title = {Qwen: Editorial Review}, year = {2026}, publisher = {aipedia.wiki}, url = {https://aipedia.wiki/tools/qwen/}, note = {Accessed: 2026-06-22} }
Spotted an error or want to share your experience with Qwen?

Every tool page is re-verified on a recurring cycle, and corrections land faster when readers flag them directly. If you spot a stale fact, a missing capability, or have used Qwen and want to share what worked or didn't, the editorial desk reviews every message sent through this form.

Email editorial@aipedia.wiki
Report outdated info Help us keep this page accurate