Inside DeepSeek’s Industrial Turn
DeepSeek’s transformation is not the end of its technical idealism. It is the moment when technical idealism fully meets the industrial cost of frontier AI.
The real signal of DeepSeek V4 is not that China has released another strong model. It is that a technically idealistic company is being reshaped by the real cost of frontier AI, domestic compute ecosystems, talent competition, and product segmentation.
This essay is part of the series What China’s Industry Media Is Really Talking About.
After the release of DeepSeek V4, what Chinese industry media focused on was not simply another model benchmark being refreshed. From the outside, DeepSeek can easily be written as a familiar technology story: a Chinese team builds a stronger model at lower cost, an open-source company challenges Silicon Valley again, and a new release triggers another wave of excitement around China’s AI sector. But over the past month, Chinese industry media has provided a more valuable signal. What it captured is that DeepSeek is moving from a “model miracle” into a far more complex industrial stage. Financing, compute, talent, product segmentation, multimodal trade-offs, domestic chip adaptation, cloud deployment, and developer-framework migration are all entering DeepSeek’s core story at the same time.
36Kr’s Intelligent Emergence, in its article DeepSeek V4 Has Just Been Released: Five Subjective Questions Still Need Answers, breaks the issue down in granular detail: Why did DeepSeek begin fundraising? Why does V4 still hold back on multimodal capabilities? Why is the product side beginning to split into fast mode and expert mode? Why are outside expectations for V4 no longer just about “more parameters” or “stronger capability”? The value of this report is that it does not treat DeepSeek V4 as a single product release. It places the model inside the real pressures now facing a frontier model company: larger models require more compute, longer competition requires more stable capital, top talent requires stronger organizational support, and productization requires clearer user segmentation.
Another piece, The Turn Behind DeepSeek V4 and Liang Wenfeng, by 36Kr’s Baobian, pushes this change one level deeper. The report links the release of DeepSeek V4 with Liang Wenfeng’s “turn,” focusing less on model capability itself and more on how DeepSeek is moving away from its earlier heavy reliance on the CUDA ecosystem, adapting V4 comprehensively to Huawei Ascend 950PR, and shifting toward the CANN framework. This is not an ordinary technical adaptation detail. From CUDA to CANN, what is really happening is that DeepSeek is beginning to move from being part of the existing global AI toolchain into China’s own compute base, development framework, and industrial ecosystem.
This is where DeepSeek V4 becomes worth analyzing. It is not an isolated model event. It is a set of industrial signals appearing at the same time. In the past, outside discussions of DeepSeek often focused on labels such as “low cost,” “open source,” “engineering efficiency,” and “China’s OpenAI challenger.” But recent Chinese industry-media coverage shows that DeepSeek is entering another stage. It remains a model company, but it can no longer be only a model company. It still carries the color of technical idealism, but the real cost of frontier AI is forcing it to become more capitalized, more productized, more engineering-intensive, and more embedded inside China’s AI industrial system.
1. From Model Miracle to Industrial Node
DeepSeek’s first stage was the “model miracle.” Its second stage is the “industrial node.” In the R1 era, DeepSeek was easy for the outside world to understand as a technological raid: lower training cost, stronger reasoning ability, higher engineering efficiency, and enough impact to force global capital markets to reassess the capability frontier of Chinese AI. At that moment, DeepSeek was a highly communicable story: a small team, a strong model, low cost, open source, and a challenger breaking through against giants. This narrative had strong dramatic force and matched the global technology discourse’s preference for “dark horse” companies.
After V4, however, Chinese industry media is no longer seeing the same story. When 36Kr’s Intelligent Emergence frames V4 around “five subjective questions,” it is effectively pointing out that DeepSeek’s problem has shifted from “Can it build a strong model?” to “Can it organize long-term frontier model competition?” These are very different questions. Producing one strong model requires algorithmic ability, engineering ability, data capability, and talent density. But sustaining frontier AI competition requires capital structure, compute supply, talent incentives, product systems, commercialization paths, and ecosystem positioning.
The brutal reality of frontier AI is that the stronger the capability becomes, the less its cost can be romanticized. The larger the model, the higher the training cost. The longer the context window, the heavier the inference burden. The larger the user base, the greater the pressure on service stability and compute scheduling. The stronger the open-source influence, the higher the probability that talent will be recruited away by competitors. What made DeepSeek so attractive in the past was that it looked like a sample of technical idealism pushing back against capital-heavy, big-company, Silicon Valley-style bloat. But after V4, Chinese industry media began discussing financing, talent, valuation, and product segmentation. This means the company has been pulled into the long war of frontier AI.
This does not mean DeepSeek’s idealism has disappeared. More accurately, DeepSeek is moving from technical idealism into industrial realism. Technical idealism allowed it to challenge existing paths, explore different engineering solutions to reduce cost, open-source aggressively, and organize high-density innovation with a small team. But once frontier model competition becomes a sustained investment cycle, idealism alone cannot solve long-term compute supply, talent retention, or product service costs. Frontier AI is not merely a research project. It is an industrial system that continuously burns capital, compute, and organizational capacity.
That is why the real meaning of DeepSeek V4 is not whether it “shocked the market again.” It is that it reveals a deeper transition: Chinese AI companies are moving from model-release competition into industrial-organization competition. Whoever can integrate model capability, domestic compute, cloud platforms, development frameworks, enterprise scenarios, talent organization, and capital supply will be better positioned to survive the next stage.
2. Once Financing Opens, Technical Idealism Enters the Stage of Industrial Cost
36Kr’s Intelligent Emergence discussion of DeepSeek’s financing is the key entry point for understanding the company’s change. One of DeepSeek’s most important public images in the past was that it did not rely on traditional large-scale fundraising, and that it achieved high-level model breakthroughs with a relatively restrained capital structure. This image carried strong symbolic meaning in China’s technology community, because it seemed to prove that a technical team could build world-class models without following Silicon Valley-style financing races, big-company resource accumulation, or capital-market storytelling.
After V4, however, the financing issue can no longer be avoided. The report notes that DeepSeek’s opening of a financing window is not driven merely by ordinary commercialization needs, but by a combination of pressures: training larger models, retaining top talent, filling compute-resource gaps, and sustaining a long-term technical roadmap. This means DeepSeek is not raising capital because it has abandoned its ideals. It is raising capital because continuing to pursue frontier model capability itself requires heavier industrial resources.
This point is extremely important. Outside observers often interpret fundraising as the beginning of a company’s “commercialization” or “capitalization.” But in frontier AI, financing is first and foremost a cost-structure issue. Large models are not traditional internet products. Search, social media, e-commerce, gaming, and local services could tell early-stage stories around user growth and network effects. Frontier model companies first face training clusters, inference compute, engineering talent, data processing, model evaluation, safety alignment, product service, and API stability. Revenue is not yet fully mature, but costs are already deeply industrialized.
DeepSeek once appeared special because, for a long time, it allowed the outside world to believe that engineering efficiency could significantly offset capital consumption. That judgment was not wrong. One of DeepSeek’s major contributions was to prove that frontier model competition does not have to follow only one route of endlessly stacking GPUs. But engineering efficiency can only change the cost curve. It cannot abolish the cost curve. As model capability keeps advancing, as the user base continues to expand, and as the product side moves from experimental scenarios into real service systems, DeepSeek eventually has to face the industrial ledger of frontier AI.
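The cost-curve argument can be made concrete with a back-of-the-envelope compute-cost model. The 6ND FLOPs rule of thumb for dense-model training is a standard community estimate, but every other number below — dollar figures, efficiency factors, model and dataset sizes — is an invented illustration, not DeepSeek’s actual economics:

```python
# Back-of-the-envelope sketch: engineering efficiency shifts the cost curve
# but does not remove its growth. All figures are illustrative assumptions.

def training_cost(params_b: float, tokens_t: float,
                  dollars_per_exaflop: float, efficiency: float) -> float:
    """Rough training-cost model using the standard ~6*N*D FLOPs estimate.

    params_b: parameters in billions; tokens_t: training tokens in trillions.
    efficiency: relative engineering-efficiency multiplier (higher = cheaper).
    """
    flops = 6 * (params_b * 1e9) * (tokens_t * 1e12)  # ~6ND rule of thumb
    exaflops = flops / 1e18
    return exaflops * dollars_per_exaflop / efficiency

# Doubling efficiency halves the bill at a fixed scale...
base = training_cost(70, 2, dollars_per_exaflop=10_000, efficiency=1.0)
tuned = training_cost(70, 2, dollars_per_exaflop=10_000, efficiency=2.0)
# ...but a 10x larger model on 2x the tokens still needs 20x the compute,
# so the tuned run at the new scale costs 20x the tuned run at the old one.
bigger = training_cost(700, 4, dollars_per_exaflop=10_000, efficiency=2.0)
print(round(base), round(tuned), round(bigger))
```

The sketch states the essay’s point in arithmetic: a 2x efficiency gain is a one-time constant factor, while scale grows multiplicatively with every generation, so efficiency buys time on the curve rather than an exit from it.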
The talent question also becomes sharper at this stage. 36Kr’s Intelligent Emergence mentions DeepSeek’s talent mobility and competitive pressure from major technology companies. This detail matters. The rarest asset of a frontier AI company is not office space, brand recognition, or even a single model release. It is extremely high-density talent organization. A small team can produce a surprising model because of talent density and engineering culture. But once the company’s effectiveness has been proven by the market, its talent becomes a target for every major platform and capital-backed competitor. Open source brings reputation. Reputation brings talent attraction. But it also raises the risk that talent will be recruited away.
Financing, therefore, is not evidence that DeepSeek has become “impure.” It is more like an industrial watershed. If DeepSeek were only a research organization, it could remain light-capital, low-commercialization, and strongly technical in culture. But if it wants to participate in global frontier model competition for the long term, it must have stable capital, sustained compute, talent incentives, and a product-operation system. DeepSeek’s turn is the inevitable passage from a laboratory-style miracle into industrialized competition.
3. Holding Back on Multimodality Is Not Retreat but Strategic Sequencing Under Resource Constraints
After the release of DeepSeek V4, one important question stands out: Why does V4 still focus mainly on language models rather than fully committing to multimodal capabilities? In the global AI narrative, multimodality has almost become the default direction for frontier models. OpenAI, Google, Anthropic, Meta, and China’s major platforms are all strengthening image, audio, video, voice interaction, and tool-use capabilities. If DeepSeek holds back on multimodality, it can easily be read from the outside as a capability gap.
But 36Kr’s Intelligent Emergence offers a more valuable angle: this is not simply a capability issue. It is a strategic sequencing issue. Multimodality is not just adding a few functional modules on top of a language model. It is another, much heavier system of data, compute, training pipelines, and product infrastructure. Image, video, voice, and cross-modal understanding require different types of data cleaning, labeling, synthesis, evaluation, and safety mechanisms. They also require more complex training infrastructure. For a company that is still expanding quickly while also facing compute and capital constraints, multimodality is not a slogan. It is a resource-allocation choice.
This shows that DeepSeek has entered a more mature industrial stage. Early model companies can describe the future by saying they want everything. Once they enter frontier competition, every capability has to correspond to real cost. Whether to prioritize multimodality is not only a technical-path decision. It is a combined trade-off among capital, compute, talent, service cost, and product strategy. DeepSeek’s choice to keep strengthening language models, long context, reasoning ability, coding ability, and agentic coding at the V4 stage suggests that it is not simply being led around by external hype.
Holding back on multimodality does not mean DeepSeek underestimates multimodality. It may mean DeepSeek understands more clearly which capability boundary it should consolidate first. Language models remain the foundation for reasoning, coding, agents, enterprise knowledge processing, complex question answering, and long-text tasks. For DeepSeek, deepening language-model capability, extending context windows, reducing inference cost, stabilizing product experience, and adapting well to domestic compute may matter more than rushing into the image and video generation war.
This is also an important change emerging in China’s AI industry: from chasing hot capabilities to sequencing roadmaps according to resource constraints and industrial position. Chinese internet companies used to rush collectively into every new track after it appeared. But the cost density of frontier AI is far higher than that of ordinary internet products. Every company has to answer the same question: Where should limited resources be invested? Multimodality, reasoning, coding, agents, enterprise services, foundation models, or application loops?
DeepSeek’s choice provides a signal: China’s AI competition is moving from “who releases more functions” to “whose capability sequencing is clearer.” If DeepSeek in the R1 era proved that Chinese teams could make global impact in reasoning models, then DeepSeek in the V4 era shows that frontier AI companies cannot rely only on technical bursts. They must also learn how to make strategic trade-offs under resource constraints.
4. Product Segmentation Shows DeepSeek Is Becoming a User System, Not Just a Model Company
36Kr’s Intelligent Emergence pays particular attention to DeepSeek’s product-side changes, and this is worth emphasizing. DeepSeek has begun to introduce fast mode and expert mode. On the surface, this may look like a product-interface adjustment. Behind it, however, is an inevitable shift that happens when a model company enters real user systems. Once a model moves from technical demonstration into large-scale use, it can no longer place all users, all tasks, all costs, and all response-speed requirements behind the same entrance.
The emergence of fast mode and expert mode means DeepSeek is beginning to organize user experience according to task complexity and resource consumption. Ordinary questions require low latency, low cost, and quick response. Complex questions require longer reasoning, stronger context handling, better tool use, and more stable output. Mixing the two together wastes compute and damages user experience. Product segmentation is essentially a rematching of model capability, inference cost, and user demand.
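The rematching of capability, cost, and demand can be sketched as a tiny request router. Everything here is hypothetical — DeepSeek has not published its routing rules, and the mode names, cost numbers, threshold, and keyword heuristic are invented for illustration:

```python
# Hypothetical sketch of fast/expert mode routing. Mode parameters and the
# complexity heuristic are illustrative assumptions, not DeepSeek's design.

from dataclasses import dataclass

@dataclass
class Mode:
    name: str
    cost_per_1k_tokens: float  # relative cost units (illustrative)
    max_reasoning_steps: int

FAST = Mode("fast", cost_per_1k_tokens=1.0, max_reasoning_steps=1)
EXPERT = Mode("expert", cost_per_1k_tokens=8.0, max_reasoning_steps=32)

def estimate_complexity(prompt: str) -> float:
    """Crude heuristic: long prompts and reasoning keywords score higher."""
    score = min(len(prompt) / 2000, 1.0)
    for kw in ("prove", "debug", "step by step", "analyze"):
        if kw in prompt.lower():
            score += 0.3
    return min(score, 1.0)

def route(prompt: str, threshold: float = 0.5) -> Mode:
    """Send cheap, simple queries to fast mode; heavy tasks to expert mode."""
    return EXPERT if estimate_complexity(prompt) >= threshold else FAST

print(route("What time is it in Beijing?").name)
print(route("Prove this invariant and debug the code step by step.").name)
```

Even this toy version shows why the split is an economic decision as much as a UX one: every request sent to the expert path multiplies inference cost, so the router is effectively a pricing and compute-scheduling policy wearing a product interface.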
This shows that DeepSeek’s competitive dimension is changing. Early large-model companies mainly demonstrate “what the model can do,” meaning the capability frontier. Once they enter the product stage, the key question becomes: for which task, at what cost, for which user, and with what experience? Model capability becomes a real user system only when it is organized into product structure.
This change is easy for outside coverage to miss because it is less exciting than benchmarks, parameter counts, valuation rumors, or investor speculation. But from an industrial perspective, it is critical. If DeepSeek only releases strong models, it remains a technology company. If it begins to adjust product structure according to task complexity, response speed, inference cost, and user scenarios, it is moving toward a platform-like AI service system. The model is no longer merely a backend capability. It is being packaged, scheduled, segmented, and priced as a service.
Product segmentation also means that DeepSeek will find it harder to preserve its earlier, singular public image. A unified entrance, free or low-cost access, and a model available to everyone are highly effective for building a public technology myth. But a DeepSeek that truly enters a long-term service stage must distinguish among ordinary users, professional users, developers, enterprise customers, API customers, and ecosystem partners. Different users correspond to different compute costs and different service value. DeepSeek’s product segmentation is one of the first visible signals that it is moving from a public technology event into a commercial service system.
This is why DeepSeek’s “turn” should not be understood as a sudden change. It is more like an inevitable evolution. Strong models bring users. Users bring costs. Costs require segmentation. Segmentation drives productization. Productization then requires capital, compute, and ecosystem support. Once this chain begins, DeepSeek is no longer just a model team. It starts becoming a complex AI service organization.
5. From CUDA to CANN, the Larger Signal of V4 Is the Closing Loop of China’s AI Stack
The observation by 36Kr’s Baobian that DeepSeek V4 has adapted to Huawei Ascend 950PR and shifted from CUDA toward the CANN framework is one of the most important industrial signals in this story. Outside observers usually look at DeepSeek first through model capability. Chinese industry media more sensitively asks what compute ecosystem this model is being placed into. The key point of V4 is not only “what model did DeepSeek build?” It is also “which Chinese AI industrial system is this model beginning to run on?”
CUDA has long been the default infrastructure of global AI development. It is not merely a software tool. It is an entire system of development habits, operator libraries, ecosystem experience, engineering paths, and talent formation built around Nvidia GPUs. Any company that wants to train and deploy large models efficiently finds it difficult to bypass CUDA. It is one of Nvidia’s deepest moats and one of the most underestimated forms of infrastructure power in the global AI race.
DeepSeek V4’s comprehensive adaptation to Huawei Ascend 950PR and migration toward CANN, therefore, is not an isolated technical move. It represents an attempt by China’s AI industry to connect strong models to a domestic compute foundation. The difficulty here is not just whether a chip can provide computing power. It also includes whether the toolchain is stable, whether the operator libraries are mature, whether the development framework is usable, whether model adaptation is efficient, whether the cloud platform can carry the workload, and whether enterprise applications are willing to connect. The real challenge of China’s domestic AI stack is not replacing one chip with another chip. It is whether models, chips, frameworks, cloud, and applications can work together.
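Why a toolchain migration is harder than a chip swap can be illustrated with a toy operator-dispatch table. The backend names and coverage sets below are invented assumptions for the sketch, not a description of real CANN or Ascend operator coverage:

```python
# Illustrative sketch: a model runs efficiently on a backend only if each of
# its operators has a mature native kernel there. Coverage sets are invented,
# not real CUDA or CANN operator inventories.

MATURE_BACKEND_OPS = {"matmul", "softmax", "flash_attention", "rope", "moe_dispatch"}
NEW_BACKEND_OPS = {"matmul", "softmax", "rope"}  # assumed partial coverage

def plan_execution(model_ops: list[str], backend_ops: set[str]) -> dict[str, str]:
    """Map each operator to a native kernel or a slow generic fallback."""
    return {op: ("native" if op in backend_ops else "fallback")
            for op in model_ops}

model = ["matmul", "flash_attention", "softmax", "moe_dispatch"]
print(plan_execution(model, NEW_BACKEND_OPS))
# Every "fallback" entry is an engineering task: someone must write, tune,
# and validate a native kernel before the model runs efficiently there.
```

This is the essay’s point in miniature: raw chip throughput is only one row of the ledger. The migration is finished not when the hardware exists but when the dispatch table is full — which is why a frontier model like V4 adapting to Ascend and CANN matters as an ecosystem event.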
This is the meaning of DeepSeek V4’s Ascend adaptation. DeepSeek is not an ordinary application company. Its model capability is strong enough, its industry influence is large enough, and its developer attention is high enough. If such a model can run efficiently on domestic compute and domestic frameworks, the signal is much bigger than a small company doing a technical adaptation. It tells the whole industry that China’s domestic AI stack is not only a policy slogan. It can become an engineering path capable of carrying frontier model deployment.
This does not mean China’s domestic AI stack is already fully mature. Official technology commentary, including coverage in Science and Technology Daily, has noted that domestic chips still face many problems in toolchains, operator libraries, and system coordination. This judgment matters. Serious industrial analysis cannot turn domestic adaptation into a victory declaration. China’s domestic compute ecosystem still faces challenges in performance, stability, developer convenience, software maturity, and ecosystem inertia. CUDA’s advantages cannot be fully erased in the short term.
But this is precisely where V4 matters. It does not declare that China’s domestic AI stack has already completed substitution. It shows that China’s AI stack is iterating under real frontier-model workloads. Industrial capability does not mature through slogans. It matures through real tasks, real pressure, real failures, and real applications. DeepSeek V4’s adaptation to Ascend and CANN essentially pushes China’s domestic compute ecosystem into a higher-pressure engineering field.
6. Chinese Industry Media Is Seeing Full-Stack Change, Not Just Model News
Over the past month, beyond the two core reports from 36Kr’s Intelligent Emergence and Baobian, other Chinese domestic media outlets have added complementary signals from different angles. China Business Journal focused on DeepSeek V4’s technical capabilities, long context, training data, inference cost, and pricing power, placing emphasis on how model capability becomes service capability. TMTPost and Semiconductor Industry Vertical placed V4 in the framework of “Chinese compute, Chinese models, Chinese rhythm,” emphasizing Huawei Cloud, domestic chip adaptation, and enterprise application access. 21st Century Business Herald treated DeepSeek V4 as a chain variable for capital markets, domestic chips, and AI application ecosystems. The Paper and National Business Daily focused more on the product layer, including expert mode, fast mode, and early multimodal gray testing.
Together, these reports present a very clear picture: Chinese industry media is not reporting on DeepSeek in isolation. It is using DeepSeek to observe how China’s AI industrial system is changing. One model release connects to financing. Financing connects to talent. Talent connects to long-term competition. Long-term competition connects to compute. Compute connects to Huawei Ascend and CANN. Domestic frameworks connect to cloud platforms. Cloud platforms connect to enterprise applications such as Kingsoft Office and 360. Enterprise applications then feed back into requirements for model segmentation, cost optimization, and service stability.
This is where China Industry Signals becomes valuable. Ordinary technology news often chases individual events: which model was released, what the benchmark score was, how high the valuation is, who invested, and how the stock price moved. Industry media’s granularity is closer to the industrial field. It notices when a financing window opens, why a product entrance changes, why talent moves, why multimodality is held back, why a model adapts to a particular chip, what framework migration means, and why enterprise applications choose a particular cloud platform.
DeepSeek’s story is moving from “one Chinese model shocks the world” to “how China’s AI industry organizes itself.” This change matters more than any single model capability. The next stage of global AI competition will not be only a model-ranking competition. It will increasingly look like a system competition: who has the model, who has the chips, who has the cloud, who has the application scenarios, who has the engineering talent, who has patient capital, and who can organize all of these elements into a continuously iterating industrial loop.
DeepSeek’s unique position is that it sits at several intersections at the same time. It is a representative open-source model company and also a symbol of Chinese technical idealism. It is a successful case of low-cost engineering and also the latest company to bear the high-cost pressure of frontier AI. It once benefited from the global AI software ecosystem and is now beginning to enter domestic compute frameworks. It challenges the giants, yet it must also face the same capital, talent, and product problems that the giants face.
This makes DeepSeek a very useful window into changes in China’s AI industry. It is not the largest platform company, not the most closed ecosystem, and not the most policy-driven national project. Precisely because of this, its turn carries stronger signal value. When a company like DeepSeek also begins fundraising, segmenting products, adapting to domestic compute, adjusting its multimodal sequence, and entering cloud and enterprise application ecosystems, China’s AI competition has moved beyond the stage of model heroism.
7. What Western Narratives Usually Underestimate Is Not DeepSeek’s Model Capability, but China’s Ability to Turn Models Into Industrial Systems
Outside discussions of DeepSeek usually focus on several questions: Is it really low cost? Does it threaten OpenAI? Does it hurt Nvidia? Does it prove that China can bypass US export controls? Is its valuation reasonable? These questions matter, but they tend to compress DeepSeek back into a single-point competition story.
The deeper question is: Is DeepSeek becoming part of China’s AI industrial system rather than remaining an isolated model company? If the answer is yes, then DeepSeek’s significance is not only technological. It is industrial-organizational. Every financing round, every product segmentation move, every model release, and every domestic chip adaptation is not only a company action. It is also a test of the carrying capacity of China’s AI system.
Western technology narratives are very good at understanding breakthroughs by individual companies. They are also good at understanding winners and losers in capital markets. But many changes in China’s AI sector do not always appear as a single company’s stock price or a single product launch. They more often appear as linkages among model companies, chip companies, cloud platforms, local industrial policies, open-source communities, application vendors, engineering talent, and capital markets. This linkage is not always elegant, and it is not always efficient. But it has strong industrial absorptive capacity.
The release of DeepSeek V4 provides a concrete case. It does not merely prove that DeepSeek is stronger. It simultaneously exposes several problems China’s AI system is trying to solve: how frontier model companies raise capital, how they retain talent, how they sequence capabilities under compute constraints, how they segment user experience, how they adapt to domestic chips, how they migrate development frameworks, how they connect to cloud platforms, and how they enter enterprise applications.
When these questions are combined, this is no longer ordinary model news. It becomes an early cross-section of China’s AI industrialization. DeepSeek becoming more “industrial” does not mean it has lost its technical edge. On the contrary, it may be the condition for preserving that technical edge over time.
The next stage of frontier AI will not reward one-off bursts. It will reward sustained organizational capacity. It will reward the ability to turn research breakthroughs into products, products into services, services into ecosystems, and ecosystems into larger flows of compute and data feedback. America’s advantages lie in capital markets, top model companies, cloud platforms, the GPU ecosystem, and global developer networks. China is trying to organize AI in another way: through domestic compute, open-source models, engineering optimization, application scenarios, cloud-platform access, and industrial-chain coordination, it is building a more industry-embedded path for AI development.
DeepSeek sits near the center of this process. It still preserves the outer shell of technical idealism, but it has begun to carry the weight of industrial realism. This is not a simple story of “maturity.” It is a signal that China’s AI competition is moving from model competition to system competition.
Conclusion: DeepSeek’s Next Competition Is Organizational Capability
The most important meaning of DeepSeek V4 is not that China has another strong model. That judgment is no longer enough. What matters is that DeepSeek is entering a harder and longer-term stage: it must prove not only that it can build strong models, but also that it can place those models inside a system made of capital, talent, compute, products, cloud platforms, domestic frameworks, and enterprise scenarios, and keep that system running.
36Kr’s Intelligent Emergence saw DeepSeek’s internal mechanism change: financing, talent, multimodal trade-offs, and product segmentation. 36Kr’s Baobian saw DeepSeek’s ecosystem-position change: Liang Wenfeng’s turn, V4’s adaptation to Huawei Ascend 950PR, and the shift from CUDA toward the CANN framework. Other industry-media reports added the broader industrial field: domestic compute, long context, cloud services, application access, expert mode, and capital-market response.
Together, these materials point to one conclusion: DeepSeek’s story is moving from model heroism to industrial organizational capability.
This is not an easier story. Model heroism has drama. Industrial organizational capability is heavier, longer, and harder for outside commentary to understand quickly. But what will truly determine China’s future position in AI is not one launch event, one leaderboard, or one financing round. It is whether China can organize models, chips, frameworks, cloud, applications, talent, and capital into a continuously iterating system.
DeepSeek’s turn is not the destination. It is an early signal that China’s AI competition has entered the industrialization stage.
Source note: This essay is based primarily on recent Chinese industry-media reports by 36Kr’s Intelligent Emergence and Baobian on DeepSeek V4, with additional references to coverage from China Business Journal, TMTPost, 21st Century Business Herald, The Paper, Science and Technology Daily, and National Business Daily. All translations of short quoted phrases are my own.