{"id":223,"date":"2025-01-28T11:58:04","date_gmt":"2025-01-28T19:58:04","guid":{"rendered":"https:\/\/www.barrybriggs.com\/blog\/?p=223"},"modified":"2025-01-28T11:58:04","modified_gmt":"2025-01-28T19:58:04","slug":"is-chinas-deepseek-a-threat-to-microsofts-ai-strategy","status":"publish","type":"post","link":"http:\/\/www.barrybriggs.com\/blog\/programming\/is-chinas-deepseek-a-threat-to-microsofts-ai-strategy\/","title":{"rendered":"Is China\u2019s DeepSeek a Threat to Microsoft\u2019s AI Strategy?"},"content":{"rendered":"\n<p>All the tech world continues to be abuzz as it woke yesterday morning to the release of the DeepSeek-R1 family of AI language models, funded and created in China. At one point during the day yesterday tech stocks had plummeted \u2013 only to return to more or less normalcy (whatever that means) today.<\/p>\n\n\n\n<p>Has the AI bubble burst, as many (including me) have predicted? Is Microsoft\u2019s AI strategy in jeopardy?<\/p>\n\n\n\n<p>I don\u2019t think so. But let\u2019s dive into the details to fully understand its ramifications. &nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Planning and Reasoning<\/h2>\n\n\n\n<p>DeepSeek rivals the most advanced AI models (such as OpenAI\u2019s <a href=\"https:\/\/openai.com\/o1\/\">o1<\/a>) in its ability to plan and reason in order to solve a problem.<\/p>\n\n\n\n<p>Consider this problem. Notice that DeepSeek begins with a &lt;think&gt; step in which it plans its attack on the problem:<\/p>\n\n\n\n<p><em>&gt;&gt;&gt; what is 2*10 divided 4 with pi added to it?<br>&lt;think&gt;<br>First, I need to calculate the product of 2 and 10.<br>Next, I will divide that result by 4.<\/em><\/p>\n\n\n\n<p><em>Finally, I&#8217;ll add the value of \u03c0 (pi) to the quotient obtained from the division.<br>&lt;\/think&gt;<\/em><\/p>\n\n\n\n<p>Then it executes its plan:<\/p>\n\n\n\n<p><em>1. **Multiply 2 and 10:**<br>&nbsp;&nbsp; 2 \\times 10 = 20<\/em><\/p>\n\n\n\n<p><em>2. **Divide the result by 4:**<br>&nbsp;&nbsp; 20 \\div 4 = 5<\/em><\/p>\n\n\n\n<p><em>&nbsp;3. **Add \u03c0 (pi) to the quotient:**<br>&nbsp; 5 + \\pi<\/em><\/p>\n\n\n\n<p><em>Since \u03c0 is approximately 3.1416, we have:<br>&nbsp;&nbsp; 5 + 3.1416 = 8.1416<\/em><\/p>\n\n\n\n<p><em>**Final Answer:**<br>&nbsp;{8.14}<\/em><\/p>\n\n\n\n<p>At first blush it seems remarkable; DeepSeek\u2019s methodical thinking process at least appears to closely mimic how humans think (very similar to how OpenAI\u2019s o1 model functions).<\/p>\n\n\n\n<p>But it\u2019s worth pointing out that DeepSeek is not a from-scratch LLM \u2013 it\u2019s a distillation of other foundation models (Meta\u2019s Llama and Alibaba\u2019s QWEN). Essentially DeepSeek was \u201ctaught\u201d by them \u2013 meaning DeepSeek did not need the vast resources that their pedagogues required.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">DeepSeek is Open-Source<\/h2>\n\n\n\n<p>Importantly, DeepSeek\u2019s models \u2013 there are six of them varying by number of parameters \u2013 are freely <a href=\"https:\/\/ollama.com\/library\/deepseek-r1\">downloadable<\/a>; all but the largest fit comfortably on a decent-size desktop (the 14 billion parameter model occupies about 9GB on my machine).<\/p>\n\n\n\n<p>Moreover, DeepSeek is offered under the permissive <a href=\"https:\/\/opensource.org\/license\/mit\">MIT License<\/a> (<em>\u201c\u2026without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and\/or sell copies of the Software\u2026\u201d<\/em> ) which means that developers can use it in applications pretty much any way they want. Purportedly, even the model weights are available, although I haven\u2019t found them yet (DeepSeek\u2019s GitHub repo, created yesterday, is <a href=\"https:\/\/github.com\/deepseek-ai\/DeepSeek-R1\">here<\/a>.) And finally, the DeepSeek team has published a <a href=\"https:\/\/arxiv.org\/pdf\/2501.12948\">comprehensive paper<\/a> outlining their technical methodology \u2013 meaning that anyone, in theory, can reproduce their work (apparently there is <a href=\"https:\/\/github.com\/huggingface\/open-r1\">work in progress<\/a> to do just that).<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Microsoft and DeepSeek (and Other LLMs Generally)<\/h2>\n\n\n\n<p>But how does it affect Microsoft and its strategy?<\/p>\n\n\n\n<p>It\u2019s no secret that LLMs in their short lifetime have become commoditized, a fact Microsoft has wisely recognized. Since ChatGPT was announced just a little over two years ago, dozens of foundation and frontier models have appeared; the LLM repo <a href=\"https:\/\/huggingface.co\/\">HuggingFace<\/a> now offers well over 1 million fine-tuned large and small language models.<\/p>\n\n\n\n<p>In fact, models form just one part of Microsoft\u2019s overall strategy, which comprises a far more expansive and inclusive view of AI in the enterprise. For Microsoft, the true value of AI lies in the myriad applications it can power \u2013 and that developers can build using it.<\/p>\n\n\n\n<p>Microsoft\u2019s vision for the AI-powered enterprise includes providing user interfaces to LLMs connected to corporate data (Copilot); offering a wide assortment of LLMs for developers to make use of; deep set of AI-focused tools for developers to use (AI Builder, AI Foundry, and ML Studio); and lastly, providing access to the \u201dfuel\u201d that powers enterprise AI applications, data, including productivity data (Microsoft 365), analytical data (Fabric), and corporate applications through connectors.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Political Ramifications of DeepSeek: Another TikTok?<\/h2>\n\n\n\n<p>It\u2019s unlikely that the current US administration will, or can, block the use of DeepSeek as it nearly did with TikTok. By now DeepSeek has been downloaded to millions of computers (including, as you\u2019ve seen, mine); blocking it as a purely technical matter will be close to impossible.<\/p>\n\n\n\n<p>Nevertheless, because DeepSeek originates in China, geopolitics cannot be ignored. I asked it point-blank if China\u2019s ruler Xi Jinping is a dictator; after an over-600-word dissertation, it replied (note that, as mentioned, I am using a downloaded version of DeepSeek; evidently the online version, hosted in China, is more circumspect):<\/p>\n\n\n\n<p><em>Labeling Xi Jinping as a dictator depends on one&#8217;s perspective of China&#8217;s political system and the definition of dictatorship applied. Considering the unique governance structure and collective leadership within the CCP, it is complex to apply traditional Western definitions of dictatorship to China&#8217;s context.<\/em><\/p>\n\n\n\n<p>I then asked Microsoft\u2019s homegrown <a href=\"https:\/\/techcommunity.microsoft.com\/blog\/aiplatformblog\/introducing-phi-4-microsoft%E2%80%99s-newest-small-language-model-specializing-in-comple\/4357090\">Phi4<\/a> model the same question and received more or less the same diplomatic, noncommittal answer:<\/p>\n\n\n\n<p><em>Ultimately, whether one views Xi Jinping as a dictator may depend on their interpretation of political systems, definitions of democracy and authoritarianism, and perspectives on governance in different cultural contexts.<\/em><\/p>\n\n\n\n<p>DeepSeek also (somewhat surprisingly) provided relatively objective answers on topics controversial in China, such as the 1989 Tiananmen Square massacre; however it refused to answer a question about the state of the Chinese housing market, saying it only provides \u201chelpful and harmless responses.\u201d Hmmm.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Microsoft, OpenAI, and DeepSeek<\/h2>\n\n\n\n<p>The interactions above raise some important questions. Although DeepSeek appears to have achieved a new level of LLM transparency, we do not yet know to what extent bias and harmful content are filtered or guardrails have been applied \u2013 whereas Microsoft and OpenAI scrupulously follow Responsible AI methodologies. Additionally, not much is known about the Chinese startup that created it, which might raise concerns about using DeepSeek in mission-critical applications.<\/p>\n\n\n\n<p>It&#8217;s not perfect by any means. DeepSeek\u2019s knowledge stops at July 2023 and it doesn\u2019t appear to have scoured every available internet source (for example, it didn\u2019t know that I worked at Microsoft so hadn\u2019t seen either my <a href=\"https:\/\/www.barrybriggs.com\">personal website<\/a> or <a href=\"https:\/\/www.linkedin.com\/in\/barrybriggs\">LinkedIn<\/a>).<\/p>\n\n\n\n<p>Nevertheless, it\u2019s possible, likely even, that DeepSeek models could show up in Azure\u2019s stable of LLMs, to be evaluated, compared, tested, and perhaps deployed within applications on Azure. So at least in one way DeepSeek could complement Azure.<\/p>\n\n\n\n<p>But, when asked how DeepSeek could impact Microsoft, the LLM itself had a slightly more ominous answer:<\/p>\n\n\n\n<p><em>\u2026in the fast-evolving tech landscape, companies like DeepSeek could potentially compete with Microsoft in areas such as AI-powered search engines, enterprise software solutions, or cloud services. For example, if DeepSeek develops advanced AI tools that rival Microsoft&#8217;s offerings (like Copilot for Office), it could influence market dynamics. Similarly, partnerships or collaborations between the two companies could also emerge in the future.<\/em><\/p>\n\n\n\n<p>Well, I doubt either will happen; it\u2019s hard to see a DeepSeek-based Copilot as it\u2019s so tightly integrated into Microsoft 365. My view is rather that DeepSeek \u2013 just one component of the overall AI stack \u2013 will prove generally beneficial to the overall AI ecosystem and Microsoft in particular.<\/p>\n\n\n\n<p>But we\u2019ll see.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>All the tech world continues to be abuzz as it woke yesterday morning to the release of the DeepSeek-R1 family of AI language models, funded and created in China. At one point during the day yesterday tech stocks had plummeted \u2013 only to return to more or less normalcy (whatever that means) today. Has the &hellip; <\/p>\n<p class=\"link-more\"><a href=\"http:\/\/www.barrybriggs.com\/blog\/programming\/is-chinas-deepseek-a-threat-to-microsofts-ai-strategy\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Is China\u2019s DeepSeek a Threat to Microsoft\u2019s AI Strategy?&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"episode_type":"","audio_file":"","cover_image":"","cover_image_id":"","duration":"","filesize":"","date_recorded":"","explicit":"","block":"","filesize_raw":"","footnotes":""},"categories":[12,9,14],"tags":[],"class_list":["post-223","post","type-post","status-publish","format-standard","hentry","category-ai","category-programming","category-policy"],"_links":{"self":[{"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/posts\/223","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/comments?post=223"}],"version-history":[{"count":1,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/posts\/223\/revisions"}],"predecessor-version":[{"id":224,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/posts\/223\/revisions\/224"}],"wp:attachment":[{"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/media?parent=223"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/categories?post=223"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.barrybriggs.com\/blog\/wp-json\/wp\/v2\/tags?post=223"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}