Seek .(SKLTY)
State Council's SASAC pushes central enterprises to expand effective investment in computing power; DeepSeek updates its model
Xin Lang Cai Jing· 2026-02-11 23:57
Market Dynamics
- The State Council, led by Premier Li Qiang, emphasized cultivating new productive forces by integrating artificial intelligence across industries, with the aim of high-quality development [1]
- The State-owned Assets Supervision and Administration Commission (SASAC) urged central enterprises to expand effective investment in computing power and to promote synergy between computing power and electricity, in order to strengthen the foundation of the AI industry [1]
Company Developments
- Meta Platforms Inc. plans to invest over $10 billion in a data center in Lebanon, Indiana. The site covers 4 million square feet, is expected to be operational by late 2027 or early 2028, and should create 300 long-term jobs while supporting over 4,000 construction workers [3]
- The IPO review status of Shanghai Suiruan Technology Co., Ltd. on the Sci-Tech Innovation Board has changed to "inquired"; the company has focused on cloud AI chip design since its founding in 2018 [5]
- NetEase reported stable performance for 2025, with Q4 revenue of 27.5 billion yuan, total annual revenue of 112.6 billion yuan, and operating profit of 35.8 billion yuan, a 21% year-on-year increase [6]
- Newray Co., Ltd. plans to acquire a 70% stake in PCB tool company Huilian Electronics for no more than 700 million yuan, aiming to strengthen its competitiveness in the PCB tool sector [8]
- Zhongji Xuchuang clarified that CSP customers place orders directly with the company and that it is not bypassed in the supply chain [7]
- Zhongwei Semiconductor plans to build an IPM production line in Ziyang, Sichuan, using 121 million yuan of surplus raised funds [10]
It's here! DeepSeek's new model | With a link to try it
Xin Lang Cai Jing· 2026-02-11 13:22
Core Insights
- DeepSeek has released an updated model, enhancing its capabilities significantly [1][3]
Model Enhancements
- The context capacity has been upgraded to 1 million tokens from the previous 128,000, allowing for the processing of extensive content such as the entire "Three-Body Problem" trilogy [9][11]
- The knowledge base has been updated to May 2025, indicating a new foundational model, potentially referred to as DeepSeek V4 [9][14]
Performance Improvements
- The frontend and coding capabilities have seen substantial improvements, now comparable to top competitors like Gemini 3 Pro and K2.5 [10][12]
- The language style has become more lively and authentic, reducing inaccuracies and enhancing user interaction [10][13]
Limitations
- The model remains a pure text model and does not support visual understanding, focusing solely on text and voice inputs [14][15]
DeepSeek updates with a new model supporting a context length of up to 1M (one million) tokens
Xin Lang Cai Jing· 2026-02-11 11:35
Core Viewpoint
- DeepSeek has released a version update that supports a maximum context length of 1 million tokens, but it has not yet enabled multimodal capabilities [1][2].
Group 1: Version Update
- The recent update for DeepSeek on both web and app platforms allows for a context length of up to 1 million tokens [1][2].
- As of now, the updated version does not support multimodal capabilities [1][2].
Group 2: Future Developments
- Reports suggest that a minor update for the V3 series model is expected to be released around the Spring Festival [1][2].
- The next flagship model from DeepSeek is anticipated to be a trillion-parameter foundational model, but the significant increase in scale has slowed down the training speed, causing delays in the release process [1][2].
DeepSeek appears to have been updated: context surges to 1 million, knowledge base
Guan Cha Zhe Wang· 2026-02-11 11:24
Group 1
- The article highlights significant updates to the DeepSeek AI model, particularly its context processing capacity and the freshness of its knowledge base [1][3]
- After updating to version 1.7.4, DeepSeek claims a context processing capacity of 1M tokens, enough to handle the entire "Three-Body" trilogy [1]
- The latest official release, DeepSeek V3.2 (December 1, 2025), has a 128K context, so the 1M capacity is roughly an 8-fold increase over the previous version [3]
Group 2
- The knowledge base of DeepSeek has been updated to reflect information up to May 2025, improving its coverage of significant events and technological advances from late 2024 and early 2025 [3]
- Currently, DeepSeek does not support multimodal capabilities, indicating a potential area for future development [3]
- The official DeepSeek team has not made any public announcements or responses regarding these updates [3]
Is DeepSeek's new model here?
Hua Er Jie Jian Wen· 2026-02-11 11:21
Core Insights
- DeepSeek is rolling out its new model version through a grayscale test; this is potentially the final version before the official V4 launch [1]
- The V4 model is expected to be released in mid-February 2026 and is not expected to replicate the global panic over AI computing demand seen during the V3 launch [2]
- The core value of V4 lies in driving the commercialization of AI applications through underlying architectural innovations rather than disrupting the existing AI value chain [2]
Model Enhancements
- The context length of the model has been expanded from 128K to 1M, roughly an eightfold increase, and the knowledge base has been updated to May 2025 [1]
- V4 is expected to introduce two innovative technologies, mHC and Engram, which aim to overcome computing chip and memory bottlenecks [2][8]
- Initial internal tests indicate that V4 outperforms models like Anthropic's Claude and OpenAI's GPT series in programming tasks [2]
Technical Innovations
- mHC (Manifold-Constrained Hyper-Connections) addresses bottlenecks in information flow and training instability in deep Transformer models, making communication between neural network layers richer and more flexible [4]
- Engram is a "conditional memory" module that decouples memory from computation, allowing static knowledge to be stored in a sparse memory table and freeing expensive GPU memory for dynamic calculations [6]
Cost Efficiency and Market Impact
- The introduction of mHC and Engram is expected to significantly reduce training and inference costs, stimulating downstream application demand and initiating a new cycle of AI infrastructure development [8]
- The report suggests that Chinese AI hardware manufacturers may benefit from increased demand and investment due to these cost optimizations [8]
Market Dynamics
- The market landscape has shifted from a single dominant player to more fragmented competition, with DeepSeek's market share declining as more players enter the field [9][11]
- DeepSeek's gains in compute management efficiency and performance are accelerating the development of Chinese large language models and applications, altering the global competitive landscape [11]
Opportunities for Software Companies
- Major global cloud service providers are actively pursuing artificial general intelligence, and the capital expenditure race continues [12]
- If V4 can maintain high performance while significantly lowering training and inference costs, it will help developers convert technology into revenue more quickly, alleviating profit pressures [12]
- Enhanced capabilities of V4 are expected to create more powerful AI agents, transforming them from mere conversational tools into capable assistants that can handle complex tasks [12]
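The Engram description in this summary (static knowledge held in a sparse table and fetched by lookup rather than recomputed) can be illustrated with a toy sketch. This is not DeepSeek's implementation: the n-gram keying, the hash function, the table size, and every name below (`SparseMemoryTable`, `slot_for`, `lookup_sequence`) are assumptions made purely for illustration.

```python
# Toy illustration only: NOT DeepSeek's Engram implementation.
# Assumption: static knowledge is keyed by short token n-grams and fetched by
# a hash lookup, so retrieval itself needs no matrix multiplication.
from typing import Dict, List, Sequence, Tuple

TABLE_SLOTS = 1 << 16  # hypothetical number of slots in the table
EMBED_DIM = 4          # hypothetical width of each stored vector


def slot_for(ngram: Tuple[int, ...]) -> int:
    """Map an n-gram of token ids to a table slot with a deterministic hash."""
    h = 0
    for tok in ngram:
        h = (h * 1_000_003 + tok) % TABLE_SLOTS
    return h


class SparseMemoryTable:
    """Only slots that were written occupy memory; all others read as zeros."""

    def __init__(self) -> None:
        self._rows: Dict[int, List[float]] = {}

    def write(self, ngram: Sequence[int], vector: Sequence[float]) -> None:
        if len(vector) != EMBED_DIM:
            raise ValueError("vector has the wrong width")
        self._rows[slot_for(tuple(ngram))] = list(vector)

    def read(self, ngram: Sequence[int]) -> List[float]:
        return self._rows.get(slot_for(tuple(ngram)), [0.0] * EMBED_DIM)

    def stored_rows(self) -> int:
        return len(self._rows)


def lookup_sequence(table: SparseMemoryTable,
                    token_ids: Sequence[int],
                    n: int = 2) -> List[List[float]]:
    """Return one memory vector per position, keyed by the trailing n-gram."""
    out = []
    for i in range(len(token_ids)):
        ngram = tuple(token_ids[max(0, i - n + 1): i + 1])
        out.append(table.read(ngram))
    return out


if __name__ == "__main__":
    table = SparseMemoryTable()
    table.write((5, 7), [1.0, 2.0, 3.0, 4.0])
    print(lookup_sequence(table, [5, 7, 9]))
```

Because the table is an ordinary dictionary, it could live in host memory rather than on the accelerator, which is the kind of decoupling the summary attributes to Engram. A real system would also have to handle hash collisions and learn the stored vectors; both are omitted here.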
DeepSeek updates with a new model that can process very long texts in one pass
Xin Lang Cai Jing· 2026-02-11 11:13
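Sina Tech, evening of February 11: Multiple users report that DeepSeek has rolled out a version update on its web and app clients that supports a context length of up to 1M (one million) tokens. By comparison, DeepSeek V3.1, released last August, extended the context length to 128K.

In hands-on testing, DeepSeek states in Q&A that it supports a 1M context and can process very long texts in one pass. When a document of the novel "Jane Eyre" containing more than 240,000 tokens was submitted, DeepSeek was able to recognize the document's content.

A person familiar with the matter previously said that what DeepSeek is more likely to release around the Spring Festival is a minor update to the V3 series models. The same person also indicated that the main event is still on the way: DeepSeek's next-generation flagship model is expected to be a trillion-parameter foundation model, and because of this sharp jump in scale, training has slowed noticeably, pushing back the release.

Editor in charge: He Junxi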
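The figures in this report can be checked with simple arithmetic. The sketch below assumes "128K" and "1M" are decimal token counts (128,000 and 1,000,000) and uses the reported lower bound of 240,000 tokens for the "Jane Eyre" document; the constant and function names are illustrative and not part of any DeepSeek API.

```python
# Back-of-the-envelope check of the reported context-window figures.
# Assumption: "K" means 1,000 and "M" means 1,000,000 tokens.
OLD_CONTEXT_TOKENS = 128_000    # DeepSeek V3.1 context length per the report
NEW_CONTEXT_TOKENS = 1_000_000  # context length reported after the update
JANE_EYRE_TOKENS = 240_000      # lower bound: "more than 240,000 tokens"


def growth_factor(old_tokens: int, new_tokens: int) -> float:
    """How many times larger the new window is than the old one."""
    return new_tokens / old_tokens


def fits(document_tokens: int, window_tokens: int) -> bool:
    """True if the document alone fits inside the context window."""
    return document_tokens <= window_tokens


if __name__ == "__main__":
    print(growth_factor(OLD_CONTEXT_TOKENS, NEW_CONTEXT_TOKENS))
    print(fits(JANE_EYRE_TOKENS, OLD_CONTEXT_TOKENS))
    print(fits(JANE_EYRE_TOKENS, NEW_CONTEXT_TOKENS))
```

Under the decimal reading the window grows by a factor of about 7.8; if the figures are instead binary (131,072 and 1,048,576 tokens), the factor is exactly 8. Either way, a document of 240,000 tokens exceeds the old window and fits in the new one.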
DeepSeek suddenly tests a new model: is a big Spring Festival release coming?
Feng Huang Wang· 2026-02-11 10:52
Core Insights
- The recent DeepSeek upgrade does not include multimodal visual understanding; it stays on a pure text and voice interaction path [2]
- The core context window has been increased from 128K to 1M tokens, allowing the model to process texts as long as the "Three-Body Problem" trilogy in a single pass and positioning it against international competitors like GPT-5 and Gemini 3 Pro [2]
- The model can accurately describe news events as recent as April 2025, and its knowledge cutoff is now May 2025 [2]
User Experience and Development
- Feedback from developers and early users indicates that the new model's language style has become "enthusiastic and nuanced," with front-end response quality rated as comparable to Claude 3.5 Sonnet, suggesting a focus on improving the user interaction experience [5]
- The company has been actively hiring for multiple core technical positions, including deep learning researchers and engineers, indicating a commitment to advancing its large language model (LLM) capabilities [5]
- There is speculation within the industry that the current version may correspond to the rumored "DeepSeek V4" or an enhanced version of the V3.2 series, although the official version name has not yet been disclosed [5]
DeepSeek suddenly tests a new model; context now at the million-token level
Feng Huang Wang· 2026-02-11 10:37
Core Insights
- DeepSeek has begun a key update that expands the model's context window from 128K to 1M tokens, allowing it to process long texts on a par with international products like GPT-5 and Gemini 3 Pro [1]
- The model's knowledge base has been updated to include information up to May 2025, and it can accurately describe news events as recent as April 2025 [1]
- User feedback indicates that the new model exhibits a more "enthusiastic and nuanced" language style, improving the user interaction experience [1]
Group 1
- DeepSeek has begun gray testing for its updated model on both web and app platforms [1]
- The new model's context window allows it to handle the entire "Three-Body" trilogy in a single pass [1]
- The upgrade does not include multimodal visual understanding capabilities, focusing instead on text and voice interactions [1]
Group 2
- DeepSeek has been actively hiring for multiple core technical positions, including deep learning researchers and engineers, indicating a focus on advancing its large language model (LLM) capabilities [2]
- The company is open to various recruitment channels, including campus recruitment and internships, to fill these positions [2]
- There is speculation that the current version being tested may correspond to the previously rumored "DeepSeek V4" or an enhanced version of V3.2 [2]
Huawei Cloud's "CodeArts" (码道) code agent opens public beta, supporting GLM-4.7 and DeepSeek-V3.2
Xin Lang Cai Jing· 2026-02-11 10:32
Core Insights
- Huawei Cloud officially launched "CodeArts," an AI-powered coding assistant, in January 2023, which integrates IDE, autonomous development mode, and code library indexing capabilities, currently in public beta for 10,000 users [1][8]
- The personal version of "CodeArts" is available for free to developers, while the enterprise version will be announced later [1][8]
- The product utilizes GLM-4.7 and DeepSeek-V3.2 models and supports JetBrains series and Visual Studio Code IDEs [1][8]
Product Features
- "CodeArts" combines essential programming capabilities such as project-level code generation, code continuation, research knowledge Q&A, and unit test case generation, significantly enhancing developer productivity and providing a high-quality coding experience [2][9]
- The tool allows users to input requirements, enabling the AI to generate code directly [3][11]
Copyright and Usage
- Huawei Cloud states that the copyright of the code generated by the AI belongs to the user, emphasizing that "CodeArts" functions as a tool that responds to user inputs without creative autonomy [5][13]
Is ByteDance about to recreate another DeepSeek moment?
Feng Huang Wang· 2026-02-11 10:25
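From here on, everything of value in the film and television industry will be revalued.

Summary: During the Spring Festival a year ago, a Chinese company stunned the global tech world with its R1 large model. By using algorithmic innovation to break through compute constraints, it achieved at relatively low training cost what earlier large AI models had achieved only by piling up compute, funding, and data. Foreign media called this breakthrough the "DeepSeek moment."

The aftershocks of that technological earthquake have not yet subsided, and in February 2026 ByteDance appears ready to take up the baton and set off another tsunami, this time in video generation.

Just a few days ago, Seedance 2.0, ByteDance's latest-generation video generation model, quietly entered small-scale internal testing in products such as Jimeng and Doubao.

If you remember the night OpenAI released Sora, the feeling now is familiar and yet different. The difference is that this time another Chinese company stands at the eye of the storm, and, more importantly, this is no longer a demo in the hands of a few people but a "battle-ready weapon" being tested intensively by tens of thousands of creators.

A practitioner who previously focused on big-screen productions told Feng Huang Wang Technology, "The level of polish in Seedance 2.0 is stunning. I tried creating content with cats, dogs, and other pets, and the results were all very good." The same person is nonetheless worried about where the film and television industry goes next: "I can foresee production budgets being cut. A work like 'The Wandering Earth 3' was greenlit a few years ago at a cost of several hundred million yuan; for works like that, which are still in production, everything will have to be revalued from here on."

A trump card, quietly played

On the evening of February 7 ...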