Seek .(SKLTY)
Search documents
2025“年度字词”揭晓:“韧”,“深度求索(DeepSeek)”
Xin Hua She· 2025-12-19 03:14
Core Viewpoint - The "Chinese Language Inventory 2025" event revealed the annual character "韧" (Resilience) and the annual phrase "深度求索" (DeepSeek) on December 19, 2025, highlighting the importance of language in recording life and societal changes in China [1] Group 1 - The event is co-hosted by the National Language Resources Monitoring and Research Center, Commercial Press, and Xinhua Net, marking the 20th anniversary of the "Chinese Language Inventory" initiative [1] - The initiative encourages the public to use language to document life and describe social changes from a Chinese perspective [1] - The organizers stated that the "Chinese Language Inventory" will continue to use the Chinese language as a medium to promote cultural heritage and mutual learning among civilizations [1]
“天才少女”罗福莉首秀:小米MiMo大模型,比DeepSeek更便宜、推理速度快三倍
Tai Mei Ti A P P· 2025-12-17 07:15
Core Insights - Xiaomi is making significant advancements in the AI large model sector, highlighted by the introduction of the MiMo-V2-Flash model, which boasts 309 billion total parameters and 15 billion active parameters, ranking among the top open-source models globally [2][3] - The company plans to invest 200 billion yuan (approximately 28.5 billion USD) in research and development over the next five years, with an expected R&D expenditure of 32-33 billion yuan (approximately 4.5-4.7 billion USD) for the current year [6] Model Development - MiMo-V2-Flash is designed to be cost-effective and faster, outperforming competitors like DeepSeek-V3.2 in inference speed by three times while being slightly cheaper [3] - The model has been open-sourced, providing developers with access to all model weights and API for integration into Web Coding IDE [5] Future AI Capabilities - The next generation of intelligent systems must evolve from merely answering questions to completing tasks, requiring enhanced memory, reasoning, and planning capabilities, as well as an Omni-modal perception ability [5] - A physical model is essential for AI evolution, enabling interaction with the real world and understanding physical laws, which current models lack [5] Financial and User Metrics - Xiaomi's global monthly active users have reached 742 million, with its AIoT platform connecting 1.04 billion devices and over 15,000 hardware partners [7] - The company has opened its CarIoT platform to the automotive industry, collaborating with major car manufacturers like BYD and GAC Toyota [7]
罗福莉首秀前,小米突然发布,代码全球最强,总体媲美DeepSeek-V3.2【附实测】
3 6 Ke· 2025-12-17 02:51
智东西12月17日报道,今天,小米发布并开源了最新MoE大模型MiMo-V2-Flash,总参数309B,激活参数15B。今日上午,小米2025小米人车家全生态合 作伙伴大会上,Xiaomi MiMO大模型负责人罗福莉将首秀并发布主题演讲。 该模型专为推理、编码和Agent场景构建,支持混合思维模式,允许用户切换模型是"思考"还是即时回答。它能一键生成功能齐全的HTML网页,并与 Claude Code、Cursor和Cline等氛围编码框架协同。该模型提供256k上下文窗口,能够完成数百轮Agent交互和工具调用的任务。 基准测试结果显示,MiMo-V2-Flash的性能基本与DeepSeek-V3.2相当,仅在不使用任何工具辅助的"人类最后一场考试"和创意文本生成评估ARENA- HARD中略逊色于DeepSeek-V3.2,但时延更小。 在多个Agent测评基准上,MiMo-V2-Flash位列全球开源模型Top 2;代码能力测评超过所有开源模型,比肩标杆闭源模型Claude 4.5 Sonnet,但推理价格 仅为其2.5%且生成速度提升至2倍。 MiMo-V2-Flash能以每秒150个token的速 ...
梁文锋的“左右互搏”:宕机的DeepSeek与闷声发财的幻方
Xin Lang Cai Jing· 2025-12-16 00:42
深夜23点,北京国贸写字楼的灯光只剩零星几点。程序员小林盯着屏幕上刺眼的"服务器繁忙"提示,第 三次尝试调用DeepSeek API失败。 就在他为瘫痪的程序焦头烂额时,千里之外的杭州,幻方量化的交易系统正自动完成一笔高频交易,其 旗下基金今年收益率已悄然逼近55%。 这两个看似无关的场景,却由同一个名字连接——"极客"梁文锋。他左手打造的DeepSeek从AI顶流沦为 争议焦点,右手培育的幻方量化却成为量化领域的隐形冠军。 这场冰与火的强烈反差,不仅是一家公司的战略分野,更撕开了AI行业的遮羞布——当概念炒作的潮 水退去,技术的真正价值到底藏在哪里? 01DeepSeek的180天坠落 2025年初的DeepSeek,曾是现象级的存在。 每天2400万人挤爆官网,单日APP下载量突破540万次,社交媒体上充斥着"AI改变世界"的狂热讨论。 上线次月,其月活用户就飙升至1.8亿,力压豆包、腾讯元宝登顶行业第一,估值被外媒疯炒至1500亿 美元,创始人梁文锋的身家也一度跃居亚洲科技大亨前列。 然而,这场狂欢的褪去速度,比任何人预想的都要快。 6月AI产品榜发布,DeepSeek排名从榜首滑落至第五,月活用户骤降 ...
估值1.05万亿!DeepSeek双登《自然》封神,中国AI如何做到颠覆?
Sou Hu Cai Jing· 2025-12-15 22:07
2025年末,一位中国创业者再度引爆科技圈。 国际顶级期刊《自然》新鲜出炉的年度十大科学人物榜单上,DeepSeek创始人梁文锋赫然在列。 要知道,该榜单每年仅甄选十位真正推动科学进步的领军者。梁文锋的入选,源自其带领团队研发的 DeepSeek大模型对全球AI格局的颠覆性重塑。 而这并非他与《自然》的首次邂逅——今年9月,他作为DeepSeek-R1论文核心作者已登上期刊封面, 短短三月内再次上榜,实力毋庸置疑。 正如《自然》赋予他的"Tech disruptor"评语,这位40岁的创业者已是公认的AI领域革命者。 接连的高光时刻,让梁文锋的崛起之路格外耀眼。他与估值1.05万亿的DeepSeek所缔造的传奇,究竟是 时运眷顾还是实力使然? 一、破局者之路,从10万到万亿的逆袭 长期以来,海外科技巨头始终认定中国AI难触核心技术,只能在产业链下游挣扎。然而,一位年轻企 业家的实践路径,正在系统性地扭转这一认知。 2013年,职业生涯起步阶段的梁文锋带着有限资本,进入变幻莫测的金融市场。当时他对人工智能的理 解尚处于探索阶段,却已展现出敢于挑战常规的勇气与远见。 两年后,他创立幻方科技,专注于量化投资这一专业 ...
PriceSeek重点提醒:铝锭现货价格全面下跌
Xin Lang Cai Jing· 2025-12-15 13:33
生意社12月15日讯 中国铝业股份有限公司2025年12月15日铝锭(AL99.70)现货价格各地区价格下跌,具体如下: 华东市场对外报价21710元/吨,华南市场对外报价21590元/吨,西南市场对外报价21650元/吨,中原市 场对外报价21650元/吨;较上一交易日分别下跌340元/吨、350元/吨、340元/吨、330元/吨。 PriceSeek评析 铝,多空评分:-1.5 生意社12月15日讯 中国铝业股份有限公司2025年12月15日铝锭(AL99.70)现货价格各地区价格下跌,具体如下: 华东市场对外报价21710元/吨,华南市场对外报价21590元/吨,西南市场对外报价21650元/吨,中原市 场对外报价21650元/吨;较上一交易日分别下跌340元/吨、350元/吨、340元/吨、330元/吨。 中国铝业2025年12月15日铝锭(AL99.70)现货价格在华东、华南、西南、中原市场均出现下跌,跌幅 分别为340元/吨、350元/吨、340元/吨、330元/吨。此次全面下跌表明市场需求疲软或供应过剩,短期 可能引发市场看跌情绪,对铝现货价格构成显著下行压力。评分为-1.5(介于一般利空和重大 ...
DeepSeek倒逼vLLM升级,芯片内卷、MoE横扫千模,vLLM核心维护者独家回应:如何凭PyTorch坐稳推理“铁王座”
3 6 Ke· 2025-12-15 00:36
Core Insights - vLLM has rapidly become a preferred inference engine for global tech companies, with GitHub stars increasing from 40,000 to 65,000 in just over a year, driven by the open-source PagedAttention technology [1] - Neural Magic played a crucial role in vLLM's success, utilizing a "free platform + open-source tools" strategy to build a robust enterprise-level inference stack and maintain a library of pre-optimized models [1] - Red Hat's acquisition of Neural Magic in November 2024, including key team members like Michael Goin, is expected to enhance vLLM's competitive edge in the AI large model sector [1][2] Development and Optimization - The vLLM core team, led by Michael Goin, has shifted focus from optimizing Llama models to enhancing features related to the DeepSeek model, particularly with the release of DeepSeek R1 [3] - The development cycle for version 0.7.2 was tight, efficiently supporting Qwen 2.5 VL and introducing a Transformers backend for running Hugging Face models [3] - Version 0.7.3 marked a significant update with numerous contributors involved, enhancing DeepSeek with multi-token prediction and MLA attention optimizations, as well as expanding support for AMD hardware [4] Hardware Compatibility and Ecosystem - The vLLM team is committed to building an open and efficient hardware inference ecosystem, supporting various mainstream chips and collaborating closely with hardware teams like NVIDIA and AMD [8] - The integration of PyTorch as a foundational layer allows vLLM to support a wide range of hardware, simplifying the adaptation process for hardware vendors [10][11] - The team's collaboration with hardware partners ensures that vLLM can maintain high performance across different platforms, with a focus on optimizing the architecture for new hardware like the Blackwell chip [8][9] Multi-Modal Capabilities - vLLM has evolved from a text-only inference engine to a unified service platform supporting multi-modal generation and understanding, including text, images, audio, and video [17][19] - The introduction of multi-modal prefix caching significantly improves efficiency in processing various input types, while the decoupling of encoders enhances resource utilization for large-scale inference [18][19] - The release of vLLM-Omni marks a milestone in multi-modal inference, allowing for seamless integration and resource allocation across different modalities [19][21] Community and Feedback Loop - The growing trend of companies contributing modifications back to the upstream vLLM project reflects a positive feedback loop driven by the speed of community version iterations [22][23] - Collaboration with leading model labs and companies enables rapid feedback collection, ensuring that vLLM remains competitive and aligned with industry developments [23][24] - The vLLM team is actively addressing developer concerns, such as startup speed, by implementing tracking projects and optimizing performance through community engagement [24][25] Strategic Positioning - Red Hat's deep involvement in vLLM is rooted in the strategic understanding that inference is a critical component of AI application costs, aiming to integrate cutting-edge model optimizations [26][27] - The governance structure of vLLM is decentralized, with contributions from multiple organizations, allowing Red Hat to influence the project while adhering to open-source principles [26][27] - The collaboration with the PyTorch team has led to significant improvements in supporting new hardware and models, reinforcing vLLM's position as a standard in inference services [27]
智见丨产业“DeepSeek时刻”的破局与重塑:创新药投资新框架
Sou Hu Cai Jing· 2025-12-12 06:45
Core Insights - The pharmaceutical industry is currently experiencing a new wave of innovation, transitioning from small molecule drugs to advanced therapies such as monoclonal antibodies, antibody-drug conjugates (ADCs), small nucleic acid drugs, and cell therapies, which offer more precise targeting and improved patient compliance [4][5][6]. Group 1: Innovation Trends - The global pharmaceutical industry is focusing on five key innovation directions, including the development of GLP-1 drugs for obesity, which are projected to generate approximately $51.8 billion in sales by 2024, reflecting a year-on-year growth of 42%-46% [6]. - ADCs are showing promise in replacing traditional chemotherapy for breast cancer and urothelial carcinoma, with expected sales of around $13 billion in 2024, a 25% increase from previous years [6][7]. - PD-1 monoclonal antibodies are recognized as a cornerstone in cancer immunotherapy, with projected sales exceeding $50 billion in 2024, marking a growth of over 10% [7]. - The prevalence of autoimmune diseases has increased from approximately 7.7% in 2000-2002 to about 11% in 2017-2019, indicating a growing market for innovative treatments targeting these conditions [8]. - Small nucleic acid drugs are expanding from rare genetic diseases to chronic conditions, with a peak sales estimate of around $3 billion for the siRNA drug Leqvio, approved in 2021 [8]. Group 2: China's Pharmaceutical Landscape - China's pharmaceutical industry has rapidly evolved over the past decade, with significant reforms initiated in 2015 that aligned the drug approval process with international standards, facilitating the approval of innovative drugs [9][10]. - The "engineer dividend" in China has led to a surge in talent across all segments of the pharmaceutical industry, enhancing the efficiency and cost-effectiveness of drug development and production [10][11]. - Despite a late start, China's innovative drug sector is experiencing remarkable growth, with a rising share of the global market, currently estimated at 3%-5% compared to a population share of about 18% [15][16]. - Recent government policies are aimed at supporting the development of innovative drugs, with comprehensive measures to enhance research funding, market access, and clinical application [19][20]. Group 3: Investment Strategies - The valuation of innovative drug companies typically employs a pipeline DCF (Discounted Cash Flow) approach, focusing on late-stage or highly probable products, while also considering the lifecycle of drugs and their patent protection [21][22]. - An alternative valuation method based on peak sales (PS) is gaining traction, allowing for a more straightforward assessment of potential revenue based on market consensus [22]. - Investment strategies emphasize the importance of established pharmaceutical companies with strong R&D capabilities and product pipelines, as well as biotech firms with high-potential single products targeting unmet clinical needs [27][28].
AI 价值链-Google Gemini 3 Pro、Claude Opus 4.5、Grok 4.1 与 DeepSeek 3.2…… 谁才是真正的领导者?这意味着什么
2025-12-12 02:19
Summary of Key Points from the Conference Call Industry Overview - The conference call discusses the U.S. semiconductor and internet industries, focusing on the AI value chain and the competition among leading AI models: Google Gemini 3 Pro, Claude Opus 4.5, Grok 4.1, and DeepSeek 3.2 [1][2][3]. Core Insights and Arguments - **Model Performance Comparison**: - Gemini 3 Pro and Claude Opus 4.5 are viewed as closely matched, while skepticism surrounds DeepSeek's claim to leadership. All three models have published benchmarks that favor their performance, but third-party benchmarking is still ongoing [3][4][14]. - Early results indicate that Gemini and Claude are neck and neck, with Grok 4.1 outperforming GPT-5 [3][14]. - **Scaling Laws**: - The scaling laws for AI models remain intact, suggesting renewed confidence among AI labs and their investors to expand AI infrastructure. Continued access to superior compute resources and unique data is essential for scaling [4][15]. - **OpenAI's Challenges**: - OpenAI is reportedly lagging behind its competitors, facing issues such as disappointing GPT-5 performance, failed pre-training runs, and significant talent departures. This situation raises concerns about its future leadership in the AI space [6][18][19]. - **Compute Infrastructure**: - The competition between GPUs and TPUs is highlighted, with concerns about Nvidia's market position. The defining theme is compute scarcity, which benefits both GPU and ASIC technologies [7][20][22]. - **Market Dynamics**: - There is a potential shift from model benchmarking to product adoption and monetization, as evidenced by Gemini's inability to displace ChatGPT despite superior performance [8][21]. Important but Overlooked Content - **DeepSeek's Position**: - DeepSeek's ability to quickly follow leading models raises concerns about the sustainability of frontier model economics if model improvement slows down. However, current model improvements are still strong [5][17]. - **Investment Implications**: - Nvidia (NVDA) is rated as outperforming with a target price of $275, citing a significant datacenter opportunity. Broadcom (AVGO) is also rated outperforming with a target price of $400, driven by a strong AI trajectory. AMD (AMD) is rated market perform with a target price of $200, contingent on OpenAI's success [10][11][12]. - **Consumer Behavior**: - OpenAI's large user base, with 800 million monthly active users, may provide a competitive moat despite its current challenges. The sticky nature of consumer behavior in technology could offer OpenAI some breathing room [18][19]. - **Future Monitoring**: - Investors are advised to closely monitor developments in the AI space, particularly regarding OpenAI's performance and the broader implications for the semiconductor and AI infrastructure markets [19][21]. This summary encapsulates the key points discussed in the conference call, providing insights into the competitive landscape of AI models, the challenges faced by leading companies, and the implications for investors in the semiconductor and AI sectors.
连姥姥都在问DeepSeek!一位AI六小龙掌门的反思与进击
Di Yi Cai Jing· 2025-12-11 12:18
Core Insights - The emergence of DeepSeek has significantly impacted MiniMax and other large model companies, prompting reflections on their performance and strategic choices [2][4] - MiniMax is focusing on a technology-driven approach rather than a purely monetization-driven strategy, recognizing the importance of sustainable growth in the AGI space [4][9] - The AI talent pool in China is a critical advantage, with a notable increase in the proportion of top AI researchers from China, which is expected to drive future breakthroughs [7][8] Group 1: Company Challenges and Strategies - MiniMax faced challenges early on, including financial difficulties due to the collapse of Silicon Valley Bank, which affected payroll [1] - The company has implemented a stock option incentive program, offering employees between hundreds of thousands to millions of dollars based on their contributions [3] - The team has learned to improve its capabilities in response to challenges, emphasizing morale-boosting strategies and financial incentives to maintain motivation [2] Group 2: Market Dynamics and Future Outlook - The number of large model companies is expected to decrease next year, as many well-funded and experienced players have exited the market [8] - Despite the competitive landscape, MiniMax believes that there is still room for various models to coexist, each with unique strengths and weaknesses [8] - The future of the AI industry is seen as distinct from the internet era, with the core product being the model itself, and the key competitive advantage being imagination and persistence [9]