14 KiB
linkTitle, title, breadcrumbs, next, description, cascade
| linkTitle | title | breadcrumbs | next | description | cascade | ||
|---|---|---|---|---|---|---|---|
| AI Daily | AI Daily-AI资讯日报 | false | /en/2025-12/2025-12-19 | Your daily source for curated AI news, practical tools, and actionable tutorials to master Artificial Intelligence; |
|
AI Daily News 2025/12/20
AI News | Daily Briefing | Web Data Aggregation | Frontier Science Exploration | Industry Insights | Open-Source Innovation | AI & Human Future | Visit Web Version | Join Group Chat
Today's Digest
Google releases 270M parameter FunctionGemma with 85% accuracy
GPT-5.2-Codex becomes the strongest programming model, reaching 56.4% on SWE-Bench
RUC-Tencent confirms long reasoning chains accumulate noise, proposes Adaptive Think
Manus hits $100M ARR in eight months, fastest growth record globally
Pieter Abbeel takes over as Amazon AGI head, leading frontier research
Product & Feature Updates
-
Google unveils FunctionGemma. FunctionGemma, a small 270M parameter model, can now directly convert natural language into device commands (AI News)! Its accuracy skyrocketed from 58% to an impressive 85% in tests. Imagine saying "set a reminder to feed the cat at 8 PM" and it instantly gets it, calling the system API. This isn't just a chatbot anymore; it's a powerful 🚀 smart agent ready to get things done.

-
Google Gemini can detect AI-generated videos. Google Gemini now lets users upload videos ⬆️ to directly check if they were generated by Google AI. It leverages SynthID watermark technology to inspect both visual and audio tracks. This cool feature supports videos up to 100MB and 90 seconds, and it's free to use globally (AI News)—no subscription needed! ✨
-
OpenAI releases GPT-5.2-Codex. GPT-5.2-Codex is officially here, and it's currently the most powerful agent programming model out there! 🤯 It boasts a 56.4% accuracy on SWE-Bench Pro and can stay focused on complex tasks for extended periods without losing its place. Its defensive cybersecurity capabilities are also top-tier, even helping researchers uncover a critical React framework vulnerability (AI News).

-
Kling 2.6's motion control feature is live. Kling 2.6 just dropped a new motion control feature, letting users define how characters in their images move! 🤩 You can even join a creation contest for a chance to win up to $1000 cash! Five first-prize winners will also snag 16,000 points, and if you submit by December 31st, your work might even get featured on the official homepage (AI News). Don't miss out!

-
Mistral releases OCR 3. Mistral OCR 3 is here, crushing its predecessor with a 74% win rate when handling scanned forms and handwritten content! 📈 It costs just $2 per thousand pages, with bulk discounts bringing it down to a sweet $1. Plus, it can preserve complex table structures and even supports direct Markdown output (AI News). Talk about efficiency!

Frontier Research
-
Large models' "thinking too much leads to errors" confirmed. The RUC-Tencent team has officially confirmed the "thinking too much leads to errors" phenomenon in large models! 🤯 Using information theory, they discovered that excessively long reasoning chains accumulate noise. Their solution? A new Adaptive Think strategy that tells the model to "stop when confident." This approach slashed Token consumption on GSM8K by half, and even improved accuracy (AI News). No wonder their paper was selected as a NeurIPS 2025 Spotlight! ✨
-
JARVIS framework enhances visual reasoning. The JARVIS framework, a self-supervised learning framework (AI News) inspired by I-JEPA, is boosting visual reasoning for multimodal large models! 🧠 It helps them learn visually without relying solely on text descriptions. Experiments consistently show significant improvements on vision-centric tasks without compromising other multimodal reasoning abilities. The code is already open-sourced on GitHub – go check it out! 🚀
-
AIMM detects social media stock market manipulation. AIMM, a new AI framework, is here to sniff out stock market manipulation on social media! 🧐 It combines Reddit activity and OHLCV data to generate a daily manipulation risk score. Amazingly, it issued a warning 22 days before the GME event! The truth dataset (AI News), containing 33 labeled samples, has already been open-sourced. Take that, market manipulators! 📉
-
Pull-based protocols solve AI collaboration challenges. A recent paper dives into AI collaboration, finding that knowledgeable Leaders often struggle to guide Followers effectively due to a lack of "theory of mind," causing success rates to plummet from 35% to a mere 17%. 📉 But here's the kicker: experiments proved that active, question-driven Pull protocols are more stable than Push commands (AI News), doubling the frequency of clarification requests. It seems asking is better than telling! 🤔
Industry Outlook & Social Impact
-
Manus hits $100M ARR in 8 months. Manus, a Singaporean AI agent company, has just set a new global record, smashing past $100 million in ARR in just eight months! 🚀 With a monthly compound growth rate exceeding 20%, it has processed a staggering 147 trillion tokens. This powerhouse can autonomously handle complex tasks (AI News), from resume screening to full-stack development, all with a lean team of just 105 people. Mind-blowing! 🤯

-
Amazon AGI head steps down. Pieter Abbeel, the reinforcement learning guru, is taking the reins of Amazon's frontier research team, replacing Rohit Prasad after his two-year tenure. This UC Berkeley professor's former students include OpenAI co-founders (AI News), and his academic citations total a whopping 231,000! Talk about a big-name hire! 🌟
-
ByteDance AI phone solution unveiled. ByteDance's AI phone solution is shaking things up! 📱 They're waiving token sharing and custom development fees, asking only for a prominent entry point. They're already in talks with Vivo, Lenovo, and Transsion to pre-install Doubao Assistant (AI News). This means phone manufacturers can rake in a share of traffic and membership revenue, directly hitting the previous pain point of sky-high token costs. Smart move! 💸
-
AWS CEO opposes laying off junior developers. AWS CEO Matt Garman is calling out the "dumbest idea ever": replacing junior developers with AI. 🙅♂️ He argues that junior employees are actually better at using AI tools. Garman emphasizes that the talent pipeline is like a sports team; not nurturing new talent will lead to a gap (AI News) down the line. He believes AI will create even more jobs in the long run. Good point! 💡
Top Open-Source Projects
-
PentestGPT: A penetration testing powerhouse. PentestGPT, a GPT-driven security tool, is automating penetration testing workflows, helping security researchers uncover system vulnerabilities faster! 🛡️ It supports analysis across various attack vectors and is open-source and free to use (AI News). Sweet! 👍
-
Stanford CS229 Cheatsheet. This VIP cheatsheet for Stanford's classic CS229 Machine Learning course is a goldmine! 📚 It covers core concepts like supervised learning and deep learning. An absolute must-have for review and exam prep, it's truly a condensed essence (AI News) of knowledge. Get studying! 🧑💻
-
Metabase: Open-source BI tool. Metabase, a business intelligence powerhouse, makes data handling a breeze for everyone! 📊 It supports embedded analytics and visualization, and its enterprise-grade features are fully open-source (AI News). This is truly great news for small and medium-sized teams! 🎉
Social Media Share
-
Context engineering becomes a new moat. The Box CEO made a killer point: AI agents are evolving from "model capabilities" to "system architecture," and the root cause of failure isn't logical flaws, but information asymmetry. 🤯 He argues that context engineering is essentially reverse-engineering what information input (AI News) an expert needs. This is the new moat! 🏰

-
ByteDance's 35% salary increase is insane. While everyone else is hitting the brakes on growth, ByteDance just announced an insane average salary increase of 35%! 💰 Netizens are collectively expressing envy, jealousy, and hatred (AI News) – and who can blame them?! Wild! 🤑

-
Xiaohongshu AI video goes viral with 100K likes. Uncle Yingfeng's viral AI video on Xiaohongshu just racked up 100,000 likes! 📈 His work ingeniously avoided the dreaded AI breathing pauses, and the sound transitions and rhythm were both precise and impactful. Gaining 100K likes in just 10 days proves the terrifying power of long-tail recommendations (AI News). Check it out! 👇
-
Claude Code is surprisingly powerful. Li Mo just showed off how surprisingly powerful Claude Agent SDK is! 🤯 He demonstrated using a Feishu app as a database for one-click collection and publishing to Xiaohongshu, and even wrapping it as an API to run periodically. The coolest part? When running a dozen tasks in parallel, if there's an error, it will self-correct its code (AI News) and rerun! That's next-level automation. ✨
-
Dissecting Plan Mode's Architectural Moat. The Flask author is shedding light on Plan Mode's architectural moat, pointing out that its native implementation is deeply integrated with IDE toolchains, allowing it to perceive file states in real-time. This means users can intercept approvals at atomic-level steps, essentially transforming from a coder to a reviewer (AI News)! Talk about control! 🧐

-
16-year-old hacker breaks into four tech giants. A 16-year-old hacker managed to breach Discord, Vercel, Cursor, and X through a Mintlify SVG/XSS vulnerability! 🤯 However, the bounty payments amounted to only a few thousand dollars, sparking controversy. The discussion highlighted that putting third-party content on the main domain is the root cause of creating risk (AI News) in the first place. Food for thought! 🤔
-
Google Conductor introduces context-driven development. Google Conductor, a new Gemini CLI extension, is here to revolutionize development with context-driven AI! ✨ It automatically scans your project structure, extracts relevant code, and packages it into rich context requests for your model. Say goodbye to tedious manual copy-pasting, and ensure AI is no longer "feeling the elephant in the dark" (AI News). Genius! 💡

AI Daily News Audio Edition
| Xiaoyuzhou | Douyin |
|---|---|
| Afterlife Tavern | Self-media Account |
![]() |
![]() |

