--- linkTitle: 06-28-Daily title: 06-28-Daily AI Daily weight: 3 breadcrumbs: false comments: true description: Daily selection of AI industry news, open source hot spots, academic frontiers and big V opinions. AI information; AI daily; AI knowledge base; AI tutorials; AI information daily; AI tools;OpenAI just announced it's bought Crossing Minds, a company all about AI recommendation systems for e-commerce. Their team has already joined OpenAI. The move is all about beefing up OpenAI's capabilities in key areas like personalized recommendations, retrieval-augmented generation (RAG), and re... --- ## AI Insights Daily 2025/6/28 > `AI Daily Report` | `Morning Update` | `Aggregating Web Data` | `Exploring Cutting-Edge Science` | `Industry's Unfiltered Voice` | `Open-Source Innovation Power` | `AI & Humanity's Future` | [Visit Web Version ↗️](https://ai.hubtoday.app/) ### **AI Content Summary** ``` Lots of companies are dropping AI product updates left and right. OpenAI snagged Crossing Minds to boost personalized recommendations and AGI apps, while Hengbot rolled out its smart robot dog. Google, on the other hand, launched its Gemma 3n model and the Doppl virtual try-on app. Suno bought WavTool to beef up its music editing features, especially with all those copyright lawsuits they're facing. Meanwhile, AI research spilled the beans on a "grokking" phenomenon in large model pre-training. Plus, folks have been widely sharing their tips for building AI agents and optimizing code review assistants. ``` ### **AI Product & Feature Updates** 1. **OpenAI** just announced it's bought **Crossing Minds**, a company all about AI recommendation systems for e-commerce. Their team has already joined OpenAI. The move is all about beefing up OpenAI's capabilities in key areas like **personalized recommendations**, **retrieval-augmented generation (RAG)**, and **real-time user modeling**, speeding up the rollout of **artificial general intelligence (AGI)** in real-world applications. This strategic acquisition will also help OpenAI supercharge its personalized modeling and e-commerce recommendation systems, broaden **ChatGPT**'s commercial use cases, and push forward user fine-tuning and behavior understanding systems for post-training phases. 🚀✨ ['More Details'](https://www.crossingminds.com/)
![OpenAI acquires Crossing Minds](https://assets-v2.circle.so/k2bihhhpptnld7s9yjhy5rcklimh)
2. **Hengbot** just dropped its new **Sirius robot dog**, and get this – it's not just super agile, doing things like dancing and playing soccer. It also comes packed with **OpenAI**'s **large language model**, so it can have voice conversations and even develop its own unique personality. This multi-functional smart robot dog is already up for pre-order on their official website for $1299, with a full launch expected this fall. It's looking like it could be the next big thing for families. 🐶🤖🎉
![Hengbot Sirius robot dog](https://autoproxy.justlikemaki.vip/?pp=https://pic.chinaz.com/2025/0627/6388664055657490519988244.png)
3. AI music company **Suno** announced it's buying **WavTool**, a browser-based AI digital audio workstation. The goal is to beef up its song creation and production editing capabilities, and this move comes right as Suno is facing a bunch of **music copyright lawsuits**. 🤔 While the acquisition terms haven't been spilled, most of WavTool's crew has already joined the Suno team. This whole thing might be Suno's way of distracting folks from the legal battles and signaling confidence to investors, especially since they've already bagged $125 million in funding. 🎶⚖️
![Suno acquires WavTool](https://autoproxy.justlikemaki.vip/?pp=https://pic.chinaz.com/picmap/202005281122130015_60.jpg)
4. **Google Labs** just rolled out a brand-new virtual try-on app called **Doppl**. Users can upload photos or screenshots to **dynamically try on any outfit**, helping them explore and express their personal style. Right now, it's live on iOS and Android platforms in the U.S. What sets this app apart from previous static, brand-limited virtual try-ons is that it can generate animated videos, giving users a much more intuitive look at how clothes will actually appear on them, helping them decide on their outfits. 👗🤳✨
![Google Doppl virtual try-on](https://assets-v2.circle.so/4tjlf3vvqk77u07immaxg452so6a)
5. **Google** has relaunched and tweaked its "**Ask Photos**" search tool, powered by **Gemini AI**, aiming to make finding photos faster and a better experience for users. 📸🔍 The feature now gives instant results for simple queries, while handling more complex ones in the background, and it's rolling out gradually to more U.S. users. 👍
![Google Ask Photos update](https://autoproxy.justlikemaki.vip/?pp=https://pic.chinaz.com/2025/0627/6388662236368236647884395.png)
6. Google officially dropped its next-gen **open-source lightweight multimodal large model**, **Gemma 3n**. It's specially optimized for **mobile and edge devices**, aiming to deliver **native multimodal** capabilities that are almost as good as cloud-based models. 💡📱 This is the most advanced version in the Gemma series to date, supporting image, audio, video, and text input, plus text output. It's shown stellar performance in **lmarena.ai** tests, with significant boosts especially in math, programming, and reasoning. 🤯 ['More Details'](https://developers.googleblog.com/en/introducing-gemma-3n-developer-guide/)
![Google Gemma 3n model](https://assets-v2.circle.so/48ph1ou3at97bcecx9v4exbkgh69)

![Gemma 3n model test](https://assets-v2.circle.so/bx2ljlkm93rf3zulfs5ucia3m3fo)
### **Cutting-Edge AI Research** 1. A new study has confirmed for the first time that **large language models** (LLMs) also experience a "Grokking" phenomenon during **pre-training**. This means that even after the training loss converges, the model's **generalization performance** keeps getting better, revealing the shift from **memorization to generalization**. 🤯🔍 Researchers have developed two novel and efficient **metrics** that can accurately predict the **generalization improvements** of **large foundation models** without needing downstream task fine-tuning or testing. This gives LLM pre-training a super handy monitoring tool. 🧠 ['Paper Link'](https://arxiv.org/abs/2506.21551) 2. MADrive is a **memory-augmented** **driving scene modeling** framework that pushes past the limits of existing **3D Gaussian Splatting** techniques. It achieves **photorealistic synthesis** of significantly altered or entirely new **autonomous driving environments** by retrieving and integrating similar **3D vehicle assets** from a large external memory bank. 🚗💨 This innovation seriously boosts the flexibility and realism of scene reconstruction, giving **autonomous driving** simulations way more powerful support. 🌐 ['Paper Link'](https://arxiv.org/abs/2506.21520) ### **Top Open-Source Projects** 1. Black Forest Labs just **open-sourced** its **FLUX.1Kontext [dev]** image editing model. This model is a real game-changer with its **context-aware image editing** capabilities, letting you precisely modify existing images based on text instructions while keeping the style consistent. People are saying its performance is on par with **GPT-4o**, and it even runs on consumer-grade hardware. 🎨✨ This model aims to lower the barrier for professional image editing and spark innovation within the open-source community. 🚀 ['Project Link'](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev)
![FLUX.1Kontext image editing](https://autoproxy.justlikemaki.vip/?pp=https://pic.chinaz.com/2025/0627/6388661124441853705469566.png)
2. **ottomator-agents** is an **open-source AI agent** project hosted on the oTTomator Live Agent Studio platform. It's racked up **2336** stars and offers developers a super flexible AI agent solution for building all sorts of smart apps. 🌟💻 ['Project Link'](https://github.com/coleam00/ottomator-agents) 3. **rl-swarm** is a totally **open-source** framework focused on creating **RL training swarms** over the internet, and it's got **824** stars. 🌐🧠 The project aims to simplify large-scale **reinforcement learning** training, offering a distributed solution for research and development. ['Project Link'](https://github.com/gensyn-ai/rl-swarm) 4. **microui** is a tiny immediate-mode UI library with **4351** stars, focused on delivering simple and efficient user interface solutions. ⚙️📏 ['Project Link'](https://github.com/rxi/microui) 5. **jsoncrack.com** is an innovative and **open-source** visualization app that can turn various data formats like JSON, YAML, XML, and CSV into interactive diagrams. It's currently sitting at **38496** stars. 📊✨ ['Project Link'](https://github.com/AykutSarac/jsoncrack.com) 6. **Best-websites-a-programmer-should-visit** is a super popular collection of **must-visit websites for programmers**, boasting a whopping **69196** stars. It's designed to hook up developers with tons of learning and tool resources. 📚🤓 ['Project Link'](https://github.com/sdmg15/Best-websites-a-programmer-should-visit) ### **Social Media Shares** 1. Jiayuan dropped some deep insights on **how to build a Coding Agent**, pointing out that popular products like **Gemini CLI**, **Claude Code**, and **Cursor Agent** share similar underlying **architectures**. 🧑‍💻💡 He recommended an older video share that breaks down **Coding Agent building** from a macro perspective, offering valuable learning resources for interested developers.
![Coding Agent building share](https://pbs.twimg.com/media/GucYQlXagAApa22?format=jpg&name=orig)
['More Details'](https://x.com/tisoga/status/1938545123404783617) 2. Xiao Qiu Hen Xing shared a best practices guide for '**Vibe Coding**,' an **AI programming** method that combines the **Cursor** terminal with **Claude Code**. 🚀✨ The guide spells out how to use Claude Code to generate technical solutions, have Cursor review and tweak them, implement the code, and finally complete the code review process. ['More Details'](https://m.okjike.com/originalPosts/685e6a8d1e38b2a5382ec568) 3. Li Dengdeng shared their real-world experience with **Xiaomi AI Glasses**, noting they look **stylish** with a 'tech-forward' vibe. However, the photo function had issues like **lens glare**, **low pixel count**, **no anti-shake**, and **insufficient light intake**, leading to less-than-ideal photos that even looked a bit like 'sneaky shots.' 👓📸😅
![Xiaomi AI Glasses experience](https://cdnv2.ruguoapp.com/FnwSbRO8V-0qQd--BwSMvqm4JYVev3.jpg)

![Xiaomi AI Glasses worn](https://cdnv2.ruguoapp.com/FvxUKr5Zn8Cdd_UHFbVaGd_-N63bv3.jpg)
['More Details'](https://m.okjike.com/originalPosts/685e414ff432421164e9aeda) 4. Wang Xuan Leo highlighted a key detail from the **Xiaomi launch event**: the **Xiaomi SU7**'s **intelligent driving** system uses **NVIDIA's Thor series chips**. 🚗⚡️ The author thinks that compared to other brands using multiple Orin chips, and considering the price, **Mr. Lei's** decision really shows off its high cost-effectiveness and advanced tech. 👍
![Xiaomi SU7 intelligent driving](https://cdnv2.ruguoapp.com/Fq778kq_DuRq8S25Pj1eTqBe43_3v3.png)
['More Details'](https://m.okjike.com/originalPosts/685df372d82bae994a83ab09) 5. Karl's AI Watts shared an experiment featuring a 'royal rumble' of **command-line programming AI agents**. 🤖💥 Six contenders (including **claude-code**, **gemini**, and others) were tasked with **finding and eliminating other processes** to be the last one standing, showing off how fun AI vs. AI battles can be. 🎮 ['More Details'](https://x.com/aiwarts/status/1938331396373967094) 6. Baoyu shared an article by Paul Sangle-Ferriere, co-founder of cubic, revealing how they successfully cut the false positive rate of their **AI code review assistant** by 51%. They did this by making AI provide **reasoning logs**, streamlining their **toolset**, and using **dedicated micro-agents**, making it quieter and more accurate. 🛠️💡 These insights offer major takeaways for designing super-efficient **AI agents**. 🎯 ['More Details'](https://baoyu.io/translations/learnings-from-building-ai-agents)
![AI code review assistant optimization](https://baoyu.io/uploads/2025-06-26/1750961084743.png)
7. ChatV shared a cool, unique **AI conversation hack**: after diving deep into a chat with AI, they ask the AI to recap and summarize its own **thinking characteristics** (described in 10 everyday sentences) and give **suggestions for better AI conversations** (also in 10 everyday sentences). 🤔💬 This method isn't just about helping users **understand themselves**; it also helps **optimize future AI interaction experiences**. ✨ ['More Details'](https://m.okjike.com/originalPosts/685d84ac2b50c68918c64ea9) --- ## **Listen to the Audio Version of the AI Daily Report** | 🎙️ **Xiaoyuzhou** | 📹 **Douyin** | | --- | --- | | [Laisheng Xiaojiuguan](https://www.xiaoyuzhoufm.com/podcast/683c62b7c1ca9cf575a5030e) | [Laisheng Intelligence Station](https://www.douyin.com/user/MS4wLjABAAAAwpwqPQlu38sO38VyWgw9ZjDEnN4bMR5j8x111UxpseHR9DpB6-CveI5KRXOWuFwG) | | ![Xiaojiuguan](https://s1.imagehub.cc/images/2025/06/24/f959f7984e9163fc50d3941d79a7f262.md.png) | ![Intelligence Station](https://s1.imagehub.cc/images/2025/06/24/7fc30805eeb831e1e2baa3a240683ca3.md.png) |