Models & Agents Blog
Daily AI briefing on models, agent frameworks, and practical AI.
Naver's Seoul World Model grounds video generation in real Street View geometry from over a million images and generalizes to other cities without fine-tuning.
New arXiv papers expose critical flaws in how we evaluate depression-detection models, LLM pruning, and verbalized confidence.
Fair zero-determinant strategies break in the periodic prisoner's dilemma, unlike the classic repeated version.
TrustFlow introduces topic-aware vector reputation for multi-agent systems, replacing scalar scores with queryable multi-dimensional vectors.
LlamaIndex drops LiteParse, a spatial PDF parser built specifically for agentic RAG workflows.
Picsart launches AI agent marketplace, starting with four agents and adding more weekly for creators.
RL agents scaled to 1,024 layers unlock emergent parkour skills from basic failures.
Google DeepMind's Aletheia agent autonomously advances from IMO math to professional research discoveries.
Perplexity launches "Personal Computer," a $200/month AI agent that automates emails, presentations, and app control 24/7.
Nvidia plans $26B investment in open-weight AI models to counter Chinese dominance and lock in developers.
Google unveils Gemini Embedding 2, a multimodal model embedding text, images, video, audio, and docs for advanced RAG systems.
Meta acquires Moltbook, a Reddit-like platform for AI agents to interact and collaborate.
Claude Opus 4.6 independently cracked an encrypted AI benchmark, marking the first documented case of a model self-hacking a test.
Meta's new research trains multimodal AI on unlabeled video, challenging assumptions about text-heavy scaling.
Anthropic's Claude AI discovered over 100 Firefox vulnerabilities that human testing missed for decades.
Liquid AI launches LFM2-24B-A2B model and LocalCowork app for fully local, privacy-first agent workflows.
YuanLab AI launches Yuan 3.0 Ultra, a 1T-parameter multimodal MoE model cutting parameters by 33% while boosting efficiency 49%.
FireRedTeam releases FireRed-OCR-2B, a 2B-parameter model tackling structural hallucinations in document parsing for tables and LaTeX.
Alibaba open-sources CoPaw, a workstation for scaling multi-channel AI agent workflows.
Perplexity open-sources embedding models that match Google and Alibaba performance at a fraction of the memory cost.
Sakana AI launches Doc-to-LoRA and Text-to-LoRA hypernetworks for zero-shot LLM adaptation to long contexts via natural language.
Anthropic acquires Vercept to enhance Claude's screen reading, while Google launches Nano Banana 2 for faster, cheaper image generation.
Planetterrian Daily
Omni View
Models & Agents for Beginners
Fascinating Frontiers
Modern Investing Techniques
Tesla Shorts Time
Environmental Intelligence
Финансы Просто
Привет, Русский!