GEEK HAUS
back to sources

Stories from VentureBeat

8 articles

·VentureBeat

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out

Moonshot AI released Kimi K2.7-Code, an open-source update to its K2 coding model line that uses the same trillion-parameter mixture-of-experts architecture as K2.6 and supports Op...

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out
read
·VentureBeat

Google researchers introduce 'faithful uncertainty,' allowing LLMs to offer best guesses instead of hallucinations

Google researchers introduced faithful uncertainty, a method that helps LLMs align responses with their internal confidence and offer qualified “best guesses” when appropriate. The...

Google researchers introduce 'faithful uncertainty,' allowing LLMs to offer best guesses instead of hallucinations
read
·VentureBeat

Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights

Microsoft released SkillOpt, an MIT-licensed framework that treats AI agent skill documents as optimizable objects rather than static instructions. The system uses performance feed...

Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights
read
·VentureBeat

Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks

Xiaomi’s MiMo AI team released MiMo Code V0.1.0, an MIT-licensed, terminal-native AI coding assistant based on the OpenCode agent. The company says it outperformed Claude Code on l...

Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks
read
·VentureBeat

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

UC Berkeley RDI and more than 300 experts launched Agents’ Last Exam, a benchmark meant to test whether AI agents can complete long, economically valuable professional workflows. O...

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark
read
·VentureBeat

Researchers say they trained a foundation model from scratch for about $1,500

Sapient researchers say their HRM-Text architecture trained a 1B-parameter foundation model from scratch using far less data and compute than standard Transformer-based LLMs. The m...

Researchers say they trained a foundation model from scratch for about $1,500
read
·VentureBeat

Apple’s new Siri AI is more than just a smarter assistant — it's a new enterprise app layer

Apple’s WWDC 2026 updates recast Siri as a systemwide interface for enterprise app content, data, and actions across iPhone, iPad, Mac, Apple Watch, and Vision Pro. Developers can...

Apple’s new Siri AI is more than just a smarter assistant — it's a new enterprise app layer
read
·VentureBeat

Cohere open-sources a coding agent that runs on a single H100

Cohere launched North Mini Code, an open-source 30B-parameter mixture-of-experts model for agentic software engineering that can run on a single HThe model supports a 256K-token co...

Cohere open-sources a coding agent that runs on a single H100
read