⚡NVIDIA is making AI cheaper & more accessible

INSIDE: Open models, Gemini’s next step, voice AI, why startups fail

In partnership with

✨ Welcome back, Entrepreneurs

In this issue: Google quietly hints at a Gemma 4 open-source drop, open-source voice models get emotional with Chatterbox Turbo, Claude turns into a giftable consumer product, OpenAI hardens its realtime audio stack for production use, NVIDIA pushes open agentic models with Nemotron 3, and VCs deliver a reality check on why most consumer AI apps still don’t stick.

BIG AI MOVES

Image Source: Hugging Face

🔹 Google Hints at Gemma 4 Open-Source Drop

Google has teased a likely Gemma 4 release on Hugging Face, signaling the next version of its lightweight, open-source models built on Gemini tech, designed to run on phones and laptops, support multimodal tasks, and power custom apps like medical imaging and code generation, with no launch yet but a history of surprise releases fueling anticipation.

🔹 Resemble AI Drops Chatterbox Turbo Open-Source Voice Model

Resemble AI has released Chatterbox Turbo, a fast open-source voice AI model that adds emotion-aware speech through typed paralinguistic tags like laughs, sighs, and gasps, delivers ~6Ă— faster real-time factor performance, and embeds PerTh watermarking in every audio output, now live on Hugging Face and GitHub with independent benchmarks comparing it to ElevenLabs, Cartesia, and Vibevoice.

🔹 Anthropic Launches Claude Gift Cards

Anthropic has introduced Claude gift cards, allowing users to purchase access to Claude as a gift via claude.ai/gift, positioning AI subscriptions as a consumer-friendly product for the holiday season and signaling a push beyond developers toward mainstream users.

⚡OpenAI Upgrades Realtime Audio Models: Reliability Takes a Big Leap

OpenAI has released new snapshots of its Realtime Audio models, delivering major gains in transcription accuracy, speech quality, and live interaction reliability, especially for noisy environments and non-English languages.

Image Source: OpenAI

Highlights

  • gpt-4o-mini-transcribe-2025-12-15 cuts hallucinations by 89% vs. Whisper-1 and improves accuracy in short speech

  • gpt-4o-mini-tts-2025-12-15 delivers 35% fewer word errors, with smoother, more natural audio output

  • gpt-realtime-mini-2025-12-15 shows 22% better instruction following and 13% gains in function calling

  • Stronger performance in noisy settings and non-English languages like Hindi and Chinese

  • Designed for more stable, production-ready voice apps and real-time assistants

Why it matters:
These updates push OpenAI’s voice stack closer to reliable, human-grade audio interaction reducing errors, hallucinations, and friction in real-time use cases just as competition in voice AI accelerates.

Partner Spotlight/.TheCode

The Tech newsletter for Engineers who want to stay ahead

Tech moves fast, but you're still playing catch-up?

That's exactly why 100K+ engineers working at Google, Meta, and Apple read The Code twice a week.

Here's what you get:

  • Curated tech news that shapes your career - Filtered from thousands of sources so you know what's coming 6 months early.

  • Practical resources you can use immediately - Real tutorials and tools that solve actual engineering problems.

  • Research papers and insights decoded - We break down complex tech so you understand what matters.

All delivered twice a week in just 2 short emails.

⚡ NVIDIA Launches Nemotron 3: Open, Efficient Models for Agentic AI

NVIDIA has unveiled Nemotron 3, a new family of open models designed for agentic reasoning, long-context tasks, and high-throughput AI workloads, starting with the release of Nemotron 3 Nano and detailed technical reports.

Image Source: NVIDIA

Highlights

  • Three models: Nano, Super, and Ultra built for efficiency, collaboration, and state-of-the-art reasoning

  • Nemotron 3 Nano (3.2B active params) beats larger rivals like GPT-OSS-20B and Qwen3-30B on key benchmarks

  • Up to 3.3Ă— higher inference throughput on a single H200 GPU

  • Supports up to 1M token context, outperforming peers across long-context evaluations

  • Uses hybrid Mamba-Transformer MoE, plus LatentMoE, multi-token prediction, and NVFP4 (Super/Ultra)

Why it matters:
Nemotron 3 signals NVIDIA’s push to make open, long-context, agentic AI both high-performance and cost-efficient, giving builders serious alternatives to closed frontier models for production workloads.

⚡ VCs Weigh Why Consumer AI Startups Still Struggle to Last

VCs explain why most consumer AI startups still struggle to stick, even in a world flooded with generative tools. Adoption spikes are easy; building products people return to (and pay for) is proving much harder.

Image Source: Ideogram/TheAIEntrepreneurs

Highlights

  • Even three years into the generative AI boom, most startups still make money from selling to businesses, not individual consumers, according to VCs.

  • General-purpose AI like ChatGPT sees broad adoption, but specialized consumer GenAI apps haven’t resonated widely.

  • Investors say the next big consumer AI wave might require new personal devices or radically different experiences to sustain long-term growth.

Why it matters
Despite the hype around consumer AI, many startups struggle to sustain interest and revenue, showing that success requires clear utility and business models.

Tool Spotlight/Levanta

The Future of Shopping? AI + Actual Humans.

AI has changed how consumers shop, but people still drive decisions. Levanta’s research shows affiliate and creator content continues to influence conversions, plus it now shapes the product recommendations AI delivers. Affiliate marketing isn’t being replaced by AI, it’s being amplified.

🩺 AIHealthTech Insider: AI Is Catching What Medicine Misses

AI is no longer experimental in care. From algorithms flagging cancer early to digital ADHD therapy, dementia detection, noninvasive glucose monitoring, and AI-designed vaccines, these tools are already reshaping clinical decisions.

👉 Read AIHealthTech Insider: Issue #79 for the full roundup.

Interested in AIHealthTech Insider?

Are you interested in receiving the AIHealthTech Insider newsletter directly to your inbox? Stay updated on the latest AI-driven healthcare innovations.

Login or Subscribe to participate in polls.

🎥 Stream Your Imagination with Odyssey-2

Say goodbye to static video. Odyssey-2 lets you type a few words and instantly stream interactive AI-generated video no editing, no waiting, just pure creative flow.

How It Works: 

  1. Type Your Prompt → Try “a robot surfing lava” or “neon jungle at night.”

  2. Hit “Stream” → Watch your idea come to life in seconds.

  3. Interact Live → Pause, remix, or evolve the scene as it plays.

  4. Share or Save → Capture moments or export clips for your projects.

Pro Tip: Use Odyssey-2 for pitch decks, storytelling, or just wild creative play. It’s like directing a dream in real time.

🚀 Sunday AI: Agentic AI Just Hit Its Inflection Point

Autonomous agents are moving from experiments to infrastructure. This Sunday Special maps the shift—hyperscaler agent marketplaces, government adoption signals, must-attend AI events, learning tracks, and hiring where demand is spiking fast.

👉 Read Sunday Special Issue #26 for the full briefing.

Would you like to get this curated AI Events, Courses & Jobs roundup sent straight to your email every Sunday?

Login or Subscribe to participate in polls.

Editor’s Pick/Lindy AI

Build smarter, not harder: meet Lindy

Tired of AI that just talks? Lindy actually executes.

Describe your task in plain English, and Lindy handles it—from building booking platforms to managing leads and sending team updates.

AI employees that work 24/7:

  • Sales automation

  • Customer support

  • Operations management

Focus on what matters. Let Lindy handle the rest.

Reply

or to participate.