Tutorials
Clone MedTalk: HIPAA-Ready Video and Chat Consultations in Flutter
Telehealth is transforming the way patients and providers connect, offering faster access to care and reducing barriers caused by distance or scheduling. A critical part of this experience is enabling secure, real-time video consultations alongside features like chat messaging for sharing updates, questions, and follow-ups. With Stream's healthcare chat solution, developers can build HIPAA-ready communication
Read more ->
24 min read
Build a Voice-Controlled GitHub Agent in Python (MCP + Vision Agents)
Turn any GitHub repo into a voice assistant: ask about branches, open issues, create pull requests, list contributors—all via natural conversation. Powered by OpenAI's Realtime API for low-latency voice, GitHub's Model Context Protocol (MCP) for secure repo actions, and Vision Agents for seamless orchestration. In the demo, the agent understands spoken repo names (even when
Read more ->
4 min read
Build a Drive-Thru Voice AI Ordering System With Gemini Live Speech-to-Speech
Drive-thru ordering is a deceptively hard real-time problem. Background noise, interruptions, fast-paced conversations, and the need for low-latency responses all push traditional voice systems to their limits. Modern speech-to-speech models change that equation by making natural, interruptible conversations possible without stitching together separate STT, LLM, and TTS pipelines. In this tutorial, you’ll create a real-time
Read more ->
9 min read
Build a Realtime Video Restyling Agent with Gemini 3 + Decart AI
Google's Gemini 3, released November 18, 2025, gives you multimodal reasoning and tool-use for building response-accurate AI applications. Let's combine it with Decart AI and other leading LLM services to turn casual voice commands into artistic live video style changes, no extra scaffolding required. Pair it with Decart AI's Mirage LSD, the first live-stream diffusion
Read more ->
4 min read
Build an AI Math & Physics Agent with DeepSeek v3.2
DeepSeek recently released a powerful new model, DeepSeek-V3.2, that's now instantly accessible via OpenRouter. In under 5 minutes, you can turn it into a real-time, voice-enabled math and physics agent that not only solves problems but also explains its reasoning out loud. DeepSeek's latest open-source reasoning and agent-AI model, V3.2, leverages the new DeepSeek Sparse
Read more ->
4 min read
Build an AI Assistant with React Native
In this tutorial, we will demonstrate how easy it is to create an AI assistant for React Native using Stream Chat. As an example, we will leverage the StreamChat integration with Vercel's AI SDK; however, developers are free to use whichever LLM provider they like and still benefit from Stream's rich UI support for Markdown, tables, code samples, charts etc.
Read more ->
14 min read
Build an AI Assistant for iOS Using Swift
In this tutorial, we will demonstrate how easy it is to create an AI assistant for iOS using Stream Chat. In this example, we will use the Stream Chat integration with Vercel's AI SDK; however, developers are free to use whichever LLM provider they like and still benefit from Stream's rich UI support for Markdown, tables, code samples, charts etc.
Read more ->
10 min read
Build an AI Assistant for Android Using Compose
In this tutorial, we will demonstrate how easy it is to create an AI assistant for Android using Stream Chat. In this example, we will use the Stream Chat integration with Vercel's AI SDK; however, developers are free to use whichever LLM provider they like and still benefit from Stream's rich UI support for Markdown,
Read more ->
15 min read