π― Inspiration Communication should be seamless and inclusive, but language barriers often make it challenging. Whether in business meetings, classrooms, or everyday conversations, the need for real-time speech translation is more crucial than ever. Instead of relying on WiFi, why can't it be an offline device that can be used by children and the elderly just by pressing a button? Inspired by this, we created VerbaLinkβa wearable device that transcribes and translates speech instantly, making conversations accessible to everyone.
π What It Does VerbaLink is a real-time speech transcription and translation system that: β Records spoken language using an embedded microphone. β Transcribes audio with high accuracy using Faster Whisper AI. β Translates speech into multiple languages via Google Translate. β Transfers files & text over Serial to integrate with external systems.
Itβs fast, portable, and efficient, making it perfect for education, accessibility, international communication, and more.
π§ How We Built It We combined hardware and software for a seamless experience:
Arduino + TMRpcm: Handles real-time audio recording and playback. SD Card Storage: Saves WAV files for processing and transcription. Faster Whisper Model: Provides highly accurate speech-to-text transcription. Google Translate API: Converts text into multiple languages. Python Serial Communication: Ensures smooth data transfer between hardware and software.
π§ Challenges We Ran Into π΄ SPI & I2C Conflicts: The design was supposed to display the received text on an LCD, but there was either a library conflict or a memory overload. π΄ Hardware Limitations: The entire transcription process was supposed to be done by the Arduino itself, but the computing power is not enough. π΄ Serial Data Corruption: WAV file transfers were sometimes corrupt due to buffer overflows. π΄ Real-Time Optimization: Ensuring low latency transcription & translation while maintaining accuracy was tricky.
π Accomplishments That We're Proud Of π Built a fully functional speech transcription & translation device. π Resolved complex hardware-software integration issues. π Achieved real-time processing with minimal latency. π Created a system that can be expanded for accessibility & multilingual applications.
π What We Learned π‘ Efficient resource management is key in embedded systems. π‘ Recording audio signal with a mic amplifier π‘ Soldering is not as easy as YouTube tutorials π‘ SD card integration in Arduino systems. π‘ Python-Arduino serial communication requires buffering & proper timing for large file transfers. π‘ Transcription models like Faster Whisper can achieve high accuracy with the right parameters.
π Whatβs Next for VerbaLink? πΉ Chip improvement: Use a better chip to transcribe the data without transferring the data to a PC πΉ Voice Output (TTS): Convert translated text into natural speech playback. πΉ Multi-Speaker Support: Differentiate voices for group conversations. Able to translate from English to other languages by adjusting the config on a menu. πΉ Smaller & More Portable Version: Optimizing for wearable devices or smart glasses.
VerbaLink is just the beginning. The future of real-time multilingual communication is here!

Log in or sign up for Devpost to join the conversation.