Stop Scrolling, Start Searching: How I Built a Voice Note Search Engine with Python

Speaker Andile Mbele
Track Data Science and Engineering
Type Regular talk (45 minutes)

Abstract

How many times have you scrolled through endless WhatsApp voice notes trying to find that one important detail? I had grown frustrated of having to sift through voice notes from friends and colleagues to find that one detail they mentioned. One day, I shared a simple WhatsApp status update about this frustration and it turned out I wasn't the only one facing this problem, we use audio a lot to communicate, yet it's not so easy to browse or search through. This sparked an idea, EchoFind—a Python-powered web application that makes voice notes searchable like text messages.


In this talk, I'll walk you through the complete journey of solving a real-world problem using Python's rich ecosystem. Starting with OpenAI's Whisper API for speech-to-text transcription, we'll explore how to build a robust search engine that can pinpoint exact words in audio files and jump to those moments instantly.


You'll learn about:



  • Integrating OpenAI's Whisper API with Python for accurate speech transcription

  • Building efficient text search algorithms with word-level timestamps

  • Creating a FastAPI backend that handles file uploads and audio processing

  • Implementing real-time search with fuzzy matching for better user experience

  • Handling edge cases like accents, background noise, and multiple languages

  • Deploying a production-ready application that processes audio files securely


The talk includes live demonstrations of the application and real code examples, showing how Python's simplicity enabled rapid prototyping while its powerful libraries made production deployment seamless. We'll also discuss the cultural impact of voice messaging in Africa and how technology can bridge accessibility gaps.


Whether you're a beginner curious about API integration or an experienced developer interested in audio processing, you'll leave with practical knowledge about building AI-powered applications that solve everyday problems. Most importantly, you'll see how a simple observation about user experience can evolve into a fully functional product using Python.


The complete source code and deployment guide will be shared with attendees.