Speech-To-Text

Speech-To-Text

Speech-To-Text

The Speech-To-Text project involved developing a comprehensive accessibility solution that allows students to speak their answers instead of typing them, supporting English, French, and Spanish languages. This innovative system leverages the Web Speech API for real-time speech recognition, integrates with CKEditor through a custom plugin for seamless text editing, and utilizes a Python microservice with FastAPI for advanced punctuation restoration to provide accurate and accessible transcription services.

The Challenge

SchoolCity needed an accessible speech-to-text solution that could support multiple languages while providing accurate transcription and proper punctuation. The challenge included integrating speech recognition with existing text editors, handling real-time audio processing, supporting multiple languages, implementing punctuation restoration algorithms, and ensuring the solution was accessible to students with diverse needs while maintaining high accuracy and performance.

Speech-To-Text
  • Multilingual speech recognition (English, French, Spanish)
  • Real-time speech-to-text conversion
  • Custom CKEditor plugin integration
  • Advanced punctuation restoration
  • Accessibility-focused design
  • Cloud-native microservice architecture

The Solution & Results

Developed a comprehensive speech-to-text system using the Web Speech API for real-time speech recognition across English, French, and Spanish languages. Created a custom CKEditor plugin to seamlessly integrate speech input with text editing capabilities. Built a Python microservice using FastAPI for advanced punctuation restoration and text processing. Implemented the solution using Docker containers deployed on AWS ECS with Terraform for infrastructure management and CI/CD for automated deployment.

Successfully delivered a multilingual speech-to-text service that provides accurate transcription with proper punctuation restoration. The system now enables students with diverse needs to participate more effectively in educational activities, improving accessibility and inclusion. The solution has significantly enhanced the learning experience for students who prefer or require speech input methods.

Speech-To-Text
Speech-To-Text