Skip to content

Savinay99/voiceOS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎙️ VoiceOS — AI Speech Web App

VoiceOS is a modern, interactive AI-powered web application that enables real-time Speech-to-Text (STT) and Text-to-Speech (TTS) directly in the browser. Built with clean UI/UX and seamless user interaction, it leverages Web Speech API with optional Azure Speech integration.


🚀 Features

  • 🎤 Speech-to-Text (STT)

    • Real-time voice recognition
    • Instant transcript generation
    • Copy and clear transcript functionality
  • 🔊 Text-to-Speech (TTS)

    • Convert text into natural speech
    • Voice selection (multiple voices)
    • Adjustable speed and pitch
  • 🎨 Modern UI/UX

    • Responsive design (mobile + desktop)
    • Dark/Light mode toggle
    • Smooth animations and feedback
  • 🔐 Credential Support

    • Azure Speech API key & endpoint input
    • Input validation and error handling
  • Smart Fallback

    • Automatically uses browser Web Speech API if Azure is not connected

🛠️ Tech Stack

  • HTML5
  • CSS3 (Modern UI Design)
  • JavaScript (Vanilla JS)
  • Web Speech API
  • Azure Cognitive Services (Optional)

📂 Project Structure

voiceOS/
│── index.html
│── style.css
│── script.js
│── README.md

🌐 Live Demo

👉 https://Savinay99.github.io/voiceOS/


⚙️ How to Run Locally

  1. Clone the repository:
git clone https://github.com/Savinay99/voiceOS.git
  1. Open in browser using Live Server (recommended)

  2. Allow microphone permission when prompted


🔑 Azure Integration (Optional)

To enable Azure Speech Services:

  1. Go to Azure Portal
  2. Get your API Key and Endpoint
  3. Enter credentials in the app

⚠️ Notes

  • Speech recognition works best on Chrome / Edge
  • Requires HTTPS or localhost
  • Microphone permission is mandatory

📌 Future Improvements

  • Full Azure Speech SDK integration
  • Audio file download support
  • Multi-language auto detection
  • Voice cloning support

👨‍💻 Author

Savinay GitHub: https://github.com/Savinay99


⭐ Support

If you like this project, consider giving it a ⭐ on GitHub!

About

Interactive AI speech application with real-time STT & TTS, built using HTML, CSS, JavaScript and Azure Speech Services.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors