Whisper Web: ML-Powered Speech Recognition Directly in Your Browser
This repository profile is provided by osrepos.com, an open source repository discovery platform.

Summary
Whisper Web brings powerful, ML-powered speech recognition directly to your browser, leveraging ? Transformers.js. This innovative project allows for client-side audio processing, offering privacy and efficiency without relying on cloud services. It even includes experimental WebGPU support for accelerated performance.
Repository Information
Topics
Click on any tag to explore related repositories
Use at your own risk
OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of code from these repositories is the user's own responsibility. Always review the repository, source code, dependencies, licenses, and security implications before running or installing anything. OSRepos is not responsible for issues, damages, or losses resulting from third-party repositories.
Introduction
Whisper Web, developed by Xenova, offers cutting-edge, ML-powered speech recognition directly within your web browser. This project leverages the power of ? Transformers.js to perform complex machine learning tasks entirely client-side, eliminating the need for server-side processing. It's designed for privacy and efficiency, allowing users to transcribe audio without sending data to external servers. Furthermore, Whisper Web includes experimental WebGPU support, enabling GPU acceleration for even faster transcription directly in compatible browsers.
Installation
To get Whisper Web running locally, follow these simple steps:
Clone the repository and install dependencies:
git clone https://github.com/xenova/whisper-web.git cd whisper-web npm installRun the development server:
npm run devNote for Firefox users: You may need to change the
dom.workers.modules.enabledsetting inabout:configtotrueto enable Web Workers. More details can be found in this issue.Open the link (e.g., http://localhost:5173/) in your browser.
Examples
Experience Whisper Web in action through its live demos:
Main Demo Site: https://huggingface.co/spaces/Xenova/whisper-web
Experimental WebGPU Demo: https://huggingface.co/spaces/Xenova/whisper-webgpu
Why Use It?
Whisper Web stands out for several compelling reasons:
Client-Side Processing: All speech recognition happens directly in the user's browser, enhancing privacy and enabling offline functionality.
Performance: With experimental WebGPU support, it can leverage your device's GPU for significantly faster transcription speeds.
Ease of Integration: Built with TypeScript and JavaScript, it's straightforward to integrate into web applications.
Open Source: Licensed under MIT, it's free to use, modify, and distribute.
Links
GitHub Repository: https://github.com/xenova/whisper-web
Main Demo Site: https://huggingface.co/spaces/Xenova/whisper-web
Experimental WebGPU Demo: https://huggingface.co/spaces/Xenova/whisper-webgpu
Transformers.js: https://github.com/xenova/transformers.js
Related repositories
Similar repositories that may be relevant next.

Loop Engineering: Orchestrating AI Agents with Practical Patterns and Tools
June 25, 2026
Loop Engineering is a GitHub repository offering practical patterns, starters, and CLI tools for building robust AI coding agent systems. It shifts the focus from individual prompt crafting to designing control systems that orchestrate agents over time. This project empowers developers to create autonomous, iterative AI workflows for various development tasks.
Pic Smaller: Intelligent Online Image Compression for Web and Mobile
June 20, 2026
Pic Smaller is an intuitive and powerful online tool designed for intelligently compressing various image formats, including JPEG, PNG, WEBP, AVIF, SVG, and GIF. It offers a user-friendly experience with automatic compression and customizable output options, all while ensuring complete safety through purely local processing without server-side logic.
My Virtual World: A Self-Hosted 3D AI Virtual World for Agent Harnesses
June 16, 2026
My Virtual World is an innovative self-hosted 3D AI virtual environment designed for agent harnesses such as OpenClaw and Hermes. It allows AI agents to live, work, move between buildings, and interact with objects in a persistent voxel-style world. This project offers a unique platform for developing and observing AI agent behaviors in a rich, customizable 3D setting.
Deliberation: Multi-Agent LLM Consensus for Code and Plan Review
June 15, 2026
Deliberation is an innovative GitHub repository that enables Claude Code to leverage multiple LLMs like GPT, Gemini, Grok, and 400+ OpenRouter models for expert second opinions and arbiter-mediated consensus. It provides specialized AI agents for tasks such as code review, security analysis, and architectural design, ensuring comprehensive and reliable feedback. This project helps developers get diverse perspectives and achieve higher quality in their work.
Source repository
Open the original repository on GitHub.