Whisper Web: ML-Powered Speech Recognition Directly in Your Browser

This repository profile is provided by osrepos.com, an open source repository discovery platform.

Whisper Web: ML-Powered Speech Recognition Directly in Your Browser

Summary

Whisper Web brings powerful, ML-powered speech recognition directly to your browser, leveraging ? Transformers.js. This innovative project allows for client-side audio processing, offering privacy and efficiency without relying on cloud services. It even includes experimental WebGPU support for accelerated performance.

Repository Information

Analyzed by OSRepos on December 5, 2025

Use at your own risk

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of code from these repositories is the user's own responsibility. Always review the repository, source code, dependencies, licenses, and security implications before running or installing anything. OSRepos is not responsible for issues, damages, or losses resulting from third-party repositories.

Introduction

Whisper Web, developed by Xenova, offers cutting-edge, ML-powered speech recognition directly within your web browser. This project leverages the power of ? Transformers.js to perform complex machine learning tasks entirely client-side, eliminating the need for server-side processing. It's designed for privacy and efficiency, allowing users to transcribe audio without sending data to external servers. Furthermore, Whisper Web includes experimental WebGPU support, enabling GPU acceleration for even faster transcription directly in compatible browsers.

Installation

To get Whisper Web running locally, follow these simple steps:

  • Clone the repository and install dependencies:

    git clone https://github.com/xenova/whisper-web.git
    cd whisper-web
    npm install
    
  • Run the development server:

    npm run dev
    

    Note for Firefox users: You may need to change the dom.workers.modules.enabled setting in about:config to true to enable Web Workers. More details can be found in this issue.

  • Open the link (e.g., http://localhost:5173/) in your browser.

Examples

Experience Whisper Web in action through its live demos:

Why Use It?

Whisper Web stands out for several compelling reasons:

  • Client-Side Processing: All speech recognition happens directly in the user's browser, enhancing privacy and enabling offline functionality.

  • Performance: With experimental WebGPU support, it can leverage your device's GPU for significantly faster transcription speeds.

  • Ease of Integration: Built with TypeScript and JavaScript, it's straightforward to integrate into web applications.

  • Open Source: Licensed under MIT, it's free to use, modify, and distribute.

Links

Related repositories

Similar repositories that may be relevant next.

Loop Engineering: Orchestrating AI Agents with Practical Patterns and Tools

Loop Engineering: Orchestrating AI Agents with Practical Patterns and Tools

June 25, 2026

Loop Engineering is a GitHub repository offering practical patterns, starters, and CLI tools for building robust AI coding agent systems. It shifts the focus from individual prompt crafting to designing control systems that orchestrate agents over time. This project empowers developers to create autonomous, iterative AI workflows for various development tasks.

agentic-aiai-agentsloop-engineering
Pic Smaller: Intelligent Online Image Compression for Web and Mobile

Pic Smaller: Intelligent Online Image Compression for Web and Mobile

June 20, 2026

Pic Smaller is an intuitive and powerful online tool designed for intelligently compressing various image formats, including JPEG, PNG, WEBP, AVIF, SVG, and GIF. It offers a user-friendly experience with automatic compression and customizable output options, all while ensuring complete safety through purely local processing without server-side logic.

image-compressionjavascriptwebassembly
My Virtual World: A Self-Hosted 3D AI Virtual World for Agent Harnesses

My Virtual World: A Self-Hosted 3D AI Virtual World for Agent Harnesses

June 16, 2026

My Virtual World is an innovative self-hosted 3D AI virtual environment designed for agent harnesses such as OpenClaw and Hermes. It allows AI agents to live, work, move between buildings, and interact with objects in a persistent voxel-style world. This project offers a unique platform for developing and observing AI agent behaviors in a rich, customizable 3D setting.

ai-agentshermesopenclaw
Deliberation: Multi-Agent LLM Consensus for Code and Plan Review

Deliberation: Multi-Agent LLM Consensus for Code and Plan Review

June 15, 2026

Deliberation is an innovative GitHub repository that enables Claude Code to leverage multiple LLMs like GPT, Gemini, Grok, and 400+ OpenRouter models for expert second opinions and arbiter-mediated consensus. It provides specialized AI agents for tasks such as code review, security analysis, and architectural design, ensuring comprehensive and reliable feedback. This project helps developers get diverse perspectives and achieve higher quality in their work.

ai-agentsllmmulti-agent

Source repository

Open the original repository on GitHub.

View on GitHub
OS
OSRepos

Analysis and discovery of open source repositories. Find interesting projects and follow their updates.

Monitor your website with YourWebsiteScore

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of third-party repository code is at your own risk. Always review source code, dependencies, licenses, and security implications before running anything.

© 2025 OSRepos. Built with Nuxt 3 and lots of ❤️