{"name":"parakeet-mlx: Nvidia's Parakeet ASR Models on Apple Silicon with MLX","description":"parakeet-mlx is an open-source project that implements Nvidia's advanced Automatic Speech Recognition (ASR) Parakeet models for Apple Silicon, leveraging the MLX framework for optimized performance. This Python library offers both a command-line interface and a flexible Python API, enabling efficient transcription of audio files, including real-time streaming capabilities. It provides a powerful solution for developers and researchers working with speech processing on Apple hardware.","github":"https://github.com/senstella/parakeet-mlx","url":"https://osrepos.com/repo/senstella-parakeet-mlx","source":"osrepos.com","sourceDescription":"This repository profile is provided by osrepos.com, an open source repository discovery platform.","repositoryProfile":"https://osrepos.com/repo/senstella-parakeet-mlx","generatedFor":"open source discovery and AI-assisted research","markdown":"https://osrepos.com/repo/senstella-parakeet-mlx.md","json":"https://osrepos.com/repo/senstella-parakeet-mlx.json","topics":["Python","ASR","Speech Recognition","MLX","Apple Silicon","Deep Learning","Audio Processing","Transcription"],"keywords":["Python","ASR","Speech Recognition","MLX","Apple Silicon","Deep Learning","Audio Processing","Transcription"],"stars":null,"summary":"parakeet-mlx is an open-source project that implements Nvidia's advanced Automatic Speech Recognition (ASR) Parakeet models for Apple Silicon, leveraging the MLX framework for optimized performance. This Python library offers both a command-line interface and a flexible Python API, enabling efficient transcription of audio files, including real-time streaming capabilities. It provides a powerful solution for developers and researchers working with speech processing on Apple hardware.","content":"## Introduction\n\n`parakeet-mlx` is an implementation of Nvidia's Parakeet models, which are Automatic Speech Recognition (ASR) models, optimized for Apple Silicon using the MLX framework. This open-source project allows users to efficiently transcribe audio files, leveraging Apple hardware for superior performance.\n\nWith `parakeet-mlx`, you can easily convert speech to text using a straightforward command-line interface (CLI) or integrate advanced ASR capabilities into your Python applications. It supports various output options, including subtitles with word-level timestamps, and offers features like beam decoding, audio chunking for long files, and real-time streaming transcription.\n\n## Installation\n\nBefore installing, make sure you have `ffmpeg` installed on your system, as it is required for the CLI to work properly.\n\n**Using `uv` (recommended):**\n\nTo add as a project dependency:\n\nbash\nuv add parakeet-mlx -U\n\n\nOr, for the CLI globally:\n\nbash\nuv tool install parakeet-mlx -U\n\n\n**Using `pip`:**\n\nbash\npip install parakeet-mlx -U\n\n\n## Examples\n\n<p><strong>CLI Quick Start</strong></p>\n\nTranscribe a single audio file:\n\nbash\nparakeet-mlx audio.mp3\n\n\nTranscribe multiple files and generate VTT subtitles with word-level timestamps:\n\nbash\nparakeet-mlx *.mp3 --output-format vtt --highlight-words\n\n\nGenerate all available output formats:\n\nbash\nparakeet-mlx audio.mp3 --output-format all\n\n\n<p><strong>Python API Quick Start</strong></p>\n\n**Transcribe a file:**\n\npython\nfrom parakeet_mlx import from_pretrained\n\nmodel = from_pretrained(\"mlx-community/parakeet-tdt-0.6b-v3\")\n\nresult = model.transcribe(\"audio_file.wav\")\n\nprint(result.text)\n\n\n**Check timestamps:**\n\npython\nfrom parakeet_mlx import from_pretrained\n\nmodel = from_pretrained(\"mlx-community/parakeet-tdt-0.6b-v3\")\n\nresult = model.transcribe(\"audio_file.wav\")\n\nprint(result.sentences)\n# [AlignedSentence(text=\"Hello World.\", start=1.01, end=2.04, duration=1.03, tokens=[...])]\n\n\n**Do chunking:**\n\npython\nfrom parakeet_mlx import from_pretrained\n\nmodel = from_pretrained(\"mlx-community/parakeet-tdt-0.6b-v3\")\n\nresult = model.transcribe(\"audio_file.wav\", chunk_duration=60 * 2.0, overlap_duration=15.0)\n\nprint(result.sentences)\n\n\n**Streaming Transcription:**\n\nFor real-time transcription, use the `transcribe_stream` method:\n\npython\nfrom parakeet_mlx import from_pretrained\nfrom parakeet_mlx.audio import load_audio\nimport numpy as np\n\nmodel = from_pretrained(\"mlx-community/parakeet-tdt-0.6b-v3\")\n\n# Create a streaming context\nwith model.transcribe_stream(\n    context_size=(256, 256),  # (left_context, right_context) frames\n) as transcriber:\n    # Simulate real-time audio chunks\n    audio_data = load_audio(\"audio_file.wav\", model.preprocessor_config.sample_rate)\n    chunk_size = model.preprocessor_config.sample_rate  # 1 second chunks\n\n    for i in range(0, len(audio_data), chunk_size):\n        chunk = audio_data[i:i+chunk_size]\n        transcriber.add_audio(chunk)\n\n        # Access current transcription\n        result = transcriber.result\n        print(f\"Current text: {result.text}\")\n\n\n## Why Use `parakeet-mlx`?\n\n`parakeet-mlx` stands out as an essential tool for anyone needing high-performance ASR capabilities on Apple Silicon devices.\n\n*   **Optimized for Apple Silicon:** By leveraging the MLX framework, `parakeet-mlx` delivers native and efficient performance, making it ideal for Mac users.\n*   **High-Quality ASR:** It implements Nvidia's Parakeet models, known for their accuracy and robustness in speech recognition.\n*   **Versatility:** Whether you prefer a command-line tool for quick tasks or a flexible Python API for integration into larger projects, `parakeet-mlx` has you covered.\n*   **Advanced Features:** From detailed word and sentence-level timestamps to advanced decoding options and real-time streaming transcription, the project offers a rich set of functionalities for diverse needs.\n*   **Ease of Use:** With clear installation instructions and comprehensive examples, it is accessible to both beginners and experienced developers.\n\n## Links\n\nFor more details, documentation, and to contribute to the project, visit the official GitHub repository:\n\n*   [parakeet-mlx GitHub Repository](https://github.com/senstella/parakeet-mlx){target=\"_blank\"}","metrics":{"detailViews":3,"githubClicks":2},"dates":{"published":null,"modified":"2026-01-29T20:01:24.000Z"}}