ggml: A Low-Level Tensor Library for Machine Learning
This repository profile is provided by osrepos.com, an open source repository discovery platform.

Summary
ggml is an innovative tensor library designed for machine learning, emphasizing low-level, cross-platform implementation. It offers features like integer quantization, automatic differentiation, and broad hardware support, all while maintaining zero third-party dependencies and efficient memory usage. This project is actively developed and forms the backbone for other popular projects like llama.cpp and whisper.cpp.
Repository Information
Topics
Click on any tag to explore related repositories
Use at your own risk
OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of code from these repositories is the user's own responsibility. Always review the repository, source code, dependencies, licenses, and security implications before running or installing anything. OSRepos is not responsible for issues, damages, or losses resulting from third-party repositories.
Introduction
ggml is a powerful tensor library specifically engineered for machine learning applications. It stands out for its low-level, cross-platform implementation, making it highly versatile and efficient. The library is under active development, with significant contributions also happening within related projects such as llama.cpp and whisper.cpp.
Key features of ggml include integer quantization support, broad hardware compatibility, and built-in automatic differentiation. It also incorporates ADAM and L-BFGS optimizers, all without relying on any third-party dependencies, ensuring a lean and performant codebase. A notable design principle is its commitment to zero memory allocations during runtime, which contributes to its exceptional efficiency.
Installation
To get started with ggml, clone the repository and follow the build steps:
git clone https://github.com/ggml-org/ggml
cd ggml
# install python dependencies in a virtual environment
python3.10 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
# build the examples
mkdir build && cd build
cmake ..
cmake --build . --config Release -j 8
Examples
Here's an example of how to run GPT-2 inference using ggml:
# run the GPT-2 small 117M model
../examples/gpt-2/download-ggml-model.sh 117M
./bin/gpt-2-backend -m models/gpt-2-117M/ggml-model.bin -p "This is an example"
For more detailed examples and usage scenarios, explore the examples folder within the repository.
Why Use ggml?
ggml offers several compelling advantages for machine learning developers:
- Low-level and Cross-platform: Provides fine-grained control and runs efficiently across various operating systems and architectures.
- Integer Quantization Support: Enables efficient model deployment on resource-constrained devices by reducing model size and computational requirements.
- Broad Hardware Support: Designed to work across a wide range of hardware, including specialized accelerators.
- Automatic Differentiation: Simplifies the implementation of complex machine learning models and training algorithms.
- Optimizers Included: Comes with ADAM and L-BFGS optimizers built-in, ready for use in training processes.
- No Third-Party Dependencies: Reduces complexity, potential conflicts, and improves portability.
- Zero Memory Allocations During Runtime: Ensures highly efficient memory usage, crucial for performance-critical applications.
Links
- GitHub Repository: https://github.com/ggml-org/ggml
- Roadmap: https://github.com/users/ggerganov/projects/7
- Manifesto: https://github.com/ggerganov/llama.cpp/discussions/205
- Introduction to ggml (Hugging Face Blog): https://huggingface.co/blog/introduction-to-ggml
- The GGUF file format: https://github.com/ggerganov/ggml/blob/master/docs/gguf.md
Related repositories
Similar repositories that may be relevant next.
Evidently: Open-Source ML and LLM Observability Framework
June 30, 2026
Evidently is an open-source Python library designed for evaluating, testing, and monitoring machine learning and large language model systems. It provides over 100 built-in metrics for various tasks, from data drift detection to LLM judges, supporting both tabular and text data. This framework helps ensure the quality and performance of AI-powered systems throughout their lifecycle.

MarkLLM: An Open-Source Toolkit for LLM Watermarking
June 23, 2026
MarkLLM is an open-source toolkit designed to simplify the research and application of watermarking technologies for large language models (LLMs). It offers a unified framework for implementing various watermarking algorithms, alongside robust visualization and comprehensive evaluation tools. This toolkit helps researchers and the broader community understand and assess the authenticity and origin of machine-generated text.

bg-remove: Client-Side Image Background Removal with Transformers.js
June 17, 2026
bg-remove is a powerful React + Vite application that enables free, client-side image background removal directly in your browser. Leveraging machine learning models via Transformers.js, it ensures all processing happens locally, prioritizing user privacy. This tool offers one-click removal, custom background options, and optional WebGPU acceleration for enhanced performance.

NVIDIA PhysicsNeMo: Deep Learning Framework for Physics-ML Models
June 16, 2026
NVIDIA PhysicsNeMo is an open-source deep learning framework designed for building, training, and fine-tuning Physics AI models. It leverages state-of-the-art scientific machine learning methods, enabling real-time predictions by combining physics knowledge with data. This framework provides scalable, GPU-optimized tools for AI4Science and engineering applications.
Source repository
Open the original repository on GitHub.