Repository History
Explore all analyzed open source repositories

FlashAttention: Fast and Memory-Efficient Exact Attention
FlashAttention is a library from Dao-AILab that provides fast, memory-efficient exact attention for deep learning models. By computing attention in tiles rather than materializing the full score matrix in GPU memory, it reduces memory traffic and accelerates transformer training and inference, which makes it a widely used building block for researchers and developers working with large-scale AI models.
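To make the memory point concrete, here is a minimal pure-Python sketch of standard (naive) scaled dot-product attention, the computation FlashAttention reproduces exactly. The function name and the tiny example tensors are illustrative, not part of the library's API; the key detail is the `scores` matrix, which is seq_len x seq_len and is exactly what FlashAttention's tiled kernels avoid materializing.

```python
import math

def naive_attention(q, k, v):
    """Naive exact attention. q, k, v: lists of row vectors (seq_len x head_dim)."""
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    # scores[i][j] = <q_i, k_j> / sqrt(d) -- the full N x N matrix
    # whose materialization dominates memory for long sequences.
    scores = [[scale * sum(qi * kj for qi, kj in zip(qrow, krow))
               for krow in k] for qrow in q]
    out = []
    for row in scores:
        m = max(row)                       # subtract row max for numerical stability
        e = [math.exp(s - m) for s in row]
        z = sum(e)
        p = [x / z for x in e]             # softmax over keys
        # output row = softmax-weighted sum of value rows
        out.append([sum(pj * vrow[c] for pj, vrow in zip(p, v))
                    for c in range(len(v[0]))])
    return out

q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
print(naive_attention(q, k, v))
```

For a sequence of length N this builds an N x N score matrix, so memory grows quadratically; FlashAttention instead streams blocks of keys and values through fast on-chip memory, keeping the result exact while using memory linear in N.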

ZLUDA: Run CUDA Applications on Non-NVIDIA GPUs with Near-Native Performance
ZLUDA is an open-source project, written in Rust, that provides a drop-in replacement for CUDA, allowing existing CUDA applications to run on non-NVIDIA GPUs with near-native performance. By reimplementing the CUDA APIs on top of other vendors' GPU stacks, it broadens hardware compatibility for CUDA-dependent software without requiring source changes, offering real flexibility in GPU computing environments.