Repository History
161 repositories tagged with python

vuln-bank: A Deliberately Vulnerable Banking App for Security Testing
vuln-bank is a Python-based banking application intentionally built with a wide array of security vulnerabilities. It serves as an excellent hands-on platform for security professionals, developers, and enthusiasts to practice web, API, and AI application security testing. This project is ideal for learning about common exploits, secure coding practices, and DevSecOps implementation in a controlled environment.

awesome-aws: A Curated List of AWS Resources and Libraries
awesome-aws is a comprehensive, curated list of Amazon Web Services (AWS) resources. It features a wide array of libraries, open-source repositories, guides, blogs, and other valuable content for anyone working with AWS. This repository is an essential tool for developers and architects looking to navigate the vast AWS ecosystem.

E2M: Convert Various File Types to Markdown for RAG and LLM Training
E2M is a Python library designed to convert diverse file types, including documents, web pages, and audio, into Markdown format. It features a robust parser-converter architecture, making it highly flexible and easy to integrate. This tool is specifically aimed at generating high-quality data for Retrieval-Augmented Generation (RAG) and large language model training.

Agentless: An Agentless Approach to Solve Software Development Problems
Agentless is an open-source project that takes an agentless approach to automatically solving software development problems. It fixes bugs in three phases: localization, repair, and patch validation. Its effectiveness is demonstrated by its results on benchmarks such as SWE-bench Lite.

GraphRAG: A Modular Graph-Based RAG System for LLM Discovery
GraphRAG, developed by Microsoft, is a powerful and modular graph-based Retrieval-Augmented Generation (RAG) system. It is designed to extract meaningful, structured data from unstructured text using Large Language Models (LLMs). This system enhances an LLM's ability to reason about private and narrative data by leveraging knowledge graph memory structures.

scikit-learn: The Essential Python Library for Machine Learning
scikit-learn is a widely used open-source Python library for machine learning, built on top of SciPy. It provides a comprehensive suite of tools for data mining and data analysis, making it a standard resource for developers and data scientists. Its broad set of algorithms and consistent interface simplify complex machine learning tasks.

Pedalboard: Spotify's Python Library for Audio Effects and Machine Learning
Pedalboard is a robust Python library developed by Spotify's Audio Intelligence Lab, designed for comprehensive audio processing tasks. It facilitates reading, writing, rendering, and applying a wide array of audio effects, including support for VST3® and Audio Unit plugins. Internally, Spotify leverages Pedalboard for data augmentation to enhance machine learning models and power innovative features like AI DJ, making advanced audio manipulation accessible within Python and TensorFlow environments.

Cerberus: Lightweight and Extensible Data Validation for Python
Cerberus is a lightweight, extensible data validation library for Python that provides type checking and other base validation functionality out of the box. It is designed for easy customization and integration, and it supports custom validation rules. With no external dependencies, Cerberus offers a powerful yet simple way to validate data structures.

DragGAN: Interactive Point-Based Image Manipulation with Generative AI
DragGAN is the official code for the SIGGRAPH 2023 paper, "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold." This powerful Python-based repository enables users to precisely control and manipulate generated images using interactive dragging points. It offers an intuitive way to edit AI-generated content, making complex image transformations accessible.

Whoogle Search: A Privacy-Respecting, Ad-Free Metasearch Engine
Whoogle Search is a self-hosted metasearch engine that delivers Google results without ads, tracking, or JavaScript. It prioritizes user privacy and offers a customizable, clean search experience. Easily deployable via Docker or various cloud platforms, Whoogle provides a seamless transition to a more private search environment.

Loguru: Python Logging Made Simple and Enjoyable
Loguru is a Python library designed to simplify logging. It provides a single preconfigured logger, so logging works without complex configuration. With Loguru, developers can set up robust logging with minimal boilerplate, which helps with debugging and application monitoring.

13ft: Self-Hosted Paywall Bypass and Ad Blocker
13ft is a self-hosted Python application designed to bypass paywalls and block ads on various websites, including those that services like 12ft.io might miss. It works by impersonating Googlebot, Google's web crawler, to retrieve the full content of articles. This open-source tool offers a flexible option for users who want to read restricted content.
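The general idea of presenting a crawler identity can be sketched with the standard library alone. This is not 13ft's actual code: the `build_request` helper and the example URL are hypothetical, and the request is only constructed, never sent.

```python
import urllib.request

# User-Agent string that Google documents for its Googlebot crawler.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_request(url: str) -> urllib.request.Request:
    """Return a request for `url` that identifies itself as Googlebot."""
    return urllib.request.Request(url, headers={"User-Agent": GOOGLEBOT_UA})


# Hypothetical URL, used only to show the resulting request object.
request = build_request("https://example.com/article")

# urllib.request.urlopen(request) would send it; omitted so the sketch stays offline.
```

Sites that serve full articles to search crawlers decide based on headers like this one, which is why changing the User-Agent can change the response.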