Taichi: Productive, portable, and performant GPU programming in Python
https://github.com/taichi-dev/taichi
LabelU
Data annotation toolbox supports image, audio and video data.
https://github.com/opendatalab/labelU
I switched from full stack to streamlit/python and it reduced my development time to 2 weeks !
https://www.reddit.com/r/Python/comments/1f07c7d/i_switched_from_full_stack_to_streamlitpython_and/
How to build a query language in Python
The article discusses the complexities and challenges of building a custom query language, highlighting key considerations such as syntax design, parsing, and execution. It provides insights and practical tips for developers looking to create or improve their own query languages.
https://jamesg.blog/2024/08/17/build-a-query-language/
Try LangChain with Python and Upstash Vector
The video covers setting up a Lang chain for custom data sets using a variety of tools like Python, Jupyter, and OpenAI, demonstrating API creation, embedding models, and vector database integration. It also discusses implementing rate limiting in a FastAPI application and setting up agents for querying and handling data.
https://www.youtube.com/watch?v=FjKMnszG8Dk
PromptMage - simplifies the process of creating and managing LLM workflows
"PromptMage" is designed to offer an intuitive interface that simplifies the process of creating and managing LLM workflows. It facilitates prompt testing and comparison, and version control.
https://github.com/tsterbak/promptmage
Saving Scrapy Crawl Stats to PostgreSQL with a Custom Extension and SQLAlchemy
The post explains how to extend Scrapy to save crawl statistics directly into a PostgreSQL database, detailing the implementation process and configuration needed. It provides a guide for integrating Scrapy with PostgreSQL to enhance data management and analysis.
https://www.xiegerts.com/post/scrapy-extension-save-crawlstats-postgres/
Pare
Pare is the easiest way to deploy Python Lambdas alongside your primary web application.
https://github.com/gauge-sh/pare
Cloudflare R2 x Django. Static Files. User uploads, css, images, js and more. Production-ready.
The video covers setting up and managing Django files, including static and user-uploaded files, using Cloudflare's R2 object storage. It emphasizes best practices for configuring environment variables, securing API keys, and managing static and media files in Django with advanced validation and customization options.
https://www.youtube.com/watch?v=VU3MAN1gs1s
From Boring Object-Oriented to INSANE Functional Code
This video demonstrates that there's a place for both object-oriented and functional code. In Python, these two approaches can be combined effectively, allowing you to leverage the strengths of each for the best results.
https://www.youtube.com/watch?v=DvdZv_DD0DY
Automating Boolean-Based SQL Injection with Python
How to be efficiently lazy at finding hidden gems in predictable places – Database Edition.
https://trebledj.me/posts/automating-boolean-sql-injection-with-python/
Deep-Live-Cam
Real time face swap and one-click video deepfake with only a single image.
https://github.com/hacksider/Deep-Live-Cam
CSVs Are Kinda Bad. DSVs Are Kinda Good.
The article argues that CSVs (Comma-Separated Values) are problematic due to various edge cases involving delimiters, quotes, and newlines, and proposes using Delimiter-Separated Values (DSV) with ASCII control characters as a more robust alternative. It demonstrates how DSVs can handle complex data without escaping or quoting issues, but acknowledges that the lack of widespread tool sup...
https://matthodges.com/posts/2024-08-12-csv-bad-dsv-good/
Py5, a Python version of Processing for your creative coding projects
http://py5coding.org/index.html
Qik
Qik is a command runner that understands the import graph, allowing you to cache commands and only run the things that matter. Can dramatically speed up CI/dev in large monorepos.
https://github.com/Opus10/qik
RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
https://github.com/NirDiamant/RAG_Techniques
Rye and Uv: August Is Harvest Season for Python Packaging
https://lucumr.pocoo.org/2024/8/21/harvest-season/
Python's Preprocessor
Every now and then you hear outrageous claims such as “Python has no preprocessor”. This is simply not true. In fact, Python has the best preprocessor of all languages - it quite literally allows us to do whatever we want, and a lot more. It’s just a little tricky to (ab)use.
https://pydong.org/posts/PythonsPreprocessor/
OpenBB-finance / OpenBB
Investment Research for Everyone, Everywhere.
https://github.com/OpenBB-finance/OpenBB
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation
The video explains how to code a visual language model from scratch using PyTorch, covering topics such as vision transformers, contrastive learning, and language models. It provides a detailed walkthrough of implementing various components, including the vision encoder, language model, and how to combine image and text embeddings, with explanations of key concepts like attention mechani...
https://www.youtube.com/watch?v=vAmKB7iPkWw
Gemma for Streaming ML with Dataflow
The article demonstrates how to integrate Google's Gemma 2 language model into a Dataflow pipeline for real-time sentiment analysis and response generation in customer support chats. It provides a practical example of using Gemma to process streaming data, including code snippets for creating prompts, running inference, and handling model outputs within a scalable data processing framework.
https://developers.googleblog.com/en/gemma-for-streaming-ml-with-dataflow/
What are data structures anyway?
https://www.reddit.com/r/learnpython/comments/1ev5195/what_are_data_structures_anyway/
EDSL
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
https://github.com/expectedparrot/edsl
llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
https://github.com/Dicklesworthstone/llm_aided_ocr
DeltaDB
DeltaDB is a lightweight, fast, and scalable database built on polars and deltalake.
https://github.com/uname-n/deltadb
goauthentik / authentik
The authentication glue you need.
https://github.com/goauthentik/authentik
Some more batteries to do stuff with Mapping related data structures
This library provides utility functions for manipulating and transforming data structures which have or include Mapping-like characteristics.
https://github.com/erivlis/mappingtools