emval
emval is a blazingly fast Python email validator written in Rust.
https://github.com/bnkc/emval
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention
https://pytorch.org/blog/flexattention/
Django: create sub-commands within a management command
https://adamj.eu/tech/2024/08/14/django-management-command-sub-commands/
AgentK
An autoagentic AGI that is self-evolving and modular.
https://github.com/mikekelly/AgentK
alexta69 / metube
Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)
https://github.com/alexta69/metube
dir-assistant
Chat with your current directory's files using a local or API LLM.
https://github.com/curvedinf/dir-assistant/
GIL Become Optional in Python 3.13
https://geekpython.in/gil-become-optional-in-python
A Knownbits Abstract Domain for the Toy Optimizer, Correctly
The article discusses the implementation of a "Knownbits" abstract domain for the Toy optimizer, which tracks individual bits of a variable as "known zero," "known one," or "unknown" to optimize integer operations. It details the construction of the domain, transfer functions, and correctness proofs using property-based testing and automated proofs, with plans to apply a more complex ver...
https://pypy.org/posts/2024/08/toy-knownbits.html
Python extensions should be lazy
https://www.gauge.sh/blog/python-extensions-should-be-lazy
Talk to Django with natural language. Text to SQL and more.
The video demonstrates using Djeno database for natural language to SQL queries via embeddings, emphasizing semantic search, and integration with Django. It covers setting up PostgreSQL, creating Django models, embedding for efficient search, and caching for performance, while addressing bugs and customizing prompts in the SQL engine.
https://www.youtube.com/watch?v=GfDJ-Sxn4dE
Recent Performance Improvements in Function Calls in CPython
https://blog.codingconfessions.com/p/are-function-calls-still-slow-in-python
The trouble with __all__
This article discusses the limitations and pitfalls of using the all attribute in Python for defining public APIs, emphasizing how it fails to enforce module boundaries and can lead to tightly coupled codebases. The author proposes an alternative solution involving a custom import hook to ensure stricter module interface enforcement.
https://www.gauge.sh/blog/the-trouble-with-all
MindSearch
An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
https://github.com/InternLM/MindSearch
pgai
Bring AI models closer to your PostgreSQL data.
https://github.com/timescale/pgai
Unstract
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents.
https://github.com/Zipstack/unstract
Optimizing PyTorch Docker images: how to cut size by 60%
https://mveg.es/posts/optimizing-pytorch-docker-images-cut-size-by-60percent/
Crawling Pages with Infinite Scroll using Scrapy and Playwright
This post provides a detailed guide on how to scrape infinite scroll websites using Scrapy and Playwright in Python. It covers the setup process, explains how to implement a custom downloader middleware to handle JavaScript rendering, and demonstrates how to extract data from dynamically loaded content, offering a practical solution for web scraping challenges posed by modern web applica...
https://www.xiegerts.com/post/infinite-scroll-scrapy-playwright/
Log context propagation in Python ASGI apps
The article explains how to implement log context propagation in Python ASGI applications using middleware to automatically tag log messages with contextual information like user ID and platform. This approach simplifies logging by eliminating the need for manual context passing across different layers of an application, ensuring all logs within a request-response cycle are consistently ...
https://rednafi.com/python/log_context_propagation/
OpenAdapt
AI-First Process Automation with Large Multimodal Models (LMMs).
https://github.com/OpenAdaptAI/OpenAdapt
RAG Fundamentals and Advanced Techniques
This course will guide you through the basics of Retrieval-Augmented Generation (RAG), starting with its fundamental concepts and components. You'll learn how to build a RAG system for chatting with documents, explore advanced techniques, and understand the pitfalls of naive RAG.
https://www.youtube.com/watch?v=ea2W8IogX80
Django 5.1 released
Django 5.1 introduces LoginRequiredMiddleware for easier authentication enforcement, accessibility enhancements like improved screen reader support and better HTML semantics, and a new querystring template tag for simpler URL handling in templates.
https://www.djangoproject.com/weblog/2024/aug/07/django-51-released/
AudioSample
AudioSample is an optimized numpy-like audio manipulation library, created for researchers, used by developers.
https://github.com/deepdub-ai/audiosample
nano-llama31
nanoGPT style version of Llama 3.1
https://github.com/karpathy/nano-llama31
DCPerf
DCPerf benchmark suite for hyperscale cloud applications,
https://github.com/facebookresearch/DCPerf
FINALLY Python is Getting Rid of the GIL!
This video discusses how Python 3.13 is revolutionizing performance by making the Global Interpreter Lock (GIL) optional! Learn what the GIL is, why it exists, and the potential impacts of its removal on your Python projects.
https://www.youtube.com/watch?v=zWPe_CUR4yU
sqlite-vec
A vector search SQLite extension that runs anywhere!
https://github.com/asg017/sqlite-vec
Formy
Generate dynamic UI forms from text using OpenAI's structured output API.
https://github.com/deedy/formy
Knuckledragger, a Semi-Automated Python Proof Assistant
https://www.philipzucker.com/state_o_knuck/