explodinggradients / ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
https://github.com/explodinggradients/ragas
Let's build the GPT Tokenizer
The Tokenizer, essential for Large Language Models (LLMs), translates between strings and tokens, operating as a distinct stage with separate training sets and algorithms. This lecture builds the GPT series Tokenizer from scratch, uncovering peculiar behaviors in LLMs linked to tokenization. We explore these issues, attributing them to tokenization, and consider the ideal scenario of eli...
https://www.youtube.com/watch?v=zduSFxRajkE
The simplest way to add a Django worker (Using AWS Chalice)
The article discusses the utilization of AWS Chalice to incorporate a Django worker, enabling the use of a lambda function as a serverless background worker for any application. This approach allows the lambda function to run in the background without blocking the main thread of the app, and it can call an endpoint on the Django app when completed, offering the advantage of using any Pyt...
https://www.photondesigner.com/articles/lambda-for-django
hustvl / Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
https://github.com/hustvl/Vim
Hyperdiv – Reactive, immediate-mode web UI framework for Python
https://github.com/hyperdiv/hyperdiv
How to dockerize a Django, Preact, and PostgreSQL Application
Dockerizing your Django application can be intimidating, but the rewards outweigh the risks. In this guide, Charlie Macnamara walks you through the setup process so you can get the most out of your applications.
https://www.honeybadger.io/blog/dockerize-django-preact-postgres
Algorithmic Art with Python
In this talk we’re going to start from nothing and build out our own tools for making art in Python, no AI needed! We’ll show how Python’s expressiveness allows us to describe graphics elegantly and use that to make some unique art programmatically.
https://www.youtube.com/watch?v=_XeRM-4DZz0
A simple MUD server in Python which can be run on a Raspberry Pi
https://github.com/Frimkron/mud-pi
Building an LLM from scratch
Learn how to build a modern language model with all the bells and whistles completely from scratch: from vanilla Python to functional coding assistant
https://bclarkson-code.github.io/posts/llm-from-scratch-scalar-autograd/post.html
mwmbl
An open source, non-profit search engine implemented in Python.
https://github.com/mwmbl/mwmbl
Visualizing Neural Network Internals
Visualizing some of the internals of a neural network during training and inference.
https://www.youtube.com/watch?v=ChfEO8l-fas
Lag-Llama
Towards Foundation Models for Probabilistic Time Series Forecasting.
https://github.com/time-series-foundation-models/lag-llama
OS-Copilot
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
https://github.com/OS-Copilot/FRIDAY
Using IPython Jupyter Magic Commands to Improve the Notebook Experience
A post on creating a custom IPython Jupyter Magic command.
https://towardsdatascience.com/using-ipython-jupyter-magic-commands-to-improve-the-notebook-experience-f2c870cab356
Cry Baby: A Tool to Detect Baby Cries
https://www.reddit.com/r/Python/comments/1awbm1r/cry_baby_a_tool_to_detect_baby_cries/
Alto
Serverless for data practitioners. The fastest way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter notebooks in virtual machines.
https://github.com/runprism/alto
5 Tips to Safely Use Credentials in Your Python Project
Learn 5 simple tips to help keep your Python credentials safe and fix problems quickly if they occur.
https://www.youtube.com/watch?v=OOvvQRBcrhI
Neat Parallel Output in Python
https://bernsteinbear.com/blog/python-parallel-output/
Tracing System Calls in Python
The article discusses the author's development of a tool added to Cirron that enables the tracing of system calls made by Python code. It provides an example of tracing the "print" function and explains the tool's implementation using the strace tool for effective analysis. The article also outlines the initial intention to use the ptrace syscall for implementation and the subsequent uti...
https://blog.mattstuchlik.com/2024/02/16/counting-syscalls-in-python.html
Streamline-Analyst
An AI agent powered by LLMs that streamlines the entire process of data analysis.
https://github.com/Wilson-ZheLin/Streamline-Analyst
Deploy a Serverless FastAPI App with Neon Postgres and AWS App Runner at any scale
Create a serverless API using FastAPI, deployed on AWS App Runner and powered by Neon Postgres.
https://neon.tech/blog/deploy-a-serverless-fastapi-app-with-neon-postgres-and-aws-app-runner-at-any-scale
Vega-Altair: Declarative Visualization in Python
https://altair-viz.github.io/index.html
magika
Detect file content types with deep learning.
https://github.com/google/magika
netease-youdao / QAnything
Question and Answer based on Anything.
https://github.com/netease-youdao/QAnything
Summary of Major Changes Between Python Versions
This post is designed to be a quick reference for the major changes introduced with each new version of Python. This can help with taking advantages of using new features as you upgrade your code base, or ensuring that you have the correct guards for compatibility with older versions.
https://www.nicholashairs.com/posts/major-changes-between-python-versions/
Handling Tasks in Asyncio Like a Pro
https://jacobpadilla.com/articles/handling-asyncio-tasks
The Many Ways to Deploy a Model
There are many ways to deploy models and perform inference. Here, we share our decision rubric for model deployments using LLM inference as an example.
https://outerbounds.com/blog/the-many-ways-to-deploy-a-model