News & links about Python programming. https://pythonhub.dev/ Administrator: @rukeba
An open source vuln scanner in Python: OXO
https://oxo.ostorlab.co/
VinciGit00 / Scrapegraph-ai
Python scraper based on AI
https://github.com/VinciGit00/Scrapegraph-ai
LlamaFS
A self-organizing file system with llama 3.
https://github.com/iyaja/llama-fs
Don't worry about LLMs
The post argues that while large language models (LLMs) are receiving a lot of hype, the engineering systems built around them are similar to previous machine learning systems. It advises practitioners to cut through the hype and treat LLMs as regular engineering and ML problems.
https://vickiboykis.com/2024/05/20/dont-worry-about-llms/
How AI Can Help Deaf People Hear
This project facilitates communication between Deaf individuals and hearing individuals who do not understand American Sign Language (ASL). It is designed to respect and preserve ASL as the primary language.
https://www.youtube.com/watch?v=uuPxMWQRoXc
Fire Up Your Logging Needs with Pydantic Logfire
The pydantic team recently introduced logfire, a new logging tool that makes it easy to track and analyze your logs. Simply integrate logfire into your projects with just a few lines of code.
https://kadermiyanyedi.medium.com/fire-up-your-logging-needs-with-logfire-6330d7a08dfe
google-deepmind / alphafold
Open source code for AlphaFold.
https://github.com/google-deepmind/alphafold
Web Development with Htmx, Type-Guided Components, Pure Python
https://getludic.dev/docs/
Python notebooks for fundamentals of music processing
https://www.audiolabs-erlangen.de/resources/MIR/FMP/C0/C0.html
Koheesio
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
https://github.com/Nike-Inc/koheesio
How good is GPT-4o at generating Flask apps? Surprisingly promising
This article summarizes the findings when asking GPT-4o to generate Flask applications, ranging from a simple "Hello, World!" app to a full-fledged CRUD app with three database models and HTML pages with Tailwind. With careful prompting, GPT-4o can produce working Flask applications and follow (some) best coding practices.
https://ploomber.io/blog/gpt-4o-flask/
How Python Compares Floats and Ints: When Equals Isn’t Really Equal
Another Python gotcha and an investigation into its internals to understand why this happens.
https://blog.codingconfessions.com/p/how-python-compares-floats-and-ints
Working with Excel Files in Python
https://www.python-excel.org/
What is your favorite Python-related YouTube channel?
https://www.reddit.com/r/learnpython/comments/1cyeyp8/what_is_your_favorite_pythonrelated_youtube/
cover-agent
An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement!
https://github.com/Codium-ai/cover-agent
wakepy
Cross-platform keep-awake with Python.
https://github.com/fohrloop/wakepy
Python Hub Weekly Digest for 2024-06-09
https://pythonhub.dev/digest/2024-06-09/
Elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
https://github.com/darrenburns/elia
Rapidly creating smoke tests for Django views
A management command for quickly generating tests based off Django's URL routing mechanism.
https://daniel.feldroy.com/posts/2024-05-rapidly-creating-smoke-tests-for-django-views
DataFrames at Scale Comparison: TPC-H
We run benchmarks derived from the TPC-H benchmark suite on a variety of scales, hardware architectures, and dataframe projects, notably Apache Spark, Dask, DuckDB, and Polars. No project wins. This post analyzes results within each project and between projects.
https://docs.coiled.io/blog/tpch.html
mistral-finetune
mistral-finetune is a light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
https://github.com/mistralai/mistral-finetune
Blealtan / efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
https://github.com/Blealtan/efficient-kan
Pandora
Towards General World Model with Natural Language Actions and Video States.
https://github.com/maitrix-org/Pandora
Terminal Text Effects
Visual effects applied to text in the terminal.
https://github.com/ChrisBuilds/terminaltexteffects
MicroPython 1.23 Brings Custom USB Devices, OpenAMP, Much More
https://hackaday.com/2024/06/02/micropython-1-23-brings-custom-usb-devices-openamp-much-more/
Gloe
Gloe is a general-purpose library made to help developers create, maintain, document, and test both operational and flow-oriented code.
https://github.com/ideos/gloe
Python Hub Weekly Digest for 2024-06-02
https://pythonhub.dev/digest/2024-06-02/
Signals, shells, and docker: an onion of footguns
The post discusses potential security risks and "footguns" that can arise when using Docker containers, signals, and shells together in a development environment. It highlights how certain combinations of these tools can inadvertently grant excessive permissions or expose sensitive data, emphasizing the need for caution and proper configuration.
https://benchling.engineering/signals-shells-and-docker-an-onion-of-footguns-ee592e2b587b
Curating Custom Datasets for LLM Training with NVIDIA NeMo Curator
NeMo Curator, which is part of NVIDIA NeMo, offers workflows to download and curate data from various public sources out of the box such as Common Crawl, Wikipedia, and arXiv. It also provides flexibility for developers to customize data curation pipelines to address their unique requirements and create custom datasets. This post walks you through creating a custom data curation pipeline...
https://developer.nvidia.com/blog/curating-custom-datasets-for-llm-training-with-nvidia-nemo-curator/
AgentOps
Open source Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen.
https://github.com/AgentOps-AI/agentops