News & links about Python programming. https://pythonhub.dev/ Administrator: @rukeba
LibrePythonista allows running IPython code in a LibreOffice spreadsheet
https://github.com/Amourspirit/python_libre_pythonista_ext
peepDB
open-source CLI tool and python library to quickly view MySQL, PostgreSQL, MariaDB database tables.
https://github.com/evangelosmeklis/peepdb
GraalPy – A high-performance embeddable Python 3 runtime for Java
https://www.graalvm.org/python/
What’s New In Python 3.13
This article explains the new features in Python 3.13, compared to 3.12. Python 3.13 will be released on October 1, 2024.
https://docs.python.org/3.13/whatsnew/3.13.html
Multiversion Python Thoughts
A braindump on how to make multi version in Python work.
https://lucumr.pocoo.org/2024/9/9/multiversion-python/
PyPI Proxying for Docker Builds
I wanted to improve our CI system by caching PyPI data locally. I saw that there’s a project to do this, but I didn’t see any good examples actually using it.
https://www.robopenguins.com/pypi-proxy/
Shades of testing HTTP requests in Python
The post discusses various approaches to testing HTTP requests in Python applications, focusing on mocking external API calls during unit and integration testing.
https://rednafi.com/python/testing_http_requests/
kazam
Linux Screen Recorder, Broadcaster, Capture and OCR with AI in mind.
https://github.com/henrywoo/kazam
Pure Python: Build a full stack ChatGPT-like UI. Reflex, Neon Postgres. Deploy with Docker to a VM
This video tutorial demonstrates how to build a full-stack ChatGPT-like UI using Reflex, a Python framework for web development, integrating it with Neon Postgres database and OpenAI. It covers the entire process from setting up the development environment to deploying the application using Docker, GitHub Actions, and Ansible on a virtual machine.
https://www.youtube.com/watch?v=NuNaI__4xiU
Integrating Stripe Into A One-Product Django Python Shop
In the first part of this series, we created a Django online shop with htmx. In this second part, we'll handle orders using Stripe.
https://blog.appsignal.com/2024/09/04/integrating-stripe-into-a-one-product-django-python-shop.html
Lesser known parts of Python standard library – Trickster Dev
https://www.trickster.dev/post/lesser-known-parts-of-python-standard-library/
smartcut
Cut video files with minimal recoding.
https://github.com/skeskinen/smartcut
How to Create a Pre-Commit Hook
A step-by-step guide to developing your own pre-commit hook.
https://stefaniemolin.com/articles/devx/pre-commit/hook-creation-guide/
Classifying all of the pdfs on the internet
The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accurac...
https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html
Tinystatus: A tiny status page generated by a Python script
https://github.com/harsxv/tinystatus
How To Perform Unit Testing in Flask
Learn how to efficiently test your Flask applications using pytest to ensure robust and error-free code.
https://www.digitalocean.com/community/tutorials/unit-test-in-flask
Revival Hijack – PyPI hijack technique exploited in the wild, puts 22K packages at risk
The article discusses a PyPI supply chain attack technique called "Revival Hijack," where attackers can hijack removed packages by re-registering them. JFrog's research team found that this technique could potentially affect 22,000 existing PyPI packages, leading to hundreds of thousands of malicious downloads, and they took proactive measures to protect the PyPI community by safely rese...
https://jfrog.com/blog/revival-hijack-pypi-hijack-technique-exploited-22k-packages-at-risk/
Dante
A zero-setup, easy to use document store for Python.
https://github.com/senko/dante
NestedText
Human readable and writable data interchange format.
https://github.com/KenKundert/nestedtext
Python Hub Weekly Digest for 2024-09-15
https://pythonhub.dev/digest/2024-09-15/
nlp-zero-to-hero
A comprehensive resource for learning Natural Language Processing (NLP) from the basics to advanced topics. It contains Jupyter notebooks covering various NLP concepts, techniques, and implementations, making it a valuable guide for beginners and intermediate learners in the field of NLP.
https://github.com/JUSTSUJAY/nlp-zero-to-hero
cookiecutter-uv
A modern cookiecutter template for Python projects that use uv for dependency management.
https://github.com/fpgmaas/cookiecutter-uv
Create & Share Streamlit, Dash and Python Apps Online.
https://py.cafe
My Favorite Error Handling Technique
This video presents a surprising “Let it burn” approach to error handling, demonstrating how allowing code to fail fast can result in simpler, clearer, and more robust software. Discover the benefits of this method and its impact on improving overall code quality.
https://www.youtube.com/watch?v=YA0Wq1rcs6U
Using GPT-4o for web scraping
The article discusses using GPT-4 with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4 effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy...
https://blancas.io/blog/ai-web-scraper/
Why hash tables are faster?
https://www.reddit.com/r/learnpython/comments/1fcj0ci/why_hash_tables_are_faster/
pipefunc
Lightweight function pipeline (DAG) creation in pure Python for scientific workflows.
https://github.com/pipefunc/pipefunc
Mini-Omni
Mini-Omni is an open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
https://github.com/gpt-omni/mini-omni
Lessons learnt building a real-time audio application in Python
https://www.vangemert.dev/#/blog/lessons-learnt-backlooper
Multimodal Data Analysis with LLMs and Python – Tutorial
The tutorial teaches how to analyze multimodal data using Large Language Models (LLMs) and Python, covering text classification, image-based question answering, audio transcription, and creating a natural language query interface for SQL databases.
https://www.youtube.com/watch?v=3-4qAkFRpAk