9885
See what the GitHub community is most excited about today. A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel. Author and maintainer: https://github.com/katursis
#rust #document_ocr #document_processing #ocr #ocr_recognition #pdf #pdf_parser #text_extraction
LiteParse is a fast, local PDF parser that extracts text with bounding boxes, can use OCR, and works in Rust, Python, Node.js, and the browser. It also makes screenshots and can handle files like DOCX, XLSX, PPTX, and images after conversion. Benefit: you can turn documents into clean text or JSON on your own machine, which helps with private, quick, and structured document processing.
https://github.com/run-llama/liteparse
#python #deep_learning #jepa #model_predictive_control #pytorch #world_model
stable-worldmodel is a tool for world model research that lets you collect data, train models, and test them in one place across many standard environments. It helps you work faster and compare results more easily by giving you ready-made baselines, solvers, and simple data formats, so you can focus on improving your model instead of building the whole setup yourself.
https://github.com/galilai-group/stable-worldmodel
#python #audio #audio_tokenizer #llm #multimodal #text_to_speech #voice_cloning
MOSS-TTS is an open-source family of speech and sound models for natural, high-quality audio generation, including voice cloning, multi-speaker dialogue, real-time speech, and sound effects. It supports 31 languages in v1.5, better voice stability, and pause control, and it also offers a lightweight Nano version that can run on 4 CPU cores. The benefit to you is simple: you can create realistic speech or sound for apps, demos, or products with strong quality, flexible control, and multiple ways to run it.
https://github.com/OpenMOSS/MOSS-TTS
#other #chinese #english_learning #tutorial
This is an English learning guide that shares the writer’s methods, reading, listening, speaking, writing, vocabulary, and AI tips. Its benefit is simple: it helps you learn English more efficiently, find the right study path, and use AI as a real study tool instead of just a translator.
https://github.com/byoungd/English-level-up-tips
#python #cloth #collision #contact #physics #simulation
ZOZO’s Contact Solver is a GPU-based tool for physics simulations of cloth, solids, and rods that aims to prevent object overlap and handle very large scenes. It works with Blender, JupyterLab, Docker, Windows, Linux, and cloud servers, so you can run simulations locally or remotely and even use a laptop or Mac while the heavy work runs on a GPU. The benefit to you is faster setup, flexible use, and reliable, penetration-free results for animation, design, or testing.
https://github.com/st-tech/ppf-contact-solver
#other
This skill helps you remove AI-like writing patterns so your prose sounds more natural, direct, and human. It gives clear rules for cutting banned phrases, avoiding cliché structures, using active voice, and checking quality, which helps you write cleaner text faster and make your work sound more trustworthy.
https://github.com/hardikpandya/stop-slop
#swift #amp #claude_code #codex #gemini #ghostty #opencode #terminal #tmux
cmux is a fast macOS terminal app made for AI coding work. It gives you vertical tabs, split panes, clear notifications, and a built-in browser, so you can manage many agent sessions in one place and quickly see which one needs attention. It also works with your Ghostty settings, supports SSH, custom commands, session restore, and automation through a CLI and API. This helps you stay organized, save time, and work more smoothly with coding agents.
https://github.com/manaflow-ai/cmux
#python #ai_agents #claude_code #cloud_security #cybersecurity #devsecops #ethical_hacking #incident_response #infosec #llm #malware_analysis #mcp #mitre_attack #nist_csf #osint #penetration_testing #red_team #security #security_automation #threat_hunting #threat_intelligence
Anthropic Cybersecurity Skills is a free, open-source library with 754 ready-made security skills for AI agents across 26 domains. It helps an AI act more like a senior security analyst by giving clear workflows, checks, and verification steps. It also maps each skill to major security frameworks like MITRE ATT&CK, NIST CSF 2.0, ATLAS, D3FEND, and NIST AI RMF. The benefit is faster, smarter, and more reliable security work with less guesswork.
https://github.com/mukul975/Anthropic-Cybersecurity-Skills
#javascript #ai_agent #ai_presentation #api #gamma #powerpoint_automation #powerpoint_free #powerpoint_generation #presentation
Presenton is an open-source tool that makes AI slides and lets you use your own models, keys, templates, and data. You can run it in Docker, on your computer as a desktop app, or in the browser, and export slides as editable PPTX or PDF. This gives you more privacy, more control, and no lock-in or forced subscriptions.
https://github.com/presenton/presenton
#other
CFnew v2.9.8 is a Cloudflare Worker/Pages tool for managing VLESS, Trojan, and xhttp subscriptions with a simple web panel, custom paths, ECH support, delay testing, and many client formats. It now builds Clash, Sing-box, Surge, Loon, and Quantumult X configs directly, so setup is faster and more reliable. For you, this means easier deployment, instant config changes without redeploying, better performance, and more control over proxy and region settings.
https://github.com/byJoey/cfnew
#python #agentic_ai #agentic_workflow #agents #function_calling #llama_cpp #llamafile #llm #ollama #python #self_hosted #tool_calling
Forge is a Python tool that makes self-hosted LLM tool-calling more reliable. It helps local models handle multi-step tasks with guardrails, better context control, and support for Ollama, llama-server, Llamafile, and Anthropic. You can use it as a workflow runner, middleware, or proxy server with OpenAI-style clients. The benefit is fewer broken tool calls, better results on small models, and easier setup for agent apps, chat tools, and long-running sessions.
https://github.com/antoinezambelli/forge
#go #hysteria #hysteria2 #naive_proxy #shadowsocks #shadowtls #sing_box #trojan #tuic #vless #vmess
S-UI is a web panel for managing SagerNet/Sing-Box. It supports many protocols, several languages, traffic routing, client and system status, and subscription links. It runs on Linux, Windows, and partly on macOS, and can be installed by script, manually, or with Docker. It also supports HTTPS and dark/light mode. This helps you set up and control your proxy service more easily from one place.
https://github.com/alireza0/s-ui
#python #agents #ai #ai_agents #ai_engineering #computer_vision #course #deep_learning #from_scratch #generative_ai #llm #machine_learning #mcp #nlp #python #reinforcement_learning #rust #swarm_intelligence #transformers #tutorial #typescript
This is a free MIT learning guide for AI engineering with 428 lessons in 20 phases. It teaches you AI from the math up, then moves into machine learning, deep learning, LLMs, agents, tools, safety, and production. Each lesson helps you build useful code or AI tools, not just read theory. You can start at the right level, follow a clear path, and keep reusable artifacts for real work. The benefit is simple: you learn how AI actually works and gain practical skills you can use to build and ship better AI systems.
https://github.com/rohitg00/ai-engineering-from-scratch
#typescript #api #gateway #whatsapp
OpenWA is a free, open-source WhatsApp API gateway for developers. It gives you full control, no vendor lock-in, and supports sessions, webhooks, messages, groups, and a web dashboard. It works with SQLite or PostgreSQL, Redis, local or S3 storage, and Docker. The benefit is faster setup, flexible scaling, and secure WhatsApp automation with clear docs and simple API access.
https://github.com/rmyndharis/OpenWA
#go
Files.md is a simple, private app for plain `.md` files. It lets you store notes, documents, journals, tasks, and checklists on your device, with no data sent to a server. You can use it offline, sync later if you want, and even connect it to a chatbot or your own server. The main benefit is clear: you keep full control of your files and can think, write, and organize ideas in one calm place.
https://github.com/zakirullin/files.md
#typescript
Cursor plugins are ready-made add-ons for Cursor, each kept in its own folder with its own manifest file. The repository also has a master list of all plugins, plus files like skills, rules, MCP settings, README, changelog, and license, which helps you find, manage, and build plugins faster and more clearly.
https://github.com/cursor/plugins
#html #claude_code #claude_code_plugin #harness #harness_engineering
Harness is a Claude Code plugin that builds agent teams and skills for your project from one simple prompt, using six team patterns like pipeline, supervisor, and fan-out/fan-in. It helps you turn a domain idea into organized agents, better coordination, and tested outputs, so you can save time and handle complex work more reliably.
https://github.com/revfactory/harness
#shell
Claude Code Harness is a tool that adds a clear loop for coding: plan, work, review, sync, and release. It helps you avoid messy agent work by writing a spec and plan first, checking the work, and saving proof for pull requests or releases, so you can ship safer changes with less rework and more confidence.
https://github.com/Chachamaru127/claude-code-harness
#typescript #coderabbit #inngest #nextjs #shadcn_ui #stock_market #tailwindcss
OpenStock is a free, open-source stock market app that lets you track prices, set alerts, search stocks fast, and view company details and charts. It uses Next.js, MongoDB, Finnhub, and TradingView, and it also includes watchlists, email updates, and personalized onboarding. Benefit: you get a modern market tool without paying for expensive platforms, and you can even run or modify it yourself because it is open source.
https://github.com/Open-Dev-Society/OpenStock
#shell #agent #ai #claude #claude_code #codex #coding #design #frontend #lowcode #nocode #skill #skills #vibecoding
Taste Skill is a set of portable AI design tools that helps agents build better-looking interfaces with stronger layout, spacing, typography, and motion instead of plain, boilerplate UIs. It also includes image tools for web, mobile, and brand reference boards, so you can create polished design frames, then turn them into code with tools like Codex, Cursor, or Claude Code. The benefit is faster work with better visual quality and less repetitive design output.
https://github.com/Leonxlnx/taste-skill
#javascript #android #awesome #awesome_list #free_apps #ios #linux #macos #microsoft #mobile #windows
This page is a big list of free apps for Windows, Mac, Linux, Android, and iOS, sorted by what they do, like audio, browsers, security, video, games, editing, and file tools. It helps you quickly find a useful app for your device, especially if you want free or open-source software. The page also marks which apps are recommended and which are available on each platform, so you can choose faster and avoid wasted downloads.
https://github.com/Axorax/awesome-free-apps
#python
Plugins help Claude act like a specialist for different jobs, such as sales, marketing, finance, data, and support. They connect Claude to your tools, add useful skills, and give quick slash commands for common tasks. You can also change them to match your company’s words, systems, and ways of working. This saves time, reduces repeat work, and gives more consistent results because Claude can work more like part of your team.
https://github.com/anthropics/knowledge-work-plugins
#python #infra #long #nvfp4 #parallel #real_time #video_generation
LongLive 2.0 helps generate long videos faster by using NVFP4, parallel processing, and multi-shot support for training and inference. It can reach up to 45.7 FPS while keeping good quality, and it also supports real-time, user-guided video generation. The benefit is that you can make long videos more quickly and efficiently, with less computing cost and smoother interactive use.
https://github.com/NVlabs/LongLive
#rust #analytics #bi #data_visualization #javascript #jupyter #python #real_time #webassembly
Perspective is a fast tool for exploring large and streaming data. It lets you build dashboards, reports, notebooks, and apps with tables and many chart types. It works in the browser, Python, and Rust, and can connect to data sources like DuckDB or Arrow. This helps you quickly see patterns, make better decisions, and analyze data without heavy setup.
https://github.com/perspective-dev/perspective
#typescript #antigravity_skills #business_knowledge #claude_code #claude_skills #codebase_analysis #codex #codex_skills #developer_tools_ai_agent #gemini_cli_skills #karpathy_llm_wiki #knowledge_base #knowledge_graph #memory #opencode_skills #pi_agent #understandcode #vibe_coding
Understand Anything turns a codebase or docs into an interactive knowledge graph you can search, explore, and ask questions about. It works with tools like Claude Code, Codex, Cursor, Copilot, and Gemini CLI. You can see files, functions, classes, and dependencies in one place, get plain-English explanations, and find what changes affect before you commit. This helps you learn large projects faster, understand how pieces fit together, and save time when onboarding or reviewing code.
https://github.com/Lum1104/Understand-Anything
#csharp #agent_skills
.NET Agent Skills is a set of curated skills and custom agents for coding tools that help with common .NET tasks like coding, data access, debugging, builds, NuGet, upgrades, MAUI, AI, templates, testing, ASP.NET, and new .NET 11 features. You can install them in Copilot CLI, Claude Code, VS Code, Cursor, or Codex CLI. The benefit is faster, more accurate .NET help with less guesswork, so you can build, fix, and improve projects more easily.
https://github.com/dotnet/skills
#jupyter_notebook #gemini #large_language_models #llm #openai #training #transformers
This project shows how to build and train a transformer language model from scratch in PyTorch. It uses the Pile dataset, tokenizes text with tiktoken, and stores tokens in HDF5 files for faster training. The code includes attention, MLP, transformer blocks, training, saving, and text generation. The benefit is that you can learn how LLMs work and train your own small or large model on a single GPU, then use it to generate text for your own tasks.
https://github.com/FareedKhan-dev/train-llm-from-scratch
#javascript #anime #anime_downloader #anime_scraper #downloader #electron #modern_ui #movies #movies_streaming #piracy #series #streaming #streaming_video #tmdb_api #tv
Streambert is a cross-platform desktop app for streaming and downloading movies, TV shows, and anime with no ads or tracking. It also gives subtitles, a library, trending picks, and customization. You can use it to watch and save content faster, keep things organized, and enjoy a more private viewing experience.
https://github.com/truelockmc/streambert
#typescript #ai_agent #ai_coding_agent #anthropic #bun #claude #cli #coding_assistant #llm #mcp #multi_provider #openai #rust #terminal #tui #typescript
omp is a coding agent with the IDE built in. It works on macOS, Linux, and Windows, and gives you many tools for reading, editing, searching, debugging, browser use, and subagents. It can use lots of AI providers and model choices, and it is made to work well right away with real coding tasks. The benefit for you is faster, more accurate coding help in one place, with less setup and fewer extra tools.
https://github.com/can1357/oh-my-pi
#cplusplus
OpenToonz is a free 2D animation program for Windows, macOS, Linux, and BSD. It comes from Studio Ghibli and Toonz Studio, and you can download it, build it from source, or help test and improve it. The main benefit is that it gives you a powerful tool to make animations, with open licensing that allows personal or business use.
https://github.com/opentoonz/opentoonz