Top stories from https://news.ycombinator.com (with 100+ score) Contribute to the development here: https://github.com/phil-r/hackernewsbot Also check https://t.me/designer_news Contacts: @philr
Capital Trades: Tracking Stock Market Transactions of Politicians (Score: 152+ in 7 hours)
Link: https://readhacker.news/s/6simN
Comments: https://readhacker.news/c/6simN
Federal cuts disrupt repairs to iconic U.S. trails (Score: 151+ in 8 hours)
Link: https://readhacker.news/s/6si4Z
Comments: https://readhacker.news/c/6si4Z
Video footage appears to contradict Israeli account of Gaza medic killings (Score: 150+ in 16 hours)
Link: https://readhacker.news/s/6sh32
Comments: https://readhacker.news/c/6sh32
Gmail E2E is as terrible as expected (🔥 Score: 150+ in 3 hours)
Link: https://readhacker.news/s/6sisn
Comments: https://readhacker.news/c/6sisn
Recent AI model progress feels mostly like bullshit (🔥 Score: 153+ in 3 hours)
Link: https://readhacker.news/s/6sib7
Comments: https://readhacker.news/c/6sib7
Self-Driving Teslas Are Fatally Rear-Ending Motorcyclists More Than Any Other (🔥 Score: 166+ in 1 hour)
Link: https://readhacker.news/s/6shwP
Comments: https://readhacker.news/c/6shwP
The order of files in /etc/ssh/sshd_config.d/ matters (❄️ Score: 153+ in 2 days)
Link: https://readhacker.news/s/6s8Cm
Comments: https://readhacker.news/c/6s8Cm
Why do we need modules at all? (2011) (❄️ Score: 150+ in 4 days)
Link: https://readhacker.news/s/6s2xW
Comments: https://readhacker.news/c/6s2xW
Exeter's unassuming co-op worker leads double life as 'Lord of the Logos' (Score: 150+ in 17 hours)
Link: https://readhacker.news/s/6sfhn
Comments: https://readhacker.news/c/6sfhn
The ADHD body double: A unique tool for getting things done (Score: 151+ in 9 hours)
Link: https://readhacker.news/s/6sgft
Comments: https://readhacker.news/c/6sgft
Apple's Darwin OS and XNU Kernel Deep Dive (🔥 Score: 152+ in 3 hours)
Link: https://readhacker.news/s/6sgmL
Comments: https://readhacker.news/c/6sgmL
Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) (Score: 150+ in 19 hours)
Link: https://readhacker.news/s/6secG
Comments: https://readhacker.news/c/6secG
Hi HN,
I’ve been working on an OCR pipeline specifically optimized for machine learning dataset preparation. It’s designed to process complex academic materials — including math formulas, tables, figures, and multilingual text — and output clean, structured formats like JSON and Markdown.
Some features:
• Multi-stage OCR combining DocLayout-YOLO, Google Vision, MathPix, and Gemini Pro Vision
• Extracts and understands diagrams, tables, LaTeX-style math, and multilingual text (Japanese/Korean/English)
• Highly tuned for ML training pipelines, including dataset generation and preprocessing for RAG or fine-tuning tasks
Sample outputs and real exam-based examples are included (EJU Biology, UTokyo Math, etc.)
Would love to hear any feedback or ideas for improvement.
GitHub: https://github.com/ses4255/Versatile-OCR-Program
A Vision for WebAssembly Support in Swift (Score: 152+ in 8 hours)
Link: https://readhacker.news/s/6sf36
Comments: https://readhacker.news/c/6sf36
Emulating an iPhone in QEMU (Score: 151+ in 9 hours)
Link: https://readhacker.news/s/6seDT
Comments: https://readhacker.news/c/6seDT
Show HN: I built a word game. My mom thinks it's great. What do you think? (Score: 152+ in 4 hours)
Link: https://readhacker.news/s/6sf6x
Comments: https://readhacker.news/c/6sf6x
Let's Ban Billboards (🔥 Score: 160+ in 1 hour)
Link: https://readhacker.news/s/6sj7d
Comments: https://readhacker.news/c/6sj7d
Rsync replaced with openrsync on macOS Sequoia (Score: 153+ in 4 hours)
Link: https://readhacker.news/s/6siEM
Comments: https://readhacker.news/c/6siEM
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators (Score: 150+ in 14 hours)
Link: https://readhacker.news/s/6sh4R
Comments: https://readhacker.news/c/6sh4R
Gumroad's Interestingly Timed "Open-Source" Play (🔥 Score: 151+ in 2 hours)
Link: https://readhacker.news/s/6sikR
Comments: https://readhacker.news/c/6sikR
What's in that bright red fire retardant? No one will say, so we had it tested (❄️ Score: 150+ in 2 days)
Link: https://readhacker.news/s/6saaN
Comments: https://readhacker.news/c/6saaN
Lessons from open source in the Mexican government (❄️ Score: 152+ in 2 days)
Link: https://readhacker.news/s/6saqi
Comments: https://readhacker.news/c/6saqi
The "S" in MCP Stands for Security (Score: 155+ in 4 hours)
Link: https://readhacker.news/s/6sh8S
Comments: https://readhacker.news/c/6sh8S
Standard Ebooks: liberated ebooks, carefully produced for the true book lover (🔥 Score: 158+ in 3 hours)
Link: https://readhacker.news/s/6sgWX
Comments: https://readhacker.news/c/6sgWX
Faster interpreters in Go: Catching up with C++ (Score: 151+ in 14 hours)
Link: https://readhacker.news/s/6sfzd
Comments: https://readhacker.news/c/6sfzd
The Importance of Fact-Checking (❄️ Score: 150+ in 4 days)
Link: https://readhacker.news/s/6rZ6F
Comments: https://readhacker.news/c/6rZ6F
Ten Rules for Negotiating a Job Offer (Score: 151+ in 4 hours)
Link: https://readhacker.news/s/6sg5s
Comments: https://readhacker.news/c/6sg5s
DeepSeek: Inference-Time Scaling for Generalist Reward Modeling (Score: 150+ in 1 day)
Link: https://readhacker.news/s/6sacg
Comments: https://readhacker.news/c/6sacg
Earth's clouds are shrinking, boosting global warming (Score: 150+ in 8 hours)
Link: https://readhacker.news/s/6seL6
Comments: https://readhacker.news/c/6seL6
OpenVertebrate Presents a Database of 13,000 3D Scans of Specimens (Score: 151+ in 17 hours)
Link: https://readhacker.news/s/6sdSF
Comments: https://readhacker.news/c/6sdSF
Llama4 (🔥 Score: 202+ in 36 minutes)
Link: https://readhacker.news/s/6sfEB
Comments: https://readhacker.news/c/6sfEB