Top stories from https://news.ycombinator.com (with 100+ score) Contribute to the development here: https://github.com/phil-r/hackernewsbot Also check https://t.me/designer_news Contacts: @philr
Show HN: Morphik – Open-source RAG that understands PDF images, runs locally (Score: 151+ in 14 hours)
Link: https://readhacker.news/s/6tdiG
Comments: https://readhacker.news/c/6tdiG
Hey HN, we’re Adi and Arnav. A few months ago, we hit a wall trying to get LLMs to answer questions over research papers and instruction manuals. Everything worked fine, until the answer lived inside an image or diagram embedded in the PDF. Even GPT‑4o flubbed it (we recently tried O3 with the same, and surprisingly it flubbed it too). Naive RAG pipelines just pulled in some text chunks and ignored the rest.
We took an invention disclosure PDF (https://drive.google.com/file/d/1ySzQgbNZkC5dPLtE3pnnVL2rW_9...) containing an IRR‑vs‑frequency graph and asked GPT “From the graph, at what frequency is the IRR maximized?”. We originally tried this on gpt-4o, but while writing this used the new natively multimodal model o4‑mini‑high. After a 30‑second thinking pause, it asked for clarifications, then churned out buggy code, pulled data from the wrong page, and still couldn’t answer the question. We wrote up the full story with screenshots here: https://docs.morphik.ai/blogs/gpt-vs-morphik-multimodal.
We got frustrated enough to try fixing it ourselves.
We built Morphik to do multimodal retrieval over documents like PDFs, where images and diagrams matter as much as the text.
To do this, we use Colpali-style embeddings, which treat each document page as an image and generate multi-vector representations. These embeddings capture layout, typography, and visual context, allowing retrieval to get a whole table or schematic, not just nearby tokens. Along with vector search, this could now retrieve exact pages with relevant diagrams and pass them as images to the LLM to get relevant answers. It’s able to answer the question with an 8B llama 3.1 vision running locally!
Early pharma testers hit our system with queries like "Which EGFR inhibitors at 50 mg showed ≥ 30% tumor reduction?" We correctly returned the right tables and plots, but still hit a bottleneck, we weren’t able to join the dots across multiple reports. So we built a knowledge graph: we tag entities in both text and images, normalize synonyms (Erlotinib → EGFR inhibitor), infer relations (e.g. administered_at, yields_reduction), and stitch everything into a graph. Now a single query could traverse that graph across documents and surface a coherent, cross‑document answer along with the correct pages as images.
To illustrate that, and just for fun, we built a graph of 100 Paul Graham’s essays here: https://pggraph.streamlit.app/ You can search for various nodes, (eg. startup, sam altman, paul graham and see corresponding connections). In our system, we create graphs and store the relevant text chunks along with the entities, so on querying, we can extract the relevant entity, do a search on the graph and pull in the text chunks of all connected nodes, improving cross document queries.
For longer or multi-turn queries, we added persistent KV caching, which stores intermediate key-value states from transformer attention layers. Instead of recomputing attention from scratch every time, we reuse prior layers, speeding up repeated queries and letting us handle much longer context windows.
We’re open‑source under the MIT Expat license: https://github.com/morphik-org/morphik-core
Would love to hear your RAG horror stories, what worked, what didn’t and any feedback on Morphik. We’re here for it.
How long does it take to create a new habit? (2015) (Score: 151+ in 10 hours)
Link: https://readhacker.news/s/6tdHn
Comments: https://readhacker.news/c/6tdHn
Making a smart bike dumb so it works again (❄️ Score: 151+ in 3 days)
Link: https://readhacker.news/s/6t4eU
Comments: https://readhacker.news/c/6t4eU
Atuin Desktop: Runbooks That Run (🔥 Score: 151+ in 2 hours)
Link: https://readhacker.news/s/6te5i
Comments: https://readhacker.news/c/6te5i
Algebraic Semantics for Machine Knitting (Score: 150+ in 5 hours)
Link: https://readhacker.news/s/6tdf8
Comments: https://readhacker.news/c/6tdf8
ClickHouse gets lazier (and faster): Introducing lazy materialization (🔥 Score: 153+ in 3 hours)
Link: https://readhacker.news/s/6tdgs
Comments: https://readhacker.news/c/6tdgs
Prolog Adventure Game (Score: 150+ in 18 hours)
Link: https://readhacker.news/s/6tbrn
Comments: https://readhacker.news/c/6tbrn
SerenityOS is a love letter to '90s user interfaces (Score: 151+ in 6 hours)
Link: https://readhacker.news/s/6tchL
Comments: https://readhacker.news/c/6tchL
A M.2 HDMI capture card (Score: 150+ in 19 hours)
Link: https://readhacker.news/s/6taAq
Comments: https://readhacker.news/c/6taAq
Whistleblower: DOGE Siphoned NLRB Case Data (🔥 Score: 181+ in 45 minutes)
Link: https://readhacker.news/s/6tckT
Comments: https://readhacker.news/c/6tckT
Haskelling My Python (❄️ Score: 151+ in 3 days)
Link: https://readhacker.news/s/6sZUa
Comments: https://readhacker.news/c/6sZUa
Astronomers confirm the existence of a lone black hole (Score: 150+ in 11 hours)
Link: https://readhacker.news/s/6tavB
Comments: https://readhacker.news/c/6tavB
AI assisted search-based research works now (Score: 151+ in 12 hours)
Link: https://readhacker.news/s/6t9Cq
Comments: https://readhacker.news/c/6t9Cq
Evertop: E-ink IBM XT clone with 100+ hours of battery life (🔥 Score: 150+ in 2 hours)
Link: https://readhacker.news/s/6tb9F
Comments: https://readhacker.news/c/6tb9F
FTC takes action against Uber for deceptive billing and cancellation practices (🔥 Score: 159+ in 2 hours)
Link: https://readhacker.news/s/6tagk
Comments: https://readhacker.news/c/6tagk
Verus: Verified Rust for low-level systems code (❄️ Score: 150+ in 2 days)
Link: https://readhacker.news/s/6t7Cm
Comments: https://readhacker.news/c/6t7Cm
Are polynomial features the root of all evil? (2024) (Score: 151+ in 11 hours)
Link: https://readhacker.news/s/6tdpP
Comments: https://readhacker.news/c/6tdpP
I should have loved biology too (Score: 151+ in 7 hours)
Link: https://readhacker.news/s/6tdpn
Comments: https://readhacker.news/c/6tdpn
Sapphire: Rust based package manager for macOS (Homebrew replacement) (🔥 Score: 155+ in 3 hours)
Link: https://readhacker.news/s/6tdG5
Comments: https://readhacker.news/c/6tdG5
Abusing DuckDB-WASM by making SQL draw 3D graphics (Sort Of) (Score: 152+ in 6 hours)
Link: https://readhacker.news/s/6tcJg
Comments: https://readhacker.news/c/6tcJg
We Diagnosed and Fixed the 2023 Voyager 1 Anomaly from 15B Miles Away [video] (❄️ Score: 150+ in 3 days)
Link: https://readhacker.news/s/6t3mS
Comments: https://readhacker.news/c/6t3mS
Supabase raises $200M Series D at $2B valuation (🔥 Score: 151+ in 2 hours)
Link: https://readhacker.news/s/6td8b
Comments: https://readhacker.news/c/6td8b
101 BASIC Computer Games (Score: 150+ in 17 hours)
Link: https://readhacker.news/s/6tbf7
Comments: https://readhacker.news/c/6tbf7
Cheating the Reaper in Go (Score: 151+ in 15 hours)
Link: https://readhacker.news/s/6tb6H
Comments: https://readhacker.news/c/6tb6H
Attacking My Landlord's Boiler (Score: 155+ in 6 hours)
Link: https://readhacker.news/s/6tbN3
Comments: https://readhacker.news/c/6tbN3
Launch HN: Magic Patterns (YC W23) – AI Design and Prototyping for Product Teams (Score: 150+ in 18 hours)
Link: https://readhacker.news/c/6t9AS
Alex and Teddy here. We’re launching Magic Patterns (https://www.magicpatterns.com), an AI prototyping tool that helps PMs and designers create functional, interactive designs and websites. There’s a demo video at https://www.youtube.com/watch?v=SK8C_tQBwIU, as well as video walkthroughs of specific examples at https://www.magicpatterns.com/docs/documentation/tutorials/v...
While other tools help with “AI-assisted coding,” we have been quietly focused on “AI-assisted designing.” With Magic Patterns you can visually communicate your idea, get hands on feedback from customers, and test new features.
Teddy and I are best friends and former frontend engineers turned founders. We arrived at Magic Patterns after several pivots—always in the design tooling space, but different products that all struggled to get usage. We started working on Magic Patterns after an internal hackathon. Teddy built a UI library catalog and I messed around with GPT 3.5. We thought it’d be fun to combine the two: an AI component generator. Describe whatever you want, and get back a React component!
That started to take off and we gained users, but it wasn’t developers using the tool. Instead, it was PMs, designers, and leadership who could finally communicate their ideas. They use it to test new ideas quickly, get feedback from customers, and improve communication with internal teams. Also, hobbyists (and programmers who aren’t designers) use us to create designs and UIs that they wouldn’t be able to otherwise.
We use Sonnet 3.5 and 3.7, and leverage a fine-tuned model for fast-applying edits. The most challenging part is determining the most relevant context to feed to the LLM. We attempt to solve this with our click to update feature and by letting users define a brand preset, or default prompt.
Unlike other tools in this space, we’re specifically focused on (1) product teams—we're realtime and collaborative; and (2) frontend only—we don't spin up a database or backend because we aren't solving "idea to fullstack app."
A common workflow is a product manager building an interactive prototype and then passing it off to a designer for more polish or directly to engineers. Many teams are even skipping Figma entirely now, telling us that it feels like an unnecessary middleman. Teams are instead generating clickable prototypes, collaborating directly with stakeholders, and using that as the mockup.
With Magic Patterns, you can: - Collaborate with your team on our infinite canvas; - Match your existing designs by creating reusable components directly; - Brainstorm features and flows. (The latter is what we use it for internally.)
We started as a way to build small, custom components, but now people are one-shotting entire
websites and hosting them with us, or building dashboards that they share internally or in customer demos. People have sold $10k/mo contracts with Magic Patterns designs!
Small business owners—everyone from fishermen to driving instructors to hotel managers—are using us to build their websites and then hosting them with us. Example sites built by Magic Patterns include https://getdealflow.ai/ and https://joinringo.com/. It’s amazing how people who couldn’t have done that before are now able to, and super gratifying to us to be empowering people in this way.
You can get started with our docs here: https://www.magicpatterns.com/docs/documentation/get-started..., and you can try the actual product. Simply go to https://www.magicpatterns.com and prompt for any UI you want.
Today no login is required, just click “Coming from Hackernews?” and you’ll get 5 messages free to try. Once you hit the limit, you’ll then be prompted to login. Plans start at $19/mo for another 100 messages a month (https://www.magicpatterns.com/pricing).
We’re stoked to be sharing with HN today and are open to all feedback!
Synology Lost the Plot with Hard Drive Locking Move (❄️ Score: 161+ in 2 days)
Link: https://readhacker.news/s/6t42U
Comments: https://readhacker.news/c/6t42U
How I use Kate editor (Score: 152+ in 1 day)
Link: https://readhacker.news/s/6t7Lj
Comments: https://readhacker.news/c/6t7Lj
A new form of verification on Bluesky (Score: 154+ in 4 hours)
Link: https://readhacker.news/s/6ta5d
Comments: https://readhacker.news/c/6ta5d
Blog hosted on a Nintendo Wii (🔥 Score: 159+ in 1 hour)
Link: https://readhacker.news/s/6taut
Comments: https://readhacker.news/c/6taut