Discussion about this post

User's avatar
JP's avatar

Coming to this a bit late, but the DIY direction has moved a lot since this was published. The open-source options you covered have matured, and there's now another angle: assembling a pipeline from APIs buried in subscriptions you're already paying for.

Wrote it up here: https://reading.sh/how-to-build-a-solid-research-pipeline-in-claude-code-ff7878c5e2b5

Synthetic.new's standard plan includes a real-time search API that returns full page content - I only found it after digging through the docs. Combined with Firecrawl and Exa, that's a three-engine pipeline for the cost of one subscription.

Has the open-source landscape you covered here held up, or have these projects moved on significantly since you wrote this?

No posts

Ready for more?