Discussion about this post

User's avatar
UtsavChokshi's avatar

Steven, this seems like good insight but tough to grasp.

It would be awesome if you can add examples of what goes into fine-tuning, what you fetch using RAG and final picture.

Questions :

1. Wrapper service that fetches Gold Standard Examples is not very clear. How do you decide gold standard examples for given math problem ? Are you using simple similarity scores like cosine ?

2. "The theory suggests a linear maturity curve: you begin with prompts, accumulate data, fine-tune a model, and subsequently discard the context window because the domain knowledge has been successfully encoded into the weights."

-- Why one would go in direction of fine tuning model ? Lesser cost, faster inference and more precision ?

3. I always thought : People use finetuning for style transfer and use context window for providing knowledge. You seem to have use fine-tuning for teaching it particular skill. Why that was not enough ? Why gold standard examples has to be fetched ? Is your RAG data and training data different.

1 more comment...

No posts

Ready for more?