Discussion about this post

User's avatar
CJ Quines's avatar

ah yes, i have been playing with "getting the model to edit its prompt" at work. haven't been able to do any large-scale experiments with it though, but small-scale results show… uh, not much improvement actually. probably due to domain effects

Samuel Ratnam's avatar

I think the most interesting use cases for this kind of meta-learning are tasks involving approximating some kind of subjective judgement (eg. for predicting whether you will want to read a certain blog post). These are capabilities that are not directly optimised for during post training, but exist latently within the model (which are pretrained to be general purpose simulators).

5 more comments...

No posts

Ready for more?