Discussion about this post

User's avatar
CJ Quines's avatar

hm. you give, as an example, being contrarian by "working on interpretability, but not SAEs". but there's nothing stopping you from applying the same argument again: "working on SAEs, but not X". and to give credit to everyone working on SAEs, it's not like anyone is doing the *exact* same thing as anyone else. in general, as long as you're not doing the exact same thing as someone, you're contrarian at *some* level. so the razor of "as far down the stack as possible" doesn't quite make sense to me?

10 more comments...

No posts

Ready for more?