

Yeah i remember that Ed article ! I don’t think the technical aspects are relevant to the newer generation of models, but yeah of course any attempt to compress inference costs can have side effects : either response quality will degrade for using dumber models, or you’ll have re-inference costs when the dumb model shits its pants. In fact the re-inference can become super costly as dumber models tend to get lost in reasoning loops more easily.








Haha don’t know if i’m ai loving but i’m in my 40s if it helps. I was really using boomers as a slur to refer to dumb suits on LinkedIn, didn’t mean to offend.