Ph.D. Residency in AI / ML: Coding & Program Synthesis @Theteamatx dissertating @UW, alumn @mit @msft
Great thread discussing common issues with LLMs evaluation, and how to do better using methods from the behavioral sciences. #LLMs #evaluation
Great thread from Shayne as always