If you are at ICML and interested in RL or multilinguality, please say hi to @marafinkels! We worked closely the past few months to ship an RL method to fix a critical Gemini quality issue. She has great research ideas as well! Hope Gemini x academia stay in touch.
Mara Finkelstein
Mara Finkelstein27.11.2024
LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics + test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes!
5,77K