Hi all.
I was researching generative model evaluation and found this post interesting: https://deepsense.ai/evaluation-derangement-syndrome-in-gpu-poor-genai
A lot of it kind of corresponds to what I see happening in the industry and feels like a good fit here
i mean the evaluation process itself is an active field of research…
That’s kind of what my original comment was all about.