• vatsadev@alien.top

    Well, the model is trained on RefinedWeb, which is about 3.5T tokens, so a little below Chinchilla-optimal for 180B (roughly 3.6T at ~20 tokens per parameter; see the sketch after the list). Also, all the models in the Falcon series feel more and more undertrained:

    • The 1B model was good, and still holds up several newer generations later
    • The 7B was competitive before Llama 2
    • The 40B and 180B were never as good for their size
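    A quick back-of-the-envelope for the Chinchilla claim, using the ~20 tokens/parameter rule of thumb (all numbers are rough, for illustration only):

    ```python
    # Chinchilla-style check: ~20 training tokens per parameter
    # is the usual compute-optimal rule of thumb.
    params = 180e9          # Falcon-180B parameter count
    tokens_seen = 3.5e12    # ~RefinedWeb-scale training run
    optimal = 20 * params   # ~3.6e12 tokens for a 180B model

    print(f"trained on {tokens_seen/1e12:.1f}T of ~{optimal/1e12:.1f}T "
          f"optimal ({tokens_seen/optimal:.0%})")
    ```

    So 180B sits right at the edge of optimal, while the smaller Falcon models saw far more than 20 tokens per parameter on the same corpus, which matches the impression above.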
  • koolaidman123@alien.top

    Public leaderboards mean nothing because 99% of the finetuned models are overfitted to hell; it's like nobody has ever done a Kaggle comp before.
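    A minimal sketch of the selection effect behind this (hypothetical numbers, not any real leaderboard): if many finetunes of equal true skill are ranked by a noisy public benchmark, the "winner" looks inflated, and a held-out private split reveals the regression:

    ```python
    # Pick the best of many equally-skilled models by public-split score,
    # then evaluate that same model on a held-out private split.
    import numpy as np

    rng = np.random.default_rng(0)
    n_models = 500       # candidate finetunes, identical true skill
    true_skill = 0.70    # genuine accuracy of every model
    noise = 0.03         # eval noise on each finite benchmark split

    public = true_skill + rng.normal(0, noise, n_models)
    private = true_skill + rng.normal(0, noise, n_models)

    best = np.argmax(public)  # "leaderboard winner"
    print(f"public score of winner:  {public[best]:.3f}")   # inflated
    print(f"private score of winner: {private[best]:.3f}")  # ~true skill
    ```

    This is exactly why Kaggle scores the final ranking on a private test split.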