Interesting paper on “small” vs “large” LLMs.
2408.16737v1.pdf (1.1 MB)
Interesting. Do I get it right that fine-tuning on synthetic data from smaller models resulted in better performance because smaller models can generate more samples for the same compute budget?
That’s the way I read it. It will probably spark a lot of debate.
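The back-of-the-envelope arithmetic behind that reading can be sketched roughly like this (illustrative numbers and a hypothetical helper, not taken from the paper; the ~9B/~27B sizes are just an example of a 3x parameter gap):

```python
# Compute-matched sampling sketch: generating one token costs roughly
# 2 * P FLOPs of forward-pass compute for a P-parameter model, so at a
# fixed FLOP budget a smaller model yields proportionally more samples.

def samples_per_budget(budget_flops, params, tokens_per_sample):
    # hypothetical helper: how many full solutions fit in the budget
    return budget_flops // (2 * params * tokens_per_sample)

budget = 1e18   # hypothetical sampling budget in FLOPs
tokens = 512    # hypothetical tokens per sampled solution

small = samples_per_budget(budget, 9e9, tokens)    # ~9B-parameter model
large = samples_per_budget(budget, 27e9, tokens)   # ~27B-parameter model
print(small, large)  # the smaller model yields ~3x more samples
```

So the same budget buys roughly 3x as many training samples from the smaller model, and the question the paper raises is whether that extra coverage outweighs the lower per-sample quality.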
Agree - it would be interesting to see whether there is a gap (and if so, how big it is) when the same number of samples is taken from a “stronger” LLM.
Crazy how quickly things develop in this space these days… I’m currently playing around with some of the older and newer vision models and am really blown away by how good even “smaller” models (~7B parameters) have become…