This is interesting but I’ll reserve judgement until I see comparable performance past 8 billion params.
Sub-4 billion parameter models all seem to have the same performance regardless of quantization nowadays, so it's a little hard to see the potential in 3 billion.
Someday, we’ll have the technology to generate an image of a centaur with 4 boobs without using more energy than a small hospital. Very exciting stuff.
I obviously got it. But not everyone appreciates high culture.