• kakes@sh.itjust.works
    link
    fedilink
    arrow-up
    9
    ·
    4 months ago

    Never really occurred to me before how huge a 10x savings would be in terms of parameters on consumer hardware.

    Like, obviously 10x is a lot, but with the way things are going, it wouldn’t surprise me to see that kind of leap in the next year or two tbh.

  • Fisch@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    3
    ·
    4 months ago

    That would actually be insane. Right now, I still need my GPU and about 8-10 gigs of VRAM to run a 7B model tho, so idk how that’s supposed to work on a phone. Still, being able to run a model that’s as good as a 70B model but with the speed and memory usage of a 7B model would be huge.