Smaller models have become much more capable lately. A 32-billion-parameter model can do quite a bit today, and there is still a lot left to optimize. The capability improvements are stunning: something you can run on a laptop today would have needed a whole data centre just a couple of years ago.