loufe 1 day ago

I'm so grateful to live through such exciting times. I can open HN every two to some exciting new news about ML/transformer models. I really should read more into it, but does llama.cpp use a "custom kernel" per se, with cublas, or is it just making good use of the cublas kernal?

1
jonplackett 1 day ago

It’s funny that you’re missing the time frame from your sentence.

2 weeks? Two months? Two days? Two minutes?

All of the above are true sometimes! Exciting times indeed.

loufe 15 hours ago

Good catch, I meant every two days! :)