
[Feature] Support more than 64 CPU threads #2696

Open
fixerivan opened this issue Jul 19, 2024 · 5 comments
Labels
enhancement (New feature or request), need-info (Further information from issue author is requested)

Comments

@fixerivan

Feature Request

Currently the GPT4All settings only allow setting the maximum CPU thread count to 64.

When I set it to 192 (which matches my current hardware setup), it always reverts to 64.

Ollama by itself supports 192, which I have tried, so maybe this is just a UI restriction? I didn't look at the code.

Thanks

@fixerivan fixerivan added the enhancement New feature or request label Jul 19, 2024
@chrisbarrera
Contributor

I'm not from Nomic, but I have to ask: what is the benefit of even 64 CPU threads? Have you benchmarked 64 threads against 32, 16, or even 8, and found that higher counts are better past a certain point? My understanding, borne out by my own tests, is that after 6-8 CPU threads the memory bus is saturated and additional threads tend to accomplish nothing. Maybe you can use a few more than that on an Epyc, just don't expect much beyond that point to actually help. If I am wrong, I would appreciate learning from the tests you have done.

@cosmic-snow
Collaborator

Looks like code is here:

void MySettings::setThreadCount(int value)
{
    if (threadCount() == value)
        return;
    value = std::max(value, 1);
    value = std::min(value, QThread::idealThreadCount());
    m_settings.setValue("threadCount", value);
    emit threadCountChanged();
}

Which means the thread count is clamped to whatever Qt considers the upper limit, i.e. QThread::idealThreadCount(). I'm unsure whether you'd get more performance out of a higher value, in any case.
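
As an aside (not from the thread): a minimal Qt sketch to check what idealThreadCount() reports on a given machine. The Windows note in the comments is an assumption; Qt builds that derive this value from the current processor group would report at most 64 logical processors.

// Minimal sketch: print the value GPT4All clamps against.
// Assumption: on Windows, if Qt derives this from the current
// processor group, it reports at most 64 logical processors.
// Build against Qt Core (e.g. via qmake or CMake).
#include <QThread>
#include <cstdio>

int main()
{
    std::printf("QThread::idealThreadCount() = %d\n",
                QThread::idealThreadCount());
    return 0;
}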

@supersonictw
Contributor

supersonictw commented Jul 19, 2024

@cosmic-snow
Collaborator

> Is it caused due to this?

Maybe, although I don't know the Qt internals, and that Q&A is really old. In any case, as chrisbarrera said, I'm not even sure it would help to go past the Qt-defined limit.

@cebtenzzre
Member

llama.cpp on CPU is memory-bottlenecked in practice, so using more CPU threads doesn't provide much benefit. The default of 4 threads is enough on my machine. Try with ollama or the llama.cpp CLI and see if you actually get any t/s improvement compared to 64 threads; you may actually see a slowdown.
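
For example (not from the thread), llama.cpp's llama-bench tool accepts a comma-separated list of thread counts, which makes a quick scaling sweep easy; the model path below is a placeholder:

# Sweep thread counts and compare tokens/s; model path is a placeholder.
llama-bench -m ./model.gguf -t 8,16,32,64,128,192

If tokens/s plateaus or drops well before 64 threads, that would suggest raising the GPT4All limit wouldn't help.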

@cebtenzzre cebtenzzre added the need-info Further information from issue author is requested label Jul 29, 2024