mathieumorlighem
Hi Mathieu,
It does seem that MUMPS does not scale beyond a few tens of cores on a mesh of 20k elements. In fact, it slows down when np goes from 32 to 64 on a 2x EPYC 7502 node with hyperthreading off, and I highly doubt a 7702 would bring any performance gain.
It's understandable that direct solvers would be limited by the rate at which each core exchanges data with the others (correct me if I am wrong). However, I still want to double-check whether it's normal that, after going from 4 to 8 memory channels per CPU, the run time at 32 and 64 cores remains the same.
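To put rough numbers on the per-core workload, here is a minimal sketch. It assumes the ~20k elements are split evenly across MPI ranks, which is a simplification and not necessarily what the partitioner does; `elements_per_rank` is just a name I made up for illustration.

```python
# Back-of-the-envelope per-rank workload for a fixed-size mesh (strong scaling).
# Assumption: ~20,000 elements divided evenly across MPI ranks.
N_ELEMENTS = 20_000


def elements_per_rank(n_elements: int, n_ranks: int) -> float:
    """Average number of mesh elements owned by each rank."""
    return n_elements / n_ranks


if __name__ == "__main__":
    for n_ranks in (1, 8, 32, 64):
        share = elements_per_rank(N_ELEMENTS, n_ranks)
        print(f"np={n_ranks:3d}: about {share:.1f} elements per rank")
```

At np=64 each rank owns only a few hundred elements, so there is little local work left to divide further, which would be consistent with communication dominating rather than memory bandwidth.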
Best,
Wade