Thank you guys for your help. I tried with NUMA disabled. The results are the same for every QPI snoop mode, except when QPI is set to Home Dir (Early Snoop disabled, CoD disabled): that roughly doubled the time compared with the previous modes. It showed about 16m remaining, so I canceled Benchwell after 20 seconds.
Could this behavior be related to the fact that my CPUs are engineering samples with the QHUZ code on their lids? Or maybe it's some obscure bug in the V4 code related to NUMA or the QPI snoop mode set in UEFI? I even suspect a mistake in the ASUS UEFI itself; at least there's a chance of that. Anyway, it's weird, considering that Maxwell V3 works flawlessly on every UEFI configuration tested here...
2x Xeon E5-2698 V4 ES, ASUS Z10PE-D8 WS, 128GB RAM, GeForce GTX 960 4GB, Dell U2711.