Performance benchmark of float128 type?

Has anyone used the float128 type?

I would like to know how its performance compares with double and with other high-precision types such as boost::multiprecision::cpp_dec_float. Have any benchmarks been done?

Obviously, float128 is much slower than double: something like up to 100 times slower, though the exact slowdown of course depends on the operation.

The paper "Benchmark of an MPFR emulation of Binary128 arithmetic" cites a slowdown of 250x for sums, 120x for products, and 400x for dot products.

Another paper, "Twofolds in C and C++", says that a __float128 sum is 150 times slower than a double sum (25x 6x).
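Since the numbers depend so much on the operation and the hardware, it may be easiest to measure the summation case yourself. Below is a minimal sketch, not a rigorous benchmark. It assumes GCC or Clang on a target that provides `__float128` (detected through the predefined macro `__SIZEOF_FLOAT128__`). The names `sum_as`, `seconds_per_sum` and `float128_slowdown` are made up for this illustration.

```cpp
#include <cassert>
#include <chrono>
#include <vector>

// Sum the inputs using T as the accumulator type.
template <typename T>
T sum_as(const std::vector<double>& xs) {
    T acc = 0;
    for (double x : xs) acc += static_cast<T>(x);
    return acc;
}

// Average wall-clock seconds for one call of sum_as<T>, over `reps` calls.
template <typename T>
double seconds_per_sum(const std::vector<double>& xs, int reps) {
    assert(reps > 0);
    volatile double sink = 0;  // keeps the compiler from discarding the loop
    auto t0 = std::chrono::steady_clock::now();
    for (int r = 0; r < reps; ++r) {
        sink = static_cast<double>(sum_as<T>(xs));
    }
    auto t1 = std::chrono::steady_clock::now();
    (void)sink;
    return std::chrono::duration<double>(t1 - t0).count() / reps;
}

// Ratio of __float128 time to double time for the same summation.
// Returns -1 when the compiler does not provide __float128 (assumption:
// availability is signalled by __SIZEOF_FLOAT128__, as on GCC and Clang).
double float128_slowdown(const std::vector<double>& xs, int reps) {
#ifdef __SIZEOF_FLOAT128__
    double t_quad = seconds_per_sum<__float128>(xs, reps);
    double t_double = seconds_per_sum<double>(xs, reps);
    return t_quad / t_double;
#else
    (void)xs;
    (void)reps;
    return -1.0;
#endif
}
```

To use it, fill a `std::vector<double>` with a few million values, call `float128_slowdown(xs, 20)`, and print the result. Build with optimizations on (for example `-O2`), and expect the ratio to vary with the compiler, the flags, and whether the double loop gets vectorized.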