How much better would a perfect multi-processing implementation perform compared to the above values?
Here, “perfect” is defined as an implementation that gives a speedup of 2.0 on two cores, 4.0 on four cores, and so on: a speedup equal to the number of cores. It is (probably) not quite possible to achieve this. The question is relevant for us because such an implementation represents an upper bound on potential multi-processing performance. The following table shows how actual Rybka performance diverges from this theoretical upper bound as the number of cores increases.