Re: Benchmark results on mdds::multi_type_vector

Kohei Yoshida <kohei -AT- libreoffice.org>
Fri, 13 Dec 2019 17:38:01 -0500

On 13.12.2019 05:43, Luboš Luňák wrote:

On Friday 13 of December 2019, Kohei Yoshida wrote:

I just finished my benchmark testing on mdds::multi_type_vector, and
summarized my results in this blog post:

http://kohei.us/2019/12/12/benchmark-results-on-mdds-multi_type_vector/

Hopefully my findings and interpretations make sense.  In short, the
numbers look great.  The overhead of block shifting is a concern, but
I'm optimistic that this is going to be a non-issue for the most part.

I'd really like to see benchmarks of Calc with this new mdds, especially to
see how many regressions there will be, as I'm concerned whether it really
would be worth it in reality.

Sure, I do share your concern, which is why I spent time designing and
implementing the benchmark I did so that I can get some answers for my
concern.


You say that the vast majority of Calc
performance problems are with updating cell values without shifting, but that
makes sense because that's where the current bottleneck is. Once the
bottleneck moves to shifting of cells, we may get a whole new slew of
bug reports about that.

Sure, but that's just as much speculation as my own interpretation.
To be fair, it is possible that you are right, and I am wrong. But I
did provide my own interpretations of those numbers based on my own
experience and educated guesses. I'm not claiming that I'm right, but
I'm claiming that what I concluded in my post is my truly honest,
hopefully reasonably researched opinion.


E.g. copy&paste of a column is very likely to hit a
problem there, IIRC it internally results in a lot of shifting of cells.

Yes, which is why I ran the benchmarks to get some numbers to get more
clarity.
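
The trade-off under discussion can be illustrated with a toy model. This is only a sketch and not the mdds implementation: it assumes each block stores the logical position of its first element, so that a lookup can use binary search while an insertion has to adjust the stored position of every later block. The names `block`, `find_block` and `insert_rows` are hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Toy model only, NOT the mdds implementation: each block records the
// logical position of its first element and its length.
struct block
{
    std::size_t position; // logical index of the block's first element
    std::size_t size;     // number of elements held by the block
};

// With stored positions, locating the block that holds a logical row is a
// binary search, O(log n) in the number of blocks.
// Assumes blocks is non-empty, sorted by position, and starts at position 0.
std::size_t find_block(const std::vector<block>& blocks, std::size_t row)
{
    auto it = std::upper_bound(
        blocks.begin(), blocks.end(), row,
        [](std::size_t r, const block& b) { return r < b.position; });
    return static_cast<std::size_t>(it - blocks.begin()) - 1;
}

// Inserting rows grows one block, but every later block must then have its
// stored position adjusted: O(n) in the number of blocks that follow.
void insert_rows(std::vector<block>& blocks, std::size_t row, std::size_t count)
{
    const std::size_t i = find_block(blocks, row);
    blocks[i].size += count;
    for (std::size_t j = i + 1; j < blocks.size(); ++j)
        blocks[j].position += count; // the "block shifting" cost
}
```

In this model, updating a value in place only pays for `find_block`, while anything that inserts or removes rows also pays for the loop in `insert_rows`. How much that loop matters in practice depends on how many blocks a column really has.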

One interpretation of the graphs may be that the change helps a lot at the
cost of a regression in one place, but another possible interpretation is that
the change brings an improvement that can already be mostly achieved using
hints at the expense of a cost that cannot be alleviated. Moreover we did go
over all the reported performance problems related to mdds some months back
and fixed all of them (at least I'm not aware of any pending ones). So the
real question for me is how many real-world cases will be improved and
worsened by this, which is why I'd like to see non-artificial benchmarks.

So, I'm a bit concerned about your use of the word "artificial" to
describe my benchmark, because that word implies that I somehow made
those numbers up. Those are real numbers. Now, the numbers will of
course be quite different if you measure entire Calc operations,
which include a whole bunch of other operations, and I believe this is
what you are alluding to. I do share your concern there. But I thought
it was reasonable to draw the conclusions that I did, given that the I/O
with mdds::multi_type_vector does constitute a large part of Calc's cell
I/O. Also, keep in mind that the rest of the Calc operations are
constant, and the only variable is the mdds portion. On this point, I
believe it's not unreasonable to draw *some* conclusions based on the
numbers on mdds alone.

Having said that, you are of course free to draw your own, different
conclusions.

BTW, have you considered using vector operations like SSE for the updates
(either checking whether the compiler can employ them automatically or
hand-writing them)?

Yes. For one, I did look into e.g. OpenMP's auto SIMD support. But its
support appeared to be very limited, and MSVC did not seem to support
it. I also thought about hand-writing SIMD directly, and I am still
considering that as one of my future possibilities (note that I'm not
entirely done with this work). But I couldn't think of a good one to
use, especially when multi_type_vector uses an array of structures (AoS).
The SIMD intrinsics I know of are mostly not suitable for AoS. If you know
of good SIMD intrinsics that may work for multi_type_vector, I would be
interested.

I've done some SIMD coding in orcus to speed up XML and JSON parsing,
but I can't say I'm an expert at it, and I did not always manage to get the
code to run faster with SIMD.
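
To make the AoS point concrete, here is a minimal sketch of the two layouts. It assumes, purely for illustration, that the structures in question are small per-block records; `block_aos`, `blocks_soa`, `shift_aos` and `shift_soa` are hypothetical names and do not come from mdds.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical AoS record: the position is interleaved with other fields,
// so consecutive positions are sizeof(block_aos) bytes apart in memory.
struct block_aos
{
    std::size_t position;
    std::size_t size;
    void* data;
};

// AoS update: every iteration touches one field and skips over two others.
// Packed SIMD loads and stores expect contiguous lanes, which this strided
// access pattern does not provide.
void shift_aos(std::vector<block_aos>& blocks, std::size_t first, std::size_t delta)
{
    for (std::size_t i = first; i < blocks.size(); ++i)
        blocks[i].position += delta;
}

// Hypothetical SoA layout: one contiguous array per field.
struct blocks_soa
{
    std::vector<std::size_t> positions;
    std::vector<std::size_t> sizes;
    std::vector<void*> data;
};

// SoA update: a unit-stride loop over one array, the shape that compilers
// can typically auto-vectorize and that maps directly onto packed integer
// add instructions.
void shift_soa(blocks_soa& blocks, std::size_t first, std::size_t delta)
{
    std::size_t* p = blocks.positions.data();
    const std::size_t n = blocks.positions.size();
    for (std::size_t i = first; i < n; ++i)
        p[i] += delta;
}
```

Both functions compute the same result. Whether the SoA loop is actually vectorized depends on the compiler and flags, so it is worth checking the compiler's vectorization report or the generated assembly rather than assuming it.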

Alright, since one person is now raising an objection to hastily
integrating this piece, I should hold off on integrating it for
now, and let the discussion continue.

And, while I would love to craft another benchmark test involving the
entire Calc piece, I'm afraid I won't have enough bandwidth to do that.
Even running this benchmark on mdds alone took me one month to do
end-to-end. It would be nice to have someone else chip in and conduct
another, more thorough and satisfactory benchmark test, if anybody is
interested.


Thanks,

Kohei

--
Kohei Yoshida, LibreOffice Calc volunteer hacker

