[MDEV-18758] Test histograms precision Created: 2019-02-27 Updated: 2023-10-03 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Optimizer |
| Fix Version/s: | N/A |
| Type: | Task | Priority: | Major |
| Reporter: | Sergei Petrunia | Assignee: | Sergei Petrunia |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | histogram-test | ||
| Issue Links: |
|
||||||||
| Description |
|
This is to track the work being done on testing precision of histograms. (Some tests were already done, will post the results here) We are going to measure the precision of selectivity estimate for equality (range predicates do not make much sense for names I guess).
I would like a few constants:
(the first quartile is: Count the number of total different names = 17711
pick 4428th, 4429th, 4430th names. Then 3 names at the end of the second quartile. and 3rd and 4th. the repeat the above "selectivity test" for each constant. We need to compare:
For MySQL/MariaDB, use EXPLAIN FORMAT=JSON as it prints selectivity with greater precision. |
| Comments |
| Comment by Sergei Petrunia [ 2019-02-27 ] | ||||||
Test #2: FlightStatsLet's study these columns:
For flightDate - Please check if the number of flights done in a week varies
For AirTime:
Then let's pick some values, like
Please measure these in the same settings as in the previous benchmarks. | ||||||
| Comment by Sergei Petrunia [ 2019-02-27 ] | ||||||
|
First results: https://docs.google.com/spreadsheets/d/1c2fyVo5bFtCDPfesaxLVtqx6fSRb0LaiVVjXD3i0jRY/edit?usp=sharing | ||||||
| Comment by Sergei Petrunia [ 2019-02-27 ] | ||||||
|
cc cvicentiu |