Numbers are scores for specific handler/warning level cases.
Scores are derived from warning counts per warning level,
mapped onto a 0..100 range. Higher is better.
Numbers are scores for specific handler/test cases.
Scores are derived from benchmark speed and memory usage data,
mapped onto a 0..100 range. Higher is better.
Higher numbers are better on both axes. The "good" zone is the upper right and the "bad" zone is the lower left.
The top is fast, the bottom is slow. Left means more warnings, right means fewer.
Scoring Algorithms
The algorithms behind the scores shown on this page are somewhat arbitrary.
The original scoring algorithm (Default) was deemed "good enough",
but later work has focused on enabling multiple scoring algorithms.
These can be found on the Home page or
in the Scoring drop-down in the upper right section of every page.
Algorithms are implemented by "scorekeepers".
Each scorekeeper is specified by the two axes shown in the scoring chart.
Each axis interprets test data according to its own algorithm.
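As a rough picture of that structure, here is a minimal Go sketch; the names
Axis, Score, and ScoreKeeper are assumptions for illustration, not the actual
go-slog types.

    package score

    // Hypothetical sketch of the scorekeeper concept described above.
    // Each axis turns raw test data for a handler into a 0..100 score.
    type Axis interface {
        // Score returns a value in 0..100 for the named handler; higher is better.
        Score(handler string) float64
    }

    // ScoreKeeper pairs the two axes plotted on the scoring chart,
    // e.g. warnings (X) versus benchmarks (Y) for the Default scorekeeper.
    type ScoreKeeper struct {
        X, Y Axis
    }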
The current scorekeeper and axis algorithms are described below:
Score Keeper: Default
The Default scoring algorithm is the original (and initially the only) scoring algorithm.
This algorithm uses benchmark results and verification warnings to generate scores.
The Default score chart graphs various slog handlers by speed versus functionality.
This concept was the impetus behind creating scoring algorithms and charts.
On this chart the X axis is a warning score and
the Y axis is a benchmark (performance) score for each handler.
Going by the "score" values can be misleading,
as they roll up a lot of different data items, hiding the detail.
Use the checkboxes on the Scores table at the top to make detail tables visible.
Further buttons above the detail tables show different classes of data.
X Axis: Warnings
The X-axis for the Default scoring chart shows the score derived from verification warnings.
The score is calculated using the score weights shown to the right,
which are applied to the warning levels.
Handlers are scored based on how few warnings are generated.
Warnings are worth different amounts depending on their warning level.
The weights applied during this process are shown on the right.
Source Data
Each handler results in a lot of verification test output:
Warnings for slog/JSONHandler:
  Suggested
     2 [Duplicates] Duplicate field(s) found
         TestAttributeDuplicate: map[alpha:2 charlie:3]
         TestAttributeWithDuplicate: map[alpha:2 charlie:3]
The Suggested line is an example of a warning "level"
(warnings are grouped into levels on the warning page).
In this example there are two instances of the Duplicates warning.
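The parsed warning data could be modeled along these lines; this is a minimal
sketch with assumed names, not the actual structures used by the verification
harness.

    package score

    // Hypothetical model of one parsed verification warning; field names are
    // assumptions based on the example output above.
    type Warning struct {
        Level     string   // warning level, e.g. "Suggested"
        Name      string   // warning name, e.g. "Duplicates"
        Summary   string   // e.g. "Duplicate field(s) found"
        Instances []string // one entry per test that triggered the warning
    }

    // Count reports how many instances of the warning were seen
    // (2 in the Duplicates example above).
    func (w Warning) Count() int {
        return len(w.Instances)
    }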
Warnings Algorithm
Scoring is done for all handlers at the same time:
for each handler:
    score starts at zero
    for each warning level:
        for each warning in the level:
            if the warning shows up during testing:
                score = score + weight(level) * number of instances of the warning
    adjust the score to the range from zero to
        the maximum possible number of warnings
Where the weight(level) comes from the predefined table shown above and to the right.
The weighted total for each handler is then divided by the maximum possible
total that any handler might receive (if it were really awful),
and the resulting fraction, expressed as a percentage, is subtracted from 100.0.
This results in a number from 0.0 (awful, all warnings logged) to 100.0 (no warnings at all).
That number is stored for use and displayed on this page and on each handler page.
Note that most scores are above ~40, as it is difficult for a handler to trigger every possible warning.
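As a concrete illustration, here is a minimal Go sketch of the warnings
calculation, using the level weights from the weight table below; the names are
illustrative rather than the go-slog implementation, and the maximum possible
total is assumed to be computed elsewhere.

    package score

    // Hypothetical representation of the warning level weights from the table
    // below (Required 8, Implied 4, Suggested 2, Administrative 1).
    var levelWeight = map[string]float64{
        "Required":       8,
        "Implied":        4,
        "Suggested":      2,
        "Administrative": 1,
    }

    // warningScore sketches the algorithm above for a single handler.
    // counts maps each warning level to the number of warning instances seen;
    // maxPossible is the weighted total a handler would accumulate if it
    // triggered every possible warning.
    func warningScore(counts map[string]int, maxPossible float64) float64 {
        var total float64
        for level, n := range counts {
            // Each warning instance contributes the weight of its level.
            total += levelWeight[level] * float64(n)
        }
        // Express the total as a percentage of the worst case and invert it:
        // 100 means no warnings at all, 0 means every possible warning fired.
        return 100.0 - 100.0*total/maxPossible
    }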
Scores
Multiple scores are generated for each handler.
The main (or "default") score is shown in the data tables
in the column headed Score, which has an associated checkbox.
The checkbox can be used to show several other "score" columns, as follows:
Default (Score)
This is the score that is shown in the overall chart
at the top of the page in the column labeled Warnings.
The default score is the same as the By Data score.
By Data
This score is calculated by rolling up scores calculated per warning level.
Original
This is the "original" score, which has since been superseded by newer code.
The Original score is within 5% of the By Data value.
Level            Weight
Required              8
Implied               4
Suggested             2
Administrative        1
Y Axis: Benchmarks
The Y-axis for the Default scoring chart shows the score derived from running benchmarks.
The score is calculated using the score weights shown to the right,
which are applied to specific benchmark result values.
Handler benchmarks are scored on several metrics for each of the various tests.
Metrics are worth different amounts depending on what they are.
The weights applied during this process are shown on the right.
Source Data
Each combination of handler and test results in a single line of test output,
from which several data items are parsed, including
separate memory allocations per operation (1 allocs/op) and
estimated logging throughput per second (284.33 MB/s).
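Such a line could be parsed roughly as follows; this sketch assumes standard
go test benchmark output (with -benchmem) and is not the parser actually used
to produce these pages.

    package score

    import (
        "strconv"
        "strings"
    )

    // BenchResult holds the data items parsed from one benchmark output line;
    // the type and field names here are illustrative only.
    type BenchResult struct {
        NsPerOp     float64 // nanoseconds per operation
        BytesPerOp  float64 // memory bytes allocated per operation
        AllocsPerOp float64 // separate memory allocations per operation
        MBPerSec    float64 // estimated logging throughput per second
    }

    // parseBenchLine scans value/unit pairs such as "1 allocs/op" and
    // "284.33 MB/s" from a single benchmark output line.
    func parseBenchLine(line string) BenchResult {
        var r BenchResult
        fields := strings.Fields(line)
        for i := 1; i < len(fields); i++ {
            value, err := strconv.ParseFloat(fields[i-1], 64)
            if err != nil {
                continue // the previous field was not a number
            }
            switch fields[i] {
            case "ns/op":
                r.NsPerOp = value
            case "B/op":
                r.BytesPerOp = value
            case "allocs/op":
                r.AllocsPerOp = value
            case "MB/s":
                r.MBPerSec = value
            }
        }
        return r
    }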
Benchmark Algorithm
For each handler/test combination (a single line of test results)
we use one or more of the following three data items:
nanoseconds per operation,
memory bytes allocated per operation, and
separate memory allocations per operation.
These three items are combined over two steps.
First the test value ranges are acquired:
for each test:
    for each handler:
        for each of the three results described above:
            track the highest and lowest value for the test over all handlers
Then the test scores are calculated (this is the Original calculation):
for each handler:
    for each test:
        scorePerTest starts at zero
        for each of the three results described above:
            convert the value to a fraction of the range of values
                for the test from the previous step
            scorePerTest = scorePerTest + weight(result) * 100.0 * the fraction
        scorePerTest /= sum of weight(result)
    scorePerHandler = average of scorePerTest for handler
Where the weight(result) comes from the predefined table shown above and to the right.
There is currently no weighting by test; all tests are considered equal.
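The two steps above might be sketched in Go as follows. This is an
illustrative outline rather than the actual scoring code: the weight values
are placeholders for the table referenced above, and it assumes each value's
range fraction is inverted so that lower (better) raw values yield higher
fractions.

    package score

    // resultWeight stands in for the predefined weight table referenced above;
    // the values here are placeholders, not the actual weights shown on the page.
    var resultWeight = map[string]float64{
        "ns/op":     1,
        "B/op":      1,
        "allocs/op": 1,
    }

    // valueRange tracks the lowest and highest value seen for one data item
    // of one test across all handlers (step one above).
    type valueRange struct{ low, high float64 }

    // testScore sketches step two for a single handler/test combination:
    // each data item is converted to a fraction of its range (inverted so the
    // best, i.e. lowest, raw value yields the highest fraction), weighted,
    // summed, and divided by the sum of the weights.
    func testScore(values map[string]float64, ranges map[string]valueRange) float64 {
        var score, weightSum float64
        for item, value := range values {
            r := ranges[item]
            fraction := 0.0
            if r.high > r.low {
                fraction = (r.high - value) / (r.high - r.low)
            }
            score += resultWeight[item] * 100.0 * fraction
            weightSum += resultWeight[item]
        }
        return score / weightSum
    }

    // handlerScore averages the per-test scores for one handler;
    // there is no weighting by test, so all tests count equally.
    func handlerScore(perTest []float64) float64 {
        var sum float64
        for _, s := range perTest {
            sum += s
        }
        return sum / float64(len(perTest))
    }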
Scores
Multiple scores are generated for each handler.
The main (or "default") score is shown in the data tables
in the column headed Score, which has an associated checkbox.
The checkbox can be used to show several other "score" columns, as follows:
Default (Score)
This is the score that is shown in the overall chart
at the top of the page in the column labeled Benchmarks.
The default score is the average of the By Test and By Data scores.
By Test
This score is calculated by rolling up scores calculated per test.
By Data
This score is calculated by rolling up scores calculated per data item.
Original
This is the "original" score, which has since been superseded by newer code.
The Original score is within 5% of the By Data value.