Compressing the NN evaluation results : part 2

In part 1, I wrote an ad-hoc algorithm which applied two commonly-used techniques – variable length coding and run length coding. The ad-hoc mode seemed to start going nowhere, so I eventually settled with something more general.

Two steps:

  • Encode the values 0 ~ 2047 using a run length coding method into some symbols, and then:
  • Do a simple variable length coding (something like Huffman Coding) to encode those symbols

Since we know that the values are mostly zeros, it is pretty straightforward that we have to do a run length encoding of zeros. Other than that, just a smaller number of tricks will be better. I settled with the following symbols:

  • V0 ~ V63 : A single value of 0 ~ 63
  • Z0 ~ Z15 : 2 ~ 17 consecutive zeros
  • X0 ~ X31 : For any symbol Xn, f the previous value was a V-type symbol, add 64(n+1) to that value. If the previous value was a Z-type symbol, append 16(n+1) zeros.

For example,

129 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 2

gets encoded into:

V1 X2 Z0 X0 V1 V2

The next step is to encode these values in individual bits. To see how many bits should be used to encode each value, I measured the frequency of each bits, and hand-assigned a variable number of bits for each symbol, ending up being

SymbolBit encoding
V00100
V1000
V2 ~ V3X1100
V4 ~ V7XX0010
V8 ~ V15XXX1010
V16 ~ V31XXXX0110
V32 ~ V63XXXXX1110
Z00001
Z11001
Z2 ~ Z3X0101
Z4 ~ Z7XX1101
Z8 ~ Z15XXX0011
X01011
X100111
X2 ~ X3X10111
X4 ~ X7XX01111
X8 ~ X15XXX011111
X16~X31XXXX111111

(The X on the bit encoding is the LSBs of the symbol number – For example, V18 is 00100110. One notable thing is that V1 is more frequent than V0 which is because we encoded the majority of zeros using the Zn type symbols.)

Since we are dealing with little endian (the least significant bits comes first), if we represent the example above as a 64-bit integer, we end up getting:

01100 000 1011 0001 010111 000 = 0x0c162b8

So, how good are we? I got 261.7 bits in average (or 32.7 bytes) which is much better than the ad-hoc method!

Next step is to apply this in leela-zero, and then verify it doesn’t lose strength.

Leave a Reply

Your email address will not be published. Required fields are marked *