Commit 945b0d025fae3819f02dc1076fb0e7270199d143

Zoltan Szabadka 2015-05-07T17:23:07

Use a static context map with two buckets for UTF-8 data. This is enabled for quality >= 4, and only if no obvious UTF-8 violations are detected. For each block, we gather two separate histograms: one for continuation bytes and one for ASCII or lead bytes.
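
The two-histogram split can be illustrated with a minimal sketch. It is not the encoder's code: the function names `is_continuation_byte` and `split_histograms` are hypothetical, and the sketch assumes each byte is classified by its own top two bits (UTF-8 continuation bytes have the form 10xxxxxx). The commit message does not say how the encoder picks the bucket, so treat this only as a picture of what the two histograms contain.

```python
from collections import Counter


def is_continuation_byte(b: int) -> bool:
    # UTF-8 continuation bytes have the bit pattern 10xxxxxx (0x80..0xBF).
    return (b & 0xC0) == 0x80


def split_histograms(block: bytes):
    # Hypothetical helper: count byte values into two buckets,
    # one for continuation bytes and one for ASCII or lead bytes.
    continuation = Counter()
    ascii_or_lead = Counter()
    for b in block:
        if is_continuation_byte(b):
            continuation[b] += 1
        else:
            ascii_or_lead[b] += 1
    return continuation, ascii_or_lead


# Example block: two characters ("ï" and "é") take two bytes each.
cont, other = split_histograms("naïve café".encode("utf-8"))
```

Here `cont` holds only the two continuation bytes (0xAF and 0xA9), while `other` holds the ASCII letters, the space, and the lead byte 0xC3. Keeping the two groups apart gives each histogram a narrower distribution than a single combined one would have.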