kc3-lang/brotli/research/deorummolae.h

    Commit

  • Author : Eugene Kliuchnikov
    Date : 2018-02-26 09:04:36
    Hash : 35e69fc7
    Message : New feature: "Large Window Brotli" (#640)

      * New feature: "Large Window Brotli"

        By setting special encoder/decoder flag it is now possible to extend LZ-window up to 30 bits; though produced stream will not be RFC7932 compliant.

        Added new dictionary generator - "DSH". It combines speed of "Sieve" and quality of "DM". Plus utilities to prepare train corpora (remove unique strings).

        Improved compression ratio: now two sub-blocks could be stitched: the last copy command could be extended to span the next sub-block.

        Fixed compression ineffectiveness caused by floating numbers rounding and wrong cost heuristic.

        Other C changes:
         - combined / moved `context.h` to `common`
         - moved transforms to `common`
         - unified some aspects of code formatting
         - added an abstraction for encoder (static) dictionary
         - moved default allocator/deallocator functions to `common`

        brotli CLI:
         - window size is auto-adjusted if not specified explicitly

        Java:
         - added "eager" decoding both to JNI wrapper and pure decoder
         - huge speed-up of `DictionaryData` initialization

      * Add dictionaryless compressed dictionary
      * Fix `sources.lst`
      * Fix `sources.lst` and add a note that `libtool` is also required.
      * Update setup.py
      * Fix `EagerStreamTest`
      * Fix BUILD file
      * Add missing `libdivsufsort` dependency
      * Fix "unused parameter" warning.

  • research/deorummolae.h
    #ifndef BROTLI_RESEARCH_DEORUMMOLAE_H_
    #define BROTLI_RESEARCH_DEORUMMOLAE_H_
    
    #include <cstddef>
    #include <cstdint>
    #include <string>
    #include <vector>
    
    /* log2(maximal number of files). Value 6 provides some speedups. */
    #define DM_LOG_MAX_FILES 6
    
    /* Non-tunable definitions. */
    #define DM_MAX_FILES (1 << DM_LOG_MAX_FILES)
    
    /**
     * Generate a dictionary for given samples.
     *
     * @param dictionary_size_limit maximal dictionary size, in bytes
     * @param sample_sizes sizes of the individual samples, in bytes
     * @param sample_data all samples concatenated, in the order of sample_sizes
     * @return generated dictionary
     */
    std::string DM_generate(size_t dictionary_size_limit,
        const std::vector<size_t>& sample_sizes, const uint8_t* sample_data);
    
    #endif  // BROTLI_RESEARCH_DEORUMMOLAE_H_
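The declaration above takes the samples as one concatenated byte buffer plus a vector of per-sample sizes. A minimal sketch of how a caller might build that layout from separate strings is shown below. `PackedSamples`, `PackSamples`, and `GenerateDictionary` are hypothetical helpers introduced here for illustration and are not part of brotli. The generator is passed in as a parameter so the sketch compiles without linking `deorummolae.cc`.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper (not part of brotli): holds samples in the layout
// that the DM_generate declaration asks for.
struct PackedSamples {
  std::vector<size_t> sizes;   // size of each sample, in bytes
  std::vector<uint8_t> data;   // all samples, back to back
};

// Concatenates the samples and records their sizes in the same order.
inline PackedSamples PackSamples(const std::vector<std::string>& samples) {
  PackedSamples packed;
  for (const std::string& sample : samples) {
    packed.sizes.push_back(sample.size());
    packed.data.insert(packed.data.end(), sample.begin(), sample.end());
  }
  return packed;
}

// Packs the samples and forwards them to `generate`, which is assumed to
// have the same signature as DM_generate.
template <typename Generator>
std::string GenerateDictionary(Generator generate,
                               size_t dictionary_size_limit,
                               const std::vector<std::string>& samples) {
  const PackedSamples packed = PackSamples(samples);
  return generate(dictionary_size_limit, packed.sizes, packed.data.data());
}
```

With the real implementation linked in, a call would look like `GenerateDictionary(DM_generate, 1 << 14, samples)`, where the 16 KiB limit is only an example value.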