facebook/zstd - zstd - Final Minetest

Author	SHA1	Message	Date
Felix Handte	ad8dbae1b7	Merge pull request #2103 from felixhandte/relative-includes Migrate Includes to Relative Paths	2020-05-06 09:42:23 -07:00
Yann Collet	c29fd7cd8b	some more conversion warnings hunting down some static analyzer warnings	2020-05-05 10:16:59 -07:00
Yann Collet	c1b836f4c3	fix minor conversion warnings	2020-05-04 14:43:09 -07:00
W. Felix Handte	6028827fee	Rewrite Include Paths to be Relative Addresses #1998.	2020-05-04 15:20:26 -04:00
Felix Handte	816ed80774	Merge pull request #1984 from MeghnaM/1636-Reduce-stack-usage-of-HUF_sort Reduce stack usage of HUF_sort()	2020-05-04 08:15:31 -07:00
Nick Terrell	e103d7b4a6	Fix superblock mode (#2100 ) Fixes: Enable RLE blocks for superblock mode Fix the limitation that the literals block must shrink. Instead, when we're within 200 bytes of the next header byte size, we will just use the next one up. That way we should (almost?) always have space for the table. Remove the limitation that the first sub-block MUST have compressed literals and be compressed. Now one sub-block MUST be compressed (otherwise we fall back to raw block which is okay, since that is streamable). If no block has compressed literals that is okay, we will fix up the next Huffman table. Handle the case where the last sub-block is uncompressed (maybe it is very small). Before it would skip superblock in this case, now we allow the last sub-block to be uncompressed. To do this we need to regenerate the correct repcodes. Respect disableLiteralsCompression in superblock mode Fix superblock mode to handle a block consisting of only compressed literals Fix a off by 1 error in superblock mode that disabled it whenever there were last literals Fix superblock mode with long literals/matches (> 0xFFFF) Allow superblock mode to repeat Huffman tables Respect ZSTD_minGain(). Tests: Simple check for the condition in #2096. When the simple_round_trip fuzzer enables superblock mode, it checks that the compressed size isn't expanded too much. Remaining limitations: O(targetCBlockSize^2) because we recompute statistics every sequence Unable to split literals of length > targetCBlockSize into multiple sequences Refuses to generate sub-blocks that don't shrink the compressed data, so we could end up with large sub-blocks. We should emit those sections as uncompressed blocks instead. ... Fixes #2096	2020-05-01 16:11:47 -07:00
Meghna Malhotra	0adfc8dfce	Fix broken CI; make changes in response to the comments	2020-05-01 13:45:48 -07:00
Meghna Malhotra	53d76dc20f	Remove magic constant and made other changes addressing the comments	2020-05-01 13:45:48 -07:00
Meghna Malhotra	fe8402b522	WIP: Still getting an error	2020-05-01 13:45:48 -07:00
Meghna Malhotra	a084d959bd	WIP: Increased wksp size, but it's segfaulting	2020-05-01 13:45:48 -07:00
Meghna Malhotra	fdb2780c47	Move rank table into HUF_buildCTable_wksp()	2020-05-01 13:45:48 -07:00
Carl Woffenden	a93fadfcd9	Further replication removed `CHECK_F` is now in `error_private.h`. Minor tidy.	2020-04-07 11:25:16 +02:00
Carl Woffenden	7c420344d2	Single-file decoder script can now (optionally) create an encoder To complement the single-file decoder a new script was added to create an amalgamated single-file of all of the Zstd source, along with examples and (simple) tests.	2020-04-03 19:07:46 +02:00
Nick Terrell	ac58c8d720	Fix copyright and license lines * All copyright lines now have -2020 instead of -present * All copyright lines include "Facebook, Inc" * All licenses are now standardized The copyright in `threading.{h,c}` is not changed because it comes from zstdmt. The copyright and license of `divsufsort.{h,c}` is not changed.	2020-03-26 17:02:06 -07:00
Bimba Shrestha	a3a3c62b81	[fuzz] Only set HUF_repeat_valid if loaded table has all non-zero weights (#1898 ) Fixes a fuzz issue where dictionary_round_trip failed because the compressor was generating corrupt files thanks to zero weights in the table. * Only setting loaded dict huf table to valid on non-zero * Adding hasNoZeroWeights test to fse tables * Forbiding nbBits != 0 when weight == 0 * Reverting the last commit * Setting table log to 0 when weight == 0 * Small (invalid) zero weight dict test * Small (valid) zero weight dict test * Initializing repeatMode vars to check before zero check * Removing FSE changes to seperate pr * Reverting accidentally changed file * Negating bool, using unsigned, optimization nit	2019-11-26 12:24:19 -08:00
Sen Huang	7ce891870c	Fix merge conflicts	2019-11-05 15:51:25 -05:00
Yann Collet	ededcfca57	fix confusion between unsigned <-> U32 as suggested in #1441. generally U32 and unsigned are the same thing, except when they are not ... case : 32-bit compilation for MIPS (uint32_t == unsigned long) A vast majority of transformation consists in transforming U32 into unsigned. In rare cases, it's the other way around (typically for internal code, such as seeds). Among a few issues this patches solves : - some parameters were declared with type `unsigned` in .h, but with type `U32` in their implementation .c . - some parameters have type unsigned*, but the caller user a pointer to U32 instead. These fixes are useful. However, the bulk of changes is about %u formating, which requires unsigned type, but generally receives U32 values instead, often just for brevity (U32 is shorter than unsigned). These changes are generally minor, or even annoying. As a consequence, the amount of code changed is larger than I would expect for such a patch. Testing is also a pain : it requires manually modifying `mem.h`, in order to lie about `U32` and force it to be an `unsigned long` typically. On a 64-bit system, this will break the equivalence unsigned == U32. Unfortunately, it will also break a few static_assert(), controlling structure sizes. So it also requires modifying `debug.h` to make `static_assert()` a noop. And then reverting these changes. So it's inconvenient, and as a consequence, this property is currently not checked during CI tests. Therefore, these problems can emerge again in the future. I wonder if it is worth ensuring proper distinction of U32 != unsigned in CI tests. It's another restriction for coding, adding more frustration during merge tests, since most platforms don't need this distinction (hence contributor will not see it), and while this can matter in theory, the number of platforms impacted seems minimal. Thoughts ?	2018-12-21 18:09:41 -08:00
Yann Collet	7b74405150	refactor HUF_compress_internal for clarity changed workspace parameter convention to always provide workspaceSize, so that size can be explicitly checked. Also, use more enum to make the meaning of some parameters more explicit.	2018-10-26 13:21:37 -07:00
Nick Terrell	f2d6db45cd	[zstd] Add -Wmissing-prototypes	2018-09-27 15:24:48 -07:00
Yann Collet	698fd00afb	huf: increase threshold detection of poorly compressible data	2018-06-21 18:32:38 -07:00
Yann Collet	2d76defbfe	grouped all histogram functions into hist.c renamed functions with HIST_* prefix	2018-06-13 19:49:31 -04:00
Yann Collet	fa41bcc2c2	grouped debug functions into debug.h There were 2 competing set of debug functions within zstd_internal.h and bitstream.h. They were mostly duplicate, and required care to avoid messing with each other. There is now a single implementation, shared by both. Significant change : The macro variable ZSTD_DEBUG does no longer exist, it has been replaced by DEBUGLEVEL, which required modifying several source files.	2018-06-13 15:43:09 -04:00
Yann Collet	338f738c24	pass entropy tables to optimal parser for proper estimation of symbol's weights when using dictionary compression. Note : using only huffman costs is not good enough, presumably because sequence symbol costs are incorrect.	2018-05-08 15:37:06 -07:00
Yann Collet	a95a88af57	removed huf_compress_impl.h re-imported all functions inside huf_compress.c for easier source editing. Also updated a bunch of code comments for clarification.	2018-03-13 14:14:05 -07:00
Yann Collet	33a3f18848	fixed wrong size test	2018-02-26 18:27:51 -08:00
Yann Collet	6cdf690441	minor cleaning of huff0 Update code documentation, and properly names a few "magic constants". Also, HUF_compress_internal() gets a cleaner way to determine size of tables inside workspace.	2018-02-26 14:52:23 -08:00
Yann Collet	653383f74a	minor nit from Mac XCode	2018-02-22 15:44:26 -08:00
Yann Collet	9c5a8040a9	fixed huf_compress workspace size	2018-02-21 11:34:49 -08:00
Nick Terrell	b58f01537e	[compress] Support BMI2	2018-02-14 19:20:32 -08:00
Yann Collet	e8093dde09	fixed #304 Pathological samples may result in literal section being incompressible. This case is now detected, and literal distribution is replaced by one that can be written into the dictionary.	2018-01-11 11:16:32 -08:00
Nick Terrell	a86a7097ec	Ensure dictionary Huff table can encode any symbol * Ensure that the dictionary Huffman CTable has maxSymbolValue 255. * Fix a stack buffer overflow during compression dictionary loading.	2017-10-03 13:22:13 -07:00
Nick Terrell	07c6ff588e	[FSE][HUF] Inline error checks Caught by Clang's optimization remarks.	2017-08-15 11:23:28 -07:00
Nick Terrell	ae20d413da	[libzstd] Fix CHECK_V_F macros	2017-07-25 12:52:01 -07:00
Nick Terrell	cc1522351f	[libzstd] Fix bug in Huffman encoding Summary: Huffman encoding with a bad dictionary can encode worse than the HUF_BLOCKBOUND(srcSize), since we don't filter out incompressible input, and even if we did, the dictionaries Huffman table could be ill suited to compressing actual data. The fast optimization doesn't seem to improve compression speed, even when I hard coded fast = 1, the speed didn't improve over hard coding it to 0. Benchmarks: $ ./zstd.dev -b1e5 Benchmarking levels from 1 to 5 1#Synthetic 50% : 10000000 -> 3139163 (3.186), 524.8 MB/s ,1890.0 MB/s 2#Synthetic 50% : 10000000 -> 3115138 (3.210), 372.6 MB/s ,1830.2 MB/s 3#Synthetic 50% : 10000000 -> 3222672 (3.103), 223.3 MB/s ,1400.2 MB/s 4#Synthetic 50% : 10000000 -> 3276678 (3.052), 198.0 MB/s ,1280.1 MB/s 5#Synthetic 50% : 10000000 -> 3271570 (3.057), 107.8 MB/s ,1200.0 MB/s $ ./zstd -b1e5 Benchmarking levels from 1 to 5 1#Synthetic 50% : 10000000 -> 3139163 (3.186), 524.8 MB/s ,1870.2 MB/s 2#Synthetic 50% : 10000000 -> 3115138 (3.210), 370.0 MB/s ,1810.3 MB/s 3#Synthetic 50% : 10000000 -> 3222672 (3.103), 223.3 MB/s ,1380.1 MB/s 4#Synthetic 50% : 10000000 -> 3276678 (3.052), 196.1 MB/s ,1270.0 MB/s 5#Synthetic 50% : 10000000 -> 3271570 (3.057), 106.8 MB/s ,1180.1 MB/s $ ./zstd.dev -b1e5 ../silesia.tar Benchmarking levels from 1 to 5 1#silesia.tar : 211988480 -> 73651685 (2.878), 429.7 MB/s ,1096.5 MB/s 2#silesia.tar : 211988480 -> 70158785 (3.022), 321.2 MB/s ,1029.1 MB/s 3#silesia.tar : 211988480 -> 66993813 (3.164), 243.7 MB/s , 981.4 MB/s 4#silesia.tar : 211988480 -> 66306481 (3.197), 226.7 MB/s , 972.4 MB/s 5#silesia.tar : 211988480 -> 64757852 (3.274), 150.3 MB/s , 963.6 MB/s $ ./zstd -b1e5 ../silesia.tar Benchmarking levels from 1 to 5 1#silesia.tar : 211988480 -> 73651685 (2.878), 429.7 MB/s ,1087.1 MB/s 2#silesia.tar : 211988480 -> 70158785 (3.022), 318.8 MB/s ,1029.1 MB/s 3#silesia.tar : 211988480 -> 66993813 (3.164), 246.5 MB/s , 981.4 MB/s 4#silesia.tar : 211988480 -> 66306481 (3.197), 229.2 MB/s , 972.4 MB/s 5#silesia.tar : 211988480 -> 64757852 (3.274), 149.3 MB/s , 963.6 MB/s Test Plan: I added a test case to the fuzzer which crashed with ASAN before the patch and succeeded after.	2017-07-18 13:20:40 -07:00
Nick Terrell	55fc1f91fd	[zstd] Fix up formatting edge cases for clang-format	2017-05-24 13:50:10 -07:00
Jos Collin	7cd7a7564b	lib/compress: warning: this statement may fall through The following warning appears during build. ../lib/compress/huf_compress.c: In function ‘HUF_compress1X_usingCTable’: ../lib/compress/huf_compress.c:444:8: warning: this statement may fall through [-Wimplicit-fallthrough=] if (sizeof((stream)->bitContainer)8 < HUF_TABLELOG_MAX4+7) HUF_FLUSHBITS(stream) ^ ../lib/compress/huf_compress.c:465:18: note: in expansion of macro ‘HUF_FLUSHBITS_2’ HUF_FLUSHBITS_2(&bitC); ^~~~~~~~~~~~~~~ ../lib/compress/huf_compress.c:466:9: note: here case 2 : HUF_encodeSymbol(&bitC, ip[n+ 1], CTable); ../lib/compress/zstd_compress.c: In function ‘ZSTD_compressStream_generic’: ../lib/compress/zstd_compress.c:3366:34: warning: this statement may fall through [-Wimplicit-fallthrough=] zcs->streamStage = zcss_flush; /* pass-through to flush stage */ ~~~~~~~~~~~~~~~~~^~~~~~~~~~~~ ../lib/compress/zstd_compress.c:3369:9: note: here case zcss_flush: Signed-off-by: Jos Collin <jcollin@redhat.com>	2017-05-11 13:17:26 +05:30
Nick Terrell	54c4babd8f	Always check Huffman tables for ZSTD_lazy+ The compressor always reuses the existing Huffman table if the literals size is at most 1 KiB. If the compression strategy is `ZSTD_lazy` or stronger always check to see if reusing the previous table or creating a new table is better. This doesn't yet weigh in decompression speed. I don't want to add any heuristics there until I have real data to work with to ensure that the heuristic works for at least one use case, preferably more.	2017-03-03 16:49:38 -08:00
Nick Terrell	d051cd5b43	Use workspace for count and CTable	2017-03-02 16:38:07 -08:00
Nick Terrell	a419777eb1	Allow compressor to repeat Huffman tables * Compressor saves most recently used Huffman table and reuses it if it produces better results. * I attempted to preserve CPU usage profile. I intentionally left all of the existing heuristics in place. There is only a speed difference on the second block and later. When compressing large enough blocks (say >= 4 KiB) there is no significant difference in compression speed. Dictionary compression of one block is the same speed for blocks with literals <= 1 KiB, and after that the difference is not very significant. * In the synthetic data, with blocks 10 KB or smaller, most blocks can't use repeated tables because the previous block did not contain a symbol that the current block contains. Once blocks are about 12 KB or more, most previous blocks have valid Huffman tables for the current block, and the compression ratio and decompression speed jumped. * In silesia blocks as small as 4KB can frequently reuse the previous Huffman table (85%), but it isn't as profitable, and the previous Huffman table only gets used about 3% of the time. * Microbenchmarks show that `HUF_validateCTable()` takes ~55 ns and `HUF_estimateCompressedSize()` takes ~35 ns. They are decently well optimized, the first versions took 90 ns and 120 ns respectively. `HUF_validateCTable()` could be twice as fast, if we cast the `HUF_CElt` to a `U32` and compare to 0. However, `U32` has an alignment of 4 instead of 2, so I think that might be undefined behavior. * I've ran `zstreamtest` compiled normally, with UASAN and with MSAN for 4 hours each. The worst case for the speed difference is a bunch of small blocks in the same frame. I modified `bench.c` to compress the input in a single frame but with blocks of the given block size, set by `-B`. Benchmarks on level 1: \| Program \| Block size \| Corpus \| Ratio \| Compression MB/s \| Decompression MB/s \| \|-----------\|------------\|-----------\|-------\|------------------\|--------------------\| \| zstd.base \| 256 \| synthetic \| 2.364 \| 110.0 \| 297.0 \| \| zstd \| 256 \| synthetic \| 2.367 \| 108.9 \| 297.0 \| \| zstd.base \| 256 \| silesia \| 2.204 \| 93.8 \| 415.7 \| \| zstd \| 256 \| silesia \| 2.204 \| 93.4 \| 415.7 \| \| zstd.base \| 512 \| synthetic \| 2.594 \| 144.2 \| 420.0 \| \| zstd \| 512 \| synthetic \| 2.599 \| 141.5 \| 425.7 \| \| zstd.base \| 512 \| silesia \| 2.358 \| 118.4 \| 432.6 \| \| zstd \| 512 \| silesia \| 2.358 \| 119.8 \| 432.6 \| \| zstd.base \| 1024 \| synthetic \| 2.790 \| 192.3 \| 594.1 \| \| zstd \| 1024 \| synthetic \| 2.794 \| 192.3 \| 600.0 \| \| zstd.base \| 1024 \| silesia \| 2.524 \| 148.2 \| 464.2 \| \| zstd \| 1024 \| silesia \| 2.525 \| 148.2 \| 467.6 \| \| zstd.base \| 4096 \| synthetic \| 3.023 \| 300.0 \| 1000.0 \| \| zstd \| 4096 \| synthetic \| 3.024 \| 300.0 \| 1010.1 \| \| zstd.base \| 4096 \| silesia \| 2.779 \| 223.1 \| 623.5 \| \| zstd \| 4096 \| silesia \| 2.779 \| 223.1 \| 636.0 \| \| zstd.base \| 16384 \| synthetic \| 3.131 \| 350.0 \| 1150.1 \| \| zstd \| 16384 \| synthetic \| 3.152 \| 350.0 \| 1630.3 \| \| zstd.base \| 16384 \| silesia \| 2.871 \| 296.5 \| 883.3 \| \| zstd \| 16384 \| silesia \| 2.872 \| 294.4 \| 898.3 \|	2017-03-02 13:27:52 -08:00
Anders Oleson	517577bf53	spelling fixes in comments i.e. occurred labeled Huffman	2017-02-20 12:08:59 -08:00
Yann Collet	a0d742b1e4	introduced HUF_buildCTable_wksp(), to reduce stack memory usage	2016-12-01 17:47:30 -08:00
Yann Collet	643d9a234b	replaced usage of FSE_buildCTable by FSE_buildCTable_wksp, using less stack space in the process	2016-12-01 16:24:04 -08:00
Yann Collet	e928f7e16d	introduced ext_wksp variants of count to reduce stack memory usage	2016-12-01 16:13:35 -08:00
Yann Collet	5e00b848a8	FSE_compress_wksp() uses less stack space	2016-11-30 16:46:13 -08:00
Yann Collet	d79a9a00d9	Introduced FSE_compress_wksp() and FSE_buildCTable_wksp() to reduce stack memory usage	2016-11-30 15:52:20 -08:00
Yann Collet	197a55ee7b	fix edge case	2016-10-18 11:27:52 -07:00
Yann Collet	1563bfeabc	fixing FORCE_INLINE for older compilers (#330 )	2016-09-02 11:44:21 -07:00
David Lam	e10f7f3dcb	merge	2016-08-30 12:03:36 -07:00
Yann Collet	14200a20f0	Fixed issue #304 , reported by @borzunov	2016-08-30 06:51:00 -07:00
David Lam	da9d3b7057	Cleanup some errors in typedef comments and remove duplicated HOWTO from zbuff_decompress.c	2016-08-29 17:31:51 -07:00

1 2

60 Commits