[SYSTEMDS-3779] Add ColGroupDDCLZW with LZW-compressed MapToData #2398
florian-jobs wants to merge 42 commits into apache:main
Conversation
…extending APreAgg like ColGroupDDC does, for easier implementation. Idea: store only the compressed version of the _data vector plus the necessary metadata. If decompression is needed, we reconstruct the _data vector from the metadata and the compressed _data vector. Decompression takes place at most once. This is just one idea; there are other ways of implementing it.
…ng of Decompress
…and decompress and the data structures they use compatible.
…lgorithms and tried to make them compatible.
…DC test for ColGroupDDCTest. Improved compress/decompress methods in LZW class.
…lemented from its interface.
…mapping
This commit adds an initial implementation of ColGroupDDCLZW, a new column group that stores the mapping vector in LZW-compressed form instead of materializing MapToData explicitly. The design focuses on enabling sequential access directly on the compressed representation, while complex access patterns are intended to fall back to DDC. No cache or lazy decompression mechanism is introduced at this stage.
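The core idea can be sketched as plain LZW over the dictionary-index vector. The class and method names below are illustrative only (not the PR's actual API; the real ColGroupDDCLZW integrates with AMapToData / APreAgg): single dictionary indices are the initial alphabet, phrases are learned during encoding, and the index vector is reconstructed on demand.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of LZW over a dictionary-index vector; not the PR's actual classes. */
public class LZWSketch {

	/** Compress a vector of dictionary indices in [0, nUnique). */
	public static List<Integer> compress(int[] data, int nUnique) {
		Map<String, Integer> dict = new HashMap<>();
		for(int s = 0; s < nUnique; s++)
			dict.put(Integer.toString(s), s); // every single symbol starts with its own code
		int nextCode = nUnique;
		List<Integer> out = new ArrayList<>();
		String w = "";
		for(int sym : data) {
			String wk = w.isEmpty() ? Integer.toString(sym) : w + "," + sym;
			if(dict.containsKey(wk))
				w = wk; // extend the current phrase
			else {
				out.add(dict.get(w));
				dict.put(wk, nextCode++); // learn the new phrase
				w = Integer.toString(sym);
			}
		}
		if(!w.isEmpty())
			out.add(dict.get(w));
		return out;
	}

	/** Reconstruct the index vector of known length outLen from the code stream. */
	public static int[] decompress(List<Integer> codes, int nUnique, int outLen) {
		List<int[]> dict = new ArrayList<>();
		for(int s = 0; s < nUnique; s++)
			dict.add(new int[] {s});
		int[] out = new int[outLen];
		int pos = 0;
		int[] w = null;
		for(int code : codes) {
			int[] entry;
			if(code < dict.size())
				entry = dict.get(code);
			else { // classic LZW special case: code not yet in the dictionary
				entry = new int[w.length + 1];
				System.arraycopy(w, 0, entry, 0, w.length);
				entry[w.length] = w[0];
			}
			for(int v : entry)
				out[pos++] = v;
			if(w != null) {
				int[] ne = new int[w.length + 1];
				System.arraycopy(w, 0, ne, 0, w.length);
				ne[w.length] = entry[0];
				dict.add(ne); // mirror the encoder's dictionary growth
			}
			w = entry;
		}
		return out;
	}
}
```

On a repetitive mapping like 0,1,0,1,… the code stream quickly becomes shorter than the input, which is exactly the case where this column group should beat plain DDC.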
…press(). Decompress will now return an empty map if the index is zero.
Thank you for the PR. I left some comments in the code.
In general, please use tabs instead of spaces to make the diff more readable (can be done by importing the codestyle xml). It would be good if we are able to create the column group similar to this:
CompressionSettingsBuilder csb = new CompressionSettingsBuilder().setSamplingRatio(1.0)
.setValidCompressions(EnumSet.of(AColGroup.CompressionType.DDCLZW))
.setTransposeInput("false");
CompressionSettings cs = csb.create();
final CompressedSizeInfoColGroup cgi = new ComEstExact(mbt, cs).getColGroupInfo(colIndexes);
CompressedSizeInfo csi = new CompressedSizeInfo(cgi);
AColGroup cg = ColGroupFactory.compressColGroups(mbt, csi, cs, 1).get(0);
So corresponding features / methods to support this should be implemented.
All implemented methods must be covered by tests
Please add some more tests to really verify correctness. For example, do a full compression and then decompress it again, and compare the result to the original data.
…GroupDDCTest back to correct formatting. Added LZWMappingIterator to decompress values on the fly without having to allocate full compression map [WIP]. Added Test class ColGroupDDCLZWTest.
Signed-off-by: Luka Dekanozishvili <luka.dekanozishvili1@gmail.com>
Update for benchmarks

Addressing the feedback:
I wasn't able to "generate" data that matched a given entropy (percentage), but I added a helper function to calculate the Shannon entropy of the given arrays. It is now displayed in the benchmarks.
I added
I adjusted the sizes to

Remarks
The main difficulty was judging which benchmarks are useful, since most of my entropy values were fairly high to maximal. I also commented out the
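Such an entropy helper is small; a possible shape (names are mine, not the benchmark's actual code). Note that Shannon entropy depends only on the value histogram, not on the ordering of values, which is also why it says little about how well a sequence model like LZW will compress the data.

```java
import java.util.HashMap;
import java.util.Map;

/** Sketch of a Shannon-entropy helper; names are illustrative. */
public class EntropyUtil {

	/** Shannon entropy in bits per symbol of the value distribution in a. */
	public static double shannonEntropy(int[] a) {
		Map<Integer, Integer> counts = new HashMap<>();
		for(int v : a)
			counts.merge(v, 1, Integer::sum); // build the histogram
		double h = 0.0;
		for(int c : counts.values()) {
			double p = (double) c / a.length;
			h -= p * (Math.log(p) / Math.log(2)); // H = -sum p * log2(p)
		}
		return h;
	}
}
```

For example, {0,1,0,1,0,1} and {0,0,0,1,1,1} both have entropy 1.0 bit per symbol, although their run structure, and hence their LZW compressibility, differs.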
…cheme according to guidelines.
… it into the compression pipeline and serialization framework.
… some documentation for non-native DDC methods in the DDCLZW class.
… by IDE. Removed unnecessary comments from the DDCLZW and DDCLZWTest classes. Optimized some tests to use the compression framework.
…create …ngsBuilder
…al decompression and adjusting the function decompress to become decompressFull
…ssCompression_NoRepetition to assertLosslessCompressionNoRepetition in order to pass codestyle test.
Force-pushed from 9241629 to c5cc4c0
Okay, cool progress on the results! However, I'm a bit skeptical about your byte estimates for the sizes. Do you do extra packing based on the number of bits in your implementation? The ideal values for the current DDC implementation are 2, 256, and 65,536 unique values, to avoid bit manipulations on lookup. I'd love to see some results with your idealized input to get a range of what to expect vs. what you get. A recipe for X unique values at length L could be:
I don't know if it's exactly optimal, but it should be pretty good.
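One concrete recipe along these lines (my own sketch toward the stated goal, not necessarily the one the reviewer referred to): cycle through the X unique values, so each value appears equally often and adjacent repetition is minimal, which is close to a worst case for run- and phrase-based coders.

```java
/** Hypothetical data generator for compression benchmarks; names are illustrative. */
public class DataRecipe {

	/** Generate a length-L vector that cycles through X unique values. */
	public static int[] cyclic(int X, int L) {
		int[] out = new int[L];
		for(int i = 0; i < L; i++)
			out[i] = i % X; // 0, 1, ..., X-1, 0, 1, ...
		return out;
	}
}
```

Feeding cyclic(2, L), cyclic(256, L), and cyclic(65536, L) into both DDC and DDCLZW would cover exactly the "ideal" DDC widths mentioned above.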
At the moment the codes are still stored as int values by the LZW logic, but I'm in the process of changing the storage representation. Instead of storing one code per array element, I'm implementing a bit-packed long wordstream, where codes are packed based on a fixed bit width (derived from the maximum emitted code), with the option to extend this to a growing bit-width policy later if needed.
I have just tested it with the highest "optimal" values for DDC in the "distributed" benchmark, so with datasets like:
There is a big jump at the
Nevertheless, whenever
I have also noticed that the entropy doesn't really influence the compression rate that much, since entropy measures "how distributed" the values are and not "how they're arranged". So
… improvements to tests
I also changed the
I am still fairly sure the memory estimates are wrong (but I would be happy to be proven wrong). Please override this method in your DDCLZW file:
Please use the utilities associated with estimating memory size, as the other column groups do.
Yes, I think our memory estimates are indeed wrong @Baunsgaard. We were using getExactSizeOnDisk instead of estimateInMemorySize for memory estimation.
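To illustrate why the two numbers differ (a generic sketch of typical 64-bit HotSpot heap layout with compressed oops, not SystemDS's actual estimation utilities): an on-heap array pays an object header and 8-byte alignment padding that the serialized on-disk format does not.

```java
/** Generic sketch of in-memory vs. on-disk size of a long[]; not SystemDS's utilities. */
public class MemEstimate {

	/** Approximate JVM heap size of a long[] (typical 64-bit HotSpot, compressed oops). */
	public static long longArraySize(long length) {
		long header = 16;                   // mark word + class pointer + array length (padded)
		long data = 8 * length;             // 8 bytes per long element
		return (header + data + 7) / 8 * 8; // object size is 8-byte aligned
	}

	/** On disk the same array needs only a length field plus the raw payload. */
	public static long longArrayDiskSize(long length) {
		return 4 + 8 * length; // int length + payload
	}
}
```

For small arrays the relative gap is large (an empty long[] still occupies 16 heap bytes), which is why reusing getExactSizeOnDisk as a memory estimate systematically undercounts.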
…InMemorySize instead of getExactSizeOnDisk for memory estimation.
We changed the
Observation from the "distributed" benchmark:
Below are the results from
Okay, now the memory numbers sound more plausible! Can you verify on all the tested instances that when we decompress either the DDC or the DDCLZW, they reconstruct equivalent matrices?
…s in benchmark class. Improved testing class.
I added tests to ensure the reconstructed matrices in the benchmarking class are equivalent.
janniklinde left a comment
Thanks for the continued contribution @florian-jobs. I have two more minor comments on the code, but overall I think we're good to merge the new column group.
			c[rowBaseOff + j] = values[rowIndex + j];
		}
	}
	else {
		for(int i = rl, offT = rl + offR; i < ru; i++, offT++) {
			final double[] c = db.values(offT);
			final int off = db.pos(offT) + offC;
			final int dictIdx = it.next();
			final int rowIndex = dictIdx * nCol;

			for(int j = 0; j < nCol; j++) {
				final int colIdx = _colIndexes.get(j);
				c[off + colIdx] = values[rowIndex + j];
Setting instead of using += could be problematic; at least AColIndex.decompressVec uses +=.
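A toy illustration of why the accumulate convention matters (names are illustrative, not SystemDS's signatures): when several column groups contribute to overlapping output cells, plain assignment would silently drop earlier contributions, while += preserves them.

```java
/** Toy sketch of the += decompress convention; names are illustrative. */
public class DecompressAdd {

	/** Accumulate decompressed values into c instead of overwriting. */
	public static void decompressVec(double[] c, int off, double[] values, int rowIndex, int nCol) {
		for(int j = 0; j < nCol; j++)
			c[off + j] += values[rowIndex + j]; // += so prior contributions survive
	}
}
```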
	final double[] aval = sb.values(di);

	for(int j = apos; j < alen; j++) {
		sbr.append(_colIndexes.get(aix[j]), i, aval[apos]);
I think you mean aval[j] and not aval[apos], right?
I implemented decompressToSparseBlockTransposedSparseDictionary directly from ColGroupDDC, so I assumed it's correct. The only difference is that ColGroupDDCLZW uses an iterator to retrieve the data values.
I see, thanks for the clarification. However, it looks incorrect to me (both for DDC and DDCLZW then). @Baunsgaard am I missing something here?
I will change it to use aval[j] instead of aval[apos] @janniklinde; I think the current version is incorrect.
It can very well be that I had a bug in the code.
You can always write a test, rather than guessing :) ?
Testing verified that aval[j] should be used instead of aval[apos]. I pushed the fix together with the corresponding test.
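For reference, a self-contained sketch of the corrected indexing (illustrative names, mirroring the quoted loop): the value paired with column index aix[j] is aval[j], whereas aval[apos] would repeat the first value of the row slice for every appended entry.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch of the fixed sparse-row append; types and names are illustrative. */
public class SparseAppendFix {

	/** Collect (col, row, value) triples from a sparse row slice [apos, alen). */
	public static List<double[]> appendRow(int row, int[] aix, double[] aval, int apos, int alen) {
		List<double[]> out = new ArrayList<>();
		for(int j = apos; j < alen; j++)
			out.add(new double[] {aix[j], row, aval[j]}); // aval[j], not aval[apos]
		return out;
	}
}
```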
…oSparseBlockTransposedSparseDictionary.
Summary
This PR introduces a new column group ColGroupDDCLZW that stores the mapping vector in LZW-compressed form.

Key design points
- MapToData is not stored explicitly; only the compressed LZW representation is kept.
- Sequential access iterates directly over _dataLZW without full decompression.

Current status
Feedback on design and integration is very welcome.