-
Notifications
You must be signed in to change notification settings - Fork 44
Open
Description
Describe the new term or terms you would like to add.
Following on the discussion from the PSI Spring Workshop, since mzMLb did not receive uptake from the community for all the previously discussed reasons, we will revisit adding ZSTD compression to mzML.
The conversation proposes:
- Provide ZSTD compression with byte shuffling with a "trade name".
- Provide delta encoding followed by ZSTD compression with byte shuffling with a "trade name".
- Do not propose separate ZSTD without byte shuffling because it introduces an extra complexity.
I ran some experiments, generate points from 100.0 to 2100.0 for double precision floats spaced by 0.01 using ZSTD compression level 9 and zlib compression level 9:
| Raw | Zlib | Shuffle + Zstd | Delta + Shuffle + Zstd | |
|---|---|---|---|---|
| Bytes | 1600000 | 570704 | 12040 | 560 |
I did not run a timing experiment because that will be language-sensitive.
Begin Bikeshedding
We wanted to use a trade name because it seemed to the room that a verbose name would be harder to understand.
Some randomly generated names:
- ZSTD-Shuffle - The anti-trade name just call it what it is
- MZSTD - The least creative adaptation of the domain
- MZD - I was wrong, I could be less creative
Metadata
Metadata
Assignees
Labels
No labels