Automatically detect the
file format and optimize
the compression.
Compress FASTQ to 2% of the raw file size, which is a
6X reduction in size
compared to Gzip.
The compression speed is
up to 800MB/s under ideal
I/O condition.
Ensure reliability and
consistency by chunked
MD5 verification.
GTX.Zip supports lossless compression for any files, and highly optimized for genomic format, i.e., fastq, fastq.gz, bam. The compression ratio is especially improved when a genome reference is available.
GTX.Zip provides command line or GUI installations, supporting operation systems like Linux, Windows, Mac OSX. Users can adjust parallel threads. Besides, uses may decompress files without license.
GTX.Zip seamlessly works with other powerful software developed by Genetalks Inc., including GTX.Trans, GTX.CAT, GTX.Digest. That will unlock even better features, to transfer during compression, to compute during decompression, and to achieve compressed files. Moreover, integrating with 3rd-party software is handy, via SDK interface of python, C/C++.
In variety conditions, GTX.Zip compressed the sequencing data (fastq format) usually 3-4 times, and up to 6 times better than Gzip did. The sequencing centers and institutions then can achieve significant cost reductions on storage and transfer.
ID | Species | Sequencer | Raw data size(G) | Gzip | GTX.Zip | ||||
---|---|---|---|---|---|---|---|---|---|
time(s) | Compressed size(G) | Ratio(%) | time(s) | Compressed size(G) | Ratio(%) | ||||
SRR6737547 | mice | Illumina/Miseq | 5.2991 | 466 | 1.351 | 25.49 | 68 | 0.3752 | 7.08 |
ERR3929511 | horse | Illumina/Novaseq 6000 | 95.0393 | 6691 | 18.6465 | 19.62 | 226 | 3.0399 | 3.2 |
SRR12922210 | human | Illumina/Xten | 14.5808 | 1294 | 2.9289 | 20.09 | 81 | 0.5247 | 3.6 |
SRR12072893 | horse | Illumina/Hiseq | 18.6982 | 1460 | 4.3923 | 23.49 | 92 | 1.064 | 5.69 |
ERR3528872 | human | Illumina/Nextseq | 11.5441 | 985 | 2.6146 | 22.65 | 127 | 0.7087 | 6.14 |
SRR12845693 | rat | MGI/BGISEQ-500 | 2.9891 | 110 | 0.5098 | 17.05 | 48 | 0.1678 | 5.61 |
SRR15829874 | human | MGI/MGISEQ-2000 | 24.2479 | 2321 | 8.2369 | 33.97 | 90 | 3.8304 | 15.8 |
SRR14773546 | human | MGI/DNBSEQ-T7 | 16.3829 | 1569 | 4.9372 | 30.14 | 131 | 3.0061 | 18.35 |
SRR3206414 | human | IonTorent/Proton | 10.3925 | 964 | 4.5705 | 43.98 | 163 | 2.427 | 23.35 |
SRR12448025 | sesame | IonTorent/S5 | 2.7167 | 264 | 1.1783 | 43.37 | 64 | 0.6772 | 24.93 |
ERR1397639 | human | PacBio/RS | 1.522 | 137 | 0.6605 | 43.4 | 54 | 0.4718 | 31 |
SRR5943529 | human | PacBio/RS II | 2.6287 | 362 | 1.1272 | 42.88 | 72 | 0.8439 | 32.1 |
SRR11816799 | dog | ONT/GridION | 14.1278 | 1098 | 6.8405 | 48.42 | 169 | 4.5035 | 31.88 |
SRR11073097 | human | ONT/MinION | 31.5085 | 2355 | 15.9451 | 50.61 | 321 | 10.1803 | 32.31 |
ERR2585114 | human | ONT/PromethION | 53.2615 | 3999 | 25.3133 | 47.53 | 488 | 17.235 | 32.36 |