GTX.Zip genomic data compression system

GTX.Zip is a powerful compression system intended for the genomic industry, which supports lossless compression of any format, and highly optimizes for genomic files. It is one of the best and fastest systems able to compress genomic data at a speed of up to 800 MB/s and reduce the file size up to 98%. Therefore the GTX.Zip can reduce a lot of cost on storage and transmission.

Product benifits
Dynamic optimization

Automatically detect the
file format and optimize
the compression.

Best and lossless

Compress FASTQ to 2% of
the raw file size, which is a
6X reduction in size
compared to Gzip.

High speed

The compression speed is
up to 800MB/s under ideal
I/O condition.

Safe and Security

Ensure reliability and
consistency by chunked
MD5 verification.

Product features
1
Optimization
Specialized optimization

GTX.Zip supports lossless compression for any files, and highly optimized for genomic format, i.e., fastq, fastq.gz, bam. The compression ratio is especially improved when a genome reference is available.

2
Flexibility
Convenient and flexible

GTX.Zip provides command line or GUI installations, supporting operation systems like Linux, Windows, Mac OSX. Users can adjust parallel threads. Besides, uses may decompress files without license.

3
Ecosystem
Ecological integrity

GTX.Zip seamlessly works with other powerful software developed by Genetalks Inc., including GTX.Trans, GTX.CAT, GTX.Digest. That will unlock even better features, to transfer during compression, to compute during decompression, and to achieve compressed files. Moreover, integrating with 3rd-party software is handy, via SDK interface of python, C/C++.

Typical Application Scenarios
Compress the sequencing data

In variety conditions, GTX.Zip compressed the sequencing data (fastq format) usually 3-4 times, and up to 6 times better than Gzip did. The sequencing centers and institutions then can achieve significant cost reductions on storage and transfer.

Comparison Test
Compare the compression ratios across sequencing platforms
Benchmark of compression ratio and speed for real NGS data
IDSpeciesSequencerRaw data size(G)GzipGTX.Zip
time(s)Compressed size(G)Ratio(%)time(s)Compressed size(G)Ratio(%)
SRR6737547miceIllumina/Miseq5.29914661.35125.49680.37527.08
ERR3929511horseIllumina/Novaseq 600095.0393669118.646519.622263.03993.2
SRR12922210humanIllumina/Xten14.580812942.928920.09810.52473.6
SRR12072893horseIllumina/Hiseq18.698214604.392323.49921.0645.69
ERR3528872humanIllumina/Nextseq11.54419852.614622.651270.70876.14
SRR12845693ratMGI/BGISEQ-5002.98911100.509817.05480.16785.61
SRR15829874humanMGI/MGISEQ-200024.247923218.236933.97903.830415.8
SRR14773546humanMGI/DNBSEQ-T716.382915694.937230.141313.006118.35
SRR3206414humanIonTorent/Proton10.39259644.570543.981632.42723.35
SRR12448025sesameIonTorent/S52.71672641.178343.37640.677224.93
ERR1397639humanPacBio/RS1.5221370.660543.4540.471831
SRR5943529humanPacBio/RS II2.62873621.127242.88720.843932.1
SRR11816799dogONT/GridION14.127810986.840548.421694.503531.88
SRR11073097humanONT/MinION31.5085235515.945150.6132110.180332.31
ERR2585114humanONT/PromethION53.2615399925.313347.5348817.23532.36
Benchmark platform:Intel(R) Core(TM) i9-7980XE CPU @ 2.60GHz / 18 cores 36 threads / 128GB RAMStorage:HDD SATA
FeedbackPlease fill out the form below and let us know how we can help you.