Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 48 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Number of different compressors/archivers listed in this test: 214


Pos. Name Compressor Best switches combination Compressed Compress Bits per
  Size Ratio Byte
  (bytes) (%) (b/B)
001PAQ8P-738562090.52 0.7585
002WinRK 3.0.3PWCM 912MB39216190.36 0.7713
003LPAQ8841198089.87 0.8103
004PAQAR 4.5-843567489.29 0.8569
005NanoZip 0.04a-cc -m800m44367989.09 0.8726
006Ocamyd 1.66test1-m8 -s045000488.94 0.8851
007CMM4 0.2b2645597388.79 0.8968
008SLIM 0.23d-o8 -m91246369288.60 0.9120
009DURILCA 0.5-o547378488.35 0.9319
010PPMonstr J rev.1-o547747288.26 0.9391
011CCM 1.30cCCMx -648072188.18 0.9455
012COMPRESSIA 1.0bBS15 SE MC48625688.05 0.9564
013EPM r9c00439039300586566063231 -m91249166387.91 0.9670
014ASH 04a/o7 /s62 /m91249742587.77 0.9784
015ENC 0.15ag -o554094086.70 1.0639
016GRZipII 0.2.4-m3 L10 -l -a54823186.52 1.0783
017FreeARC 0.50a-m=grzip:d:a:l:m354846286.52 1.0787
018HOOK 1.36055391586.38 1.0895
019PIMPLE2(none)55900686.26 1.0995
020HIPP 0.5819/o556226586.18 1.1059
021PPMY SSE (9A9)/o5 /m91256472586.12 1.1107
022RINGS 1.5c456632686.08 1.1139
023TarsaLZP 21Aug2007(none)57799685.79 1.1368
024SZIP 1.12-b41o457858485.78 1.1380
025BIT 0.3-m lwcx -p best57937685.76 1.1395
026M1 0.1apaq58328285.66 1.1472
027UHARC 0.6b-mx -md-58810285.54 1.1567
028TC 5.2 dev2(none)58885685.52 1.1582
029RKC 1.02-M912m -o4 -ft -mx59068285.48 1.1618
030BZP 0.3(none)59123585.46 1.1629
031BEE 0.7.9-m3 -d659366985.40 1.1677
032GRZIP 0.7.3-a -f59710985.32 1.1744
033RZM 0.07h(none)61139984.97 1.2025
034Quark 0.95r-l962122384.73 1.2218
035NNTCP664443284.16 1.2675
036SR3ac264987684.02 1.2782
037LZTurbo 0.92-5968764483.09 1.3525
038FlashZIP 0.91b-m2 -s771463682.43 1.4056
039CTW 0.1-n16M -d772701382.13 1.4299
040LZXQ 0.4normal73664781.89 1.4489
041BruteCM 0.1d(none)73843881.85 1.4524
042BALZ 1.13ex73920781.83 1.4539
043777 0.04b1-m5 -mu475185681.52 1.4788
044UFA 0.04b1-m5 -mu475185681.52 1.4788
045SQUEEZ 5.5-ux77010381.07 1.5147
046WinTurtle 1.6.0(none)77272281.00 1.5198
0477-Zip 4.60b-m0=ppmd:mem=499807b:o=477908580.85 1.5323
048PPMN 1.00b1 km-O4 -M:2577923480.84 1.5326
049PPMVC 1.2-o4 -m1 -u77954780.83 1.5332
050PPMd rev J-o4 -m178104880.80 1.5362
051QC 0.050-078211080.77 1.5383
052STUFFIT 12.0Method 4, L=4, M=1978546480.69 1.5449
053WinRAR 3.80b5-ep -m5 -mdE -mc4:1t+78618380.67 1.5463
054ICEOWS 4.20bVery less78705480.65 1.5480
055CTXf 0.75 b1-mf79172180.54 1.5572
056QAZAR 0.0pre5-x0 -l079213780.52 1.5580
057TURTLE 0.07(none)82108779.81 1.6149
058Quad 1.12x82801879.64 1.6286
059X1 0.95aam#84679979.18 1.6655
060ARHANGEL 1.40-mc1600085123979.07 1.6743
061SYMBRA 0.2-m0 -c4 -p285306079.03 1.6778
062LZPX(J) 1.2h-885797578.91 1.6875
063LGHA 1.1g-287087078.59 1.7129
064HA 0.999ba2187087078.59 1.7129
065CODEC 3.21-c1087203078.56 1.7151
066QUANTUM 0.97-c7 -t1688559778.23 1.7418
067RK 1.04.1-mx1 -M52 -B2000088650478.20 1.7436
068KZIP 14-APR-2007/b29089068078.10 1.7518
069CABARC 1.00.0106-m LZX:2189093878.10 1.7523
070ShipInBottle 1.0 b17alg:ppm len:489114578.09 1.7527
071RKUC 1.04-x -o1689142878.08 1.7533
072BSSC 0.95a-et89584577.98 1.7620
073BIX 1.00b7-mdD90040577.86 1.7710
074LZPM 0.15991182377.58 1.7934
075WinHKI 1.74HKI2 Fastest92564177.24 1.8206
076LZAP 0.20.0b(none)94900376.67 1.8665
077BigCrunch 0.4a1(none)94971776.65 1.8679
078ACB 2.00cu96713176.22 1.9022
079RKIVE 1.92-mt2 -mm196783676.21 1.9036
080THOR 0.96e496977276.16 1.9074
081BJWFLATE 1.54-s51298520275.78 1.9377
082PSA 0.91a-o24 -m1152K99601775.51 1.9590
083PPMZ2 0.81-t99678575.49 1.9605
084ARI 2.2-t9100115175.39 1.9691
085HAP 3.06(none)100377675.32 1.9743
086SRANK 1.0c7100530775.28 1.9773
087LZDS v2.1-s1 -m5100551175.28 1.9777
088ACE 2.04-m5 -d32100983175.17 1.9862
089WinACE 2.69Max 32Kb100983175.17 1.9862
090WINIMP 1.21M1, Block 200,SUS 16 Mb101139175.13 1.9892
091DST 0.91b-1101558775.03 1.9975
092UHBC 1.0-m3 -d -b9k101708674.99 2.0004
093BioArc 1.9Fast Standard101764574.98 2.0015
094Blizzard 0.24b10000102046074.91 2.0071
095DARK 0.51-b10k102739374.74 2.0207
096LZ2ASzd8102913574.70 2.0241
097LHA32 1.88.3.14-e0 -je32768102953974.69 2.0249
098BVI 1.70-m4103052074.66 2.0269
099ARQ 3.2(none)103061874.66 2.0271
100IMP 1.12-1 -m3103105774.65 2.0279
101BOA 0.58b-m1103117674.65 2.0282
102LHARK 0.4d-ta1 -c5103233974.62 2.0304
103SEMONE 0.6-mf103440274.57 2.0345
104DACT 0.8.41-b11000103944674.44 2.0444
105DeepFreezer 1.06(none)103959174.44 2.0447
106AKT 0.62b(none)104031774.42 2.0461
107LHA 2.55(none)104164374.39 2.0487
108SAR 1.0(none)104164374.39 2.0487
109AR 1.0(none)104164374.39 2.0487
110PUT 3.47(none)104168774.39 2.0488
111ZOO 2.1ah104177574.39 2.0490
112CPAC 1.35+S format=binary104194774.38 2.0493
113PKZIP 2.50(none)104373274.34 2.0529
114PAC 17apr2004comp1104475774.31 2.0549
115LIMIT 1.2-ms104549274.30 2.0563
116Tornado 0.4a-h8104591874.29 2.0572
117HIT 2.10(none)104680874.26 2.0589
118BMA 1.35b-m8k104821574.23 2.0617
119Windows XP built-in(none)104823074.23 2.0617
120DZIP 2.90-5104856674.22 2.0624
121QLFC 6.6w12288104863374.22 2.0625
122GZIP 1.3.5-5104925074.20 2.0637
123ZIP 2.2-5104934574.20 2.0639
124WIN-GZ 1.2(None)104945574.20 2.0641
125vuZIP 1.8Fastest104955274.20 2.0643
126File2Pack 2.0(none)104957774.20 2.0643
127EAZEL 1.0(normal)104958974.20 2.0644
128LHA 2.67-e0104973874.19 2.0647
129 BCArchive 1.08.7(none)104987274.19 2.0649
130WINZIP 8.0(Max Compress)105058574.17 2.0663
131WINZIP 11.0(Legacy zip)105066274.17 2.0665
132BSA 2.00-+0105211874.13 2.0693
133LZA 1.01(none)105321374.11 2.0715
134DC 0.99.307b-b23 -fb105361474.10 2.0723
135ARJ 2.84-jm -e -jh21000105424274.08 2.0735
136ASD 0.2.0-m1 -mda105539274.05 2.0758
137ESP 1.92/M2105753174.00 2.0800
138AIN 2.32/m2105754874.00 2.0800
139ZET 0.10b-ex105903173.96 2.0829
140CHILE 0.5b16106110773.91 2.0870
141UC II v3.05b-TT106180873.89 2.0884
142QUARK 1.00b/p106198673.89 2.0888
143DCA 1.0.1bFaster106632173.78 2.0973
144AMG 2.2Max compression106913873.71 2.1028
145BWTZIP9000106968373.70 2.1039
146AKT 1.00a3(none)107096373.67 2.1064
147SQUEEZE 1.08.4/p1 /q4 /m2107632773.54 2.1170
148MAR-g107991473.45 2.1240
149ZZIP 0.36c-mx -k1108302073.37 2.1301
150BA 1.01-1108474273.33 2.1335
151PACKET 0.90b-m4 -s0108825773.24 2.1404
152M03BS=512Kb109022973.20 2.1443
153EXTREME 1.06-t1109177773.16 2.1474
154SQWEZ 2.3/s109288673.13 2.1495
155PIM 2.10(none)109561573.06 2.1549
156ALZip 7.0Normal110219872.90 2.1678
157ZPack(none)111071672.69 2.1846
158SLUG 1.27b(none)111239472.65 2.1879
159Archiver 1.0Dict=2M111515272.58 2.1933
160BZIP 0.21-1112161372.42 2.2060
161Chaos Comp 3.0(none)112442272.36 2.2116
162RAX 1.02-m3112450372.35 2.2117
163BZIP2 1.0.5-1112895872.24 2.2205
164RZIP 2.1-1113015472.21 2.2228
165ZAP32 0.15.0b(none)114866071.76 2.2592
166M99 2.2.1-m -1m115325971.65 2.2683
167GCA 0.9k(none)115417771.62 2.2701
168MNZIP0115515171.60 2.2720
169AI 1.1-m2115943571.49 2.2804
170DCGA b8(none)115995271.48 2.2814
171YBS 0.03f-m2m -r116092071.46 2.2833
172LZHUF(none)116415771.38 2.2897
173ABCOMP 2.06(none)116420571.38 2.2898
174ARX 1.0(none)116427171.38 2.2899
175ELI 5750(none)116428871.38 2.2900
176DATAC 1.03-f116429071.38 2.2900
177YAC 1.02(none)116683871.31 2.2950
178SBC 0.970 rev3-b1116816871.28 2.2976
179HYPER 2.5(none)116849571.27 2.2982
180BBB ver1(none)117048971.22 2.3022
181SQUISH 1.0(none)117182171.19 2.3048
182BICOM 1.01(none)117738671.05 2.3157
183PPMT 0.1(none)117760671.05 2.3162
184JAR 1.02-m3117917771.01 2.3193
185ABC 2.4-cv118118070.96 2.3232
186CA-ZIP 3.4-a118963670.75 2.3398
187BWIC(none)119458270.63 2.3496
188OrangeArchiver 1.05(none)119532170.61 2.3510
189ERI 5.1fre(none)120092870.47 2.3620
190DGCA 1.10(none)120260870.43 2.3653
191TRANSFORM 1.02Very Low121285270.18 2.3855
192EXP 1.0(none)122176369.96 2.4030
193PAR 2.00(none)122185669.96 2.4032
19412Ghosts 7.0(none)126016269.02 2.4785
195JCALG1 5.32-2126324268.94 2.4846
196RDMC 0.06c(none)126414268.92 2.4864
197BAR 1.1.2(none)126973068.78 2.4974
198HiP beta 11126979568.78 2.4975
199aPLib 0.43(none)127255068.71 2.5029
200XPA 1.0.2(none)128699868.36 2.5313
201BriefLZ 1.04(none)137653366.16 2.7074
202Secura 1.7(none)140457265.47 2.7626
203CODER 1.1-e7 4194304142830164.88 2.8092
204LZC 0.081144132564.56 2.8349
205QuickLZ 1.40b9mode3144623564.44 2.8445
206LZOP 1.02rc1-9146026864.10 2.8721
207HuffComp 1.3(none)149250663.31 2.9355
208DLC 0.6.1(none)152100462.61 2.9916
209LCSSR 0.2(none)167291258.87 3.2903
210LZRW1(none)168018958.69 3.3047
211LZ 1.0(none)168428558.59 3.3127
212LCW 0.2(none)171511057.83 3.3733
213Shindlet(none)211841647.92 4.1666
214SHcodec 1.0.1(none)221259345.60 4.3518
215english.dic40674390.00 8.0000

Show historic data

Lossless data compression ratio's of the best and some well know compression programs for an alphabetically sorted word-list file Next Test Home Previous Test


©2003-2008 MaximumCompression (lossless data compression software benchmarks)