Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 49 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Number of different compressors/archivers listed in this test: 232


Pos. Name Compressor Best switches combination Compressed Compress Bits per
  Size Ratio Byte
  (bytes) (%) (b/B)
001PAQ8PX-738603290.51 0.7593
002WinRK 3.1.2MAX (PWCM)39370490.32 0.7744
003LPAQ8841198089.87 0.8103
004PAQAR 4.5-843567489.29 0.8569
005NanoZip 0.09a-cc -m800m44311089.11 0.8715
006ZPAQ 2.05max.cfg 244992388.94 0.8849
007Ocamyd 1.66test1-m8 -s045000488.94 0.8851
008CMM4 0.2b2645597388.79 0.8968
009SLIM 0.23d-o8 -m91246369288.60 0.9120
010BIT 0.7-p=547240288.39 0.9291
011DURILCA 0.5-o547378488.35 0.9319
012PPMonstr J rev.1-o547747288.26 0.9391
013CCM 1.30cCCMx -648072188.18 0.9455
014COMPRESSIA 1.0bBS15 SE MC48625688.05 0.9564
015EPM r9c00439039300586566063231 -m91249166387.91 0.9670
016ASH 07/o9 /m91249900587.73 0.9815
017SCM 0.0.1b(none)52975586.98 1.0419
018ENC 0.15ag -o554094086.70 1.0639
019BSC 3.0.0-m4 -b10 -cp -p54675286.56 1.0754
020GRZipII 0.2.4-m3 L10 -l -a54823186.52 1.0783
021FreeARC 0.666-m=grzip:d:a:l:m354846286.52 1.0787
022HOOK 1.46054961286.49 1.0810
023PIMPLE2(none)55900686.26 1.0995
024M1 0.3btext profile56071786.21 1.1028
025HIPP 0.5819/o556226586.18 1.1059
026PPMY SSE (9A9)/o5 /m91256472586.12 1.1107
027RINGS 1.61056635286.08 1.1139
028TarsaLZP 21Aug2007(none)57799685.79 1.1368
029SZIP 1.12-b41o457858485.78 1.1380
030UHARC 0.6b-mx -md-58810285.54 1.1567
031TC 5.2 dev2(none)58885685.52 1.1582
032RKC 1.02-M912m -o4 -ft -mx59068285.48 1.1618
033BZP 0.3(none)59123585.46 1.1629
034BEE 0.7.9-m3 -d659366985.40 1.1677
035GRZIP 0.7.3-a -f59710985.32 1.1744
036RZM 0.07h(none)61139984.97 1.2025
037Quark 0.95r-l962122384.73 1.2218
038NNTCP664443284.16 1.2675
039SR3ac264987684.02 1.2782
040CTW 0.1-n16M -d772701382.13 1.4299
041LZXQ 0.4normal73664781.89 1.4489
042BruteCM 0.1d(none)73843881.85 1.4524
043FlashZIP 0.99b8-m3 -c774961281.57 1.4744
044777 0.04b1-m5 -mu475185681.52 1.4788
045UFA 0.04b1-m5 -mu475185681.52 1.4788
046BALZ 1.15ex76655481.15 1.5077
047PIM 2.90(none)76657481.15 1.5077
048SQUEEZ 5.63-ux77010381.07 1.5147
049WinTurtle 1.6.0(none)77272281.00 1.5198
0507-Zip 9.25a-m0=ppmd:mem=488k:o=477920080.84 1.5326
051PPMN 1.00b1 km-O4 -M:2577923480.84 1.5326
052PPMVC 1.2-o4 -m1 -u77954780.83 1.5332
053PPMd rev J-o4 -m178104880.80 1.5362
054QC 0.050-078211080.77 1.5383
055WinRAR 4.1b3-ep -m5 -mdE -mc4:1t+78618380.67 1.5463
056ICEOWS 4.20bVery less78705480.65 1.5480
057YZX 0.04(none)79054480.56 1.5549
058CTXf 0.75 b1-mf79172180.54 1.5572
059Ultra7z Opt 0.05(none)79211380.53 1.5580
060QAZAR 0.0pre5-x0 -l079213780.52 1.5580
061STUFFIT 14PPM L4 M28 no opt79215080.52 1.5580
062LZTurbo 0.95-5980359780.24 1.5805
063TURTLE 0.07(none)82108779.81 1.6149
064Quad 1.12x82801879.64 1.6286
065WINZIP 14Best Method84575379.21 1.6635
066X1 0.95aam#84679979.18 1.6655
067ARHANGEL 1.40-mc1600085123979.07 1.6743
068SYMBRA 0.2-m0 -c4 -p285306079.03 1.6778
069PPMX 0.07(none)85790278.91 1.6874
070LZPX(J) 1.2h-885797578.91 1.6875
071LGHA 1.1g-287087078.59 1.7129
072HA 0.999ba2187087078.59 1.7129
073CODEC 3.21-c1087203078.56 1.7151
074QUANTUM 0.97-c7 -t1688559778.23 1.7418
075LZPM 0.16ex88575978.22 1.7421
076RK 1.04.1-mx1 -M52 -B2000088650478.20 1.7436
077KZIP 14-APR-2007/b29089068078.10 1.7518
078CABARC 1.00.0106-m LZX:2189093878.10 1.7523
079ShipInBottle 1.0 b17alg:ppm len:489114578.09 1.7527
080RKUC 1.04-x -o1689142878.08 1.7533
081BSSC 0.95a-et89584577.98 1.7620
082BIX 1.00b7-mdD90040577.86 1.7710
083WinHKI 1.74HKI2 Fastest92564177.24 1.8206
084LZAP 0.20.0b(none)94900376.67 1.8665
085BigCrunch 0.4a1(none)94971776.65 1.8679
086XPv5c095661976.48 1.8815
087ACB 2.00cu96713176.22 1.9022
088RKIVE 1.92-mt2 -mm196783676.21 1.9036
089THOR 0.96e496977276.16 1.9074
090Comprox 0.3.0e197992975.91 1.9274
091BJWFLATE 1.54-s51298520275.78 1.9377
092PSA 0.91a-o24 -m1152K99601775.51 1.9590
093PPMZ2 0.81-t99678575.49 1.9605
094ARI 2.2-t9100115175.39 1.9691
095HAP 3.06(none)100377675.32 1.9743
096SRANK 1.0c7100530775.28 1.9773
097LZDS v2.1-s1 -m5100551175.28 1.9777
098ACE 2.6-m5 -d32100983175.17 1.9862
099WinACE 2.69Max 32Kb100983175.17 1.9862
100WINIMP 1.21M1, Block 200,SUS 16 Mb101139175.13 1.9892
101DST 0.91b-1101558775.03 1.9975
102UHBC 1.0-m3 -d -b9k101708674.99 2.0004
103BioArc 1.9Fast Standard101764574.98 2.0015
104Blizzard 0.24b10000102046074.91 2.0071
105DARK 0.51-b10k102739374.74 2.0207
106LZ2ASzd8102913574.70 2.0241
107LHA32 1.88.3.14-e0 -je32768102953974.69 2.0249
108CSC 3.2a6-m2 -dk32 -fo102989974.68 2.0256
109BVI 1.70-m4103052074.66 2.0269
110ARQ 3.2(none)103061874.66 2.0271
111IMP 1.12-1 -m3103105774.65 2.0279
112BOA 0.58b-m1103117674.65 2.0282
113LHARK 0.4d-ta1 -c5103233974.62 2.0304
114SEMONE 0.6-mf103440274.57 2.0345
115DACT 0.8.42-b11000103935674.45 2.0442
116DeepFreezer 1.06(none)103959174.44 2.0447
117AKT 0.62b(none)104031774.42 2.0461
118LHA 2.55(none)104164374.39 2.0487
119SAR 1.0(none)104164374.39 2.0487
120AR 1.0(none)104164374.39 2.0487
121PUT 3.47(none)104168774.39 2.0488
122ZOO 2.1ah104177574.39 2.0490
123CPAC 1.35+S format=binary104194774.38 2.0493
124PKZIP 2.50(none)104373274.34 2.0529
125PAC 17apr2004comp1104475774.31 2.0549
126LIMIT 1.2-ms104549274.30 2.0563
127Tornado 0.4a-h8104591874.29 2.0572
128HIT 2.10(none)104680874.26 2.0589
129BMA 1.35b-m8k104821574.23 2.0617
130Windows XP built-in(none)104823074.23 2.0617
131DZIP 2.90-5104856674.22 2.0624
132QLFC 6.6w12288104863374.22 2.0625
133GZIP 1.3.5-5104925074.20 2.0637
134ZIP 2.2-5104934574.20 2.0639
135WIN-GZ 1.2(None)104945574.20 2.0641
136vuZIP 1.8Fastest104955274.20 2.0643
137File2Pack 2.0(none)104957774.20 2.0643
138EAZEL 1.0(normal)104958974.20 2.0644
139LHA 2.67-e0104973874.19 2.0647
140 BCArchive 1.08.7(none)104987274.19 2.0649
141WINZIP 8.0(Max Compress)105058574.17 2.0663
142BSA 2.00-+0105211874.13 2.0693
143LZA 1.01(none)105321374.11 2.0715
144DC 0.99.307b-b23 -fb105361174.10 2.0723
145ARJ 2.85-jm -e -jh21000105423774.08 2.0735
146ASD 0.2.0-m1 -mda105539274.05 2.0758
147ESP 1.92/M2105753174.00 2.0800
148AIN 2.32/m2105754874.00 2.0800
149ZET 0.10b-ex105903173.96 2.0829
150CHILE 0.5b16106110773.91 2.0870
151UC II v3.05b-TT106180873.89 2.0884
152QUARK 1.00b/p106198673.89 2.0888
153DCA 1.0.1bFaster106632173.78 2.0973
154AMG 2.2Max compression106913873.71 2.1028
155BWTZIP9000106968373.70 2.1039
156AKT 1.00a3(none)107096373.67 2.1064
157SLUG X(none)107143273.66 2.1073
158SQUEEZE 1.08.4/p1 /q4 /m2107632773.54 2.1170
159MAR-g107991473.45 2.1240
160ZZIP 0.36c-mx -k1108302073.37 2.1301
161BA 1.01-1108474273.33 2.1335
162PACKET 0.91a-m6 -s0108635173.29 2.1367
163M03BS=512Kb109022973.20 2.1443
164EXTREME 1.06-t1109177773.16 2.1474
165SQWEZ 2.3/s109288673.13 2.1495
166ALZip 7.0Normal110219872.90 2.1678
167ZPack(none)111071672.69 2.1846
168Archiver 1.0Dict=2M111515272.58 2.1933
169BZIP 0.21-1112161372.42 2.2060
170Chaos Comp 3.0(none)112442272.36 2.2116
171RAX 1.02-m3112450372.35 2.2117
172BZIP2 1.0.5-1112895872.24 2.2205
173BCM 0.12-b1112916472.24 2.2209
174RZIP 2.1-1113015472.21 2.2228
175BWMonstr 0.02(none)113440272.11 2.2312
176ZAP32 0.15.0b(none)114866071.76 2.2592
177M99 2.2.1-m -1m115325971.65 2.2683
178GCA 0.9k(none)115417771.62 2.2701
179MNZIP0115515171.60 2.2720
180AI 1.1-m2115943571.49 2.2804
181DCGA b8(none)115995271.48 2.2814
182YBS 0.03f-m2m -r116092071.46 2.2833
183LZHUF(none)116415771.38 2.2897
184ABCOMP 2.06(none)116420571.38 2.2898
185ARX 1.0(none)116427171.38 2.2899
186ELI 5750(none)116428871.38 2.2900
187DATAC 1.03-f116429071.38 2.2900
188YAC 1.02(none)116683871.31 2.2950
189SBC 0.970 rev3-b1116816871.28 2.2976
190HYPER 2.5(none)116849571.27 2.2982
191BBB ver1(none)117048971.22 2.3022
192SQUISH 1.0(none)117182171.19 2.3048
193BICOM 1.01(none)117738671.05 2.3157
194PPMT 0.1(none)117760671.05 2.3162
195JAR 1.02-m3117917771.01 2.3193
196ABC 2.4-cv118118070.96 2.3232
197CA-ZIP 3.4-a118963670.75 2.3398
198BWIC(none)119458270.63 2.3496
199OrangeArchiver 1.05(none)119532170.61 2.3510
200ERI 5.1fre(none)120092870.47 2.3620
201DGCA 1.10(none)120260870.43 2.3653
202Crush 0.01cf120446470.39 2.3690
203TRANSFORM 1.02Very Low121285270.18 2.3855
204EXP 1.0(none)122176369.96 2.4030
205PAR 2.00(none)122185669.96 2.4032
20612Ghosts 7.0(none)126016269.02 2.4785
207JCALG1 5.32-2126324268.94 2.4846
208RDMC 0.06c(none)126414268.92 2.4864
209BAR 1.1.2(none)126973068.78 2.4974
210HiP beta 11126979568.78 2.4975
211aPLib 0.43(none)127255068.71 2.5029
212XPA 1.0.2(none)128699868.36 2.5313
213Etincelle RC2(none)135236366.75 2.6599
214BriefLZ 1.04(none)137653366.16 2.7074
215ULZ 0.0.2c6139959765.59 2.7528
216Secura 1.7(none)140457265.47 2.7626
217CODER 1.1-e7 4194304142830164.88 2.8092
218Zhuff 0.2(none)143747764.66 2.8273
219LZC 0.081144132564.56 2.8349
220QuickLZ 1.40b9mode3144623564.44 2.8445
221QPress 0.38b-L3145293464.28 2.8577
222LZOP 1.02rc1-9146026864.10 2.8721
223HuffComp 1.3(none)149250663.31 2.9355
224DLC 0.6.1(none)152100462.61 2.9916
225LCSSR 0.2(none)167291258.87 3.2903
226LZRW1(none)168018958.69 3.3047
227LZP2 0.7d(none)168313158.62 3.3104
228LZ 1.0(none)168428558.59 3.3127
229LCW 0.2(none)171511057.83 3.3733
230Shindlet(none)211841647.92 4.1666
231SHcodec 1.0.1(none)221259345.60 4.3518
232LZBW1 0.8(none)268198734.06 5.2750
233english.dic40674390.00 8.0000

Show historic data

Lossless data compression ratio's of the best and some well know compression programs for an alphabetically sorted word-list file Next Test Home Previous Test


©2003-2011 MaximumCompression (lossless data compression software benchmarks)