Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 49 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Number of different compressors/archivers listed in this test: 228


Pos. Name Compressor Best switches combination Compressed Compress Bits per
  Size Ratio Byte
  (bytes) (%) (b/B)
001PAQ8PX-738603290.51 0.7593
002WinRK 3.1.2MAX (PWCM)39370490.32 0.7744
003LPAQ8841198089.87 0.8103
004PAQAR 4.5-843567489.29 0.8569
005NanoZip 0.08a-cc -m640m44424389.08 0.8738
006ZPAQ 1.10max.cfg 244992388.94 0.8849
007Ocamyd 1.66test1-m8 -s045000488.94 0.8851
008CMM4 0.2b2645597388.79 0.8968
009SLIM 0.23d-o8 -m91246369288.60 0.9120
010BIT 0.7-p=547240288.39 0.9291
011DURILCA 0.5-o547378488.35 0.9319
012PPMonstr J rev.1-o547747288.26 0.9391
013CCM 1.30cCCMx -648072188.18 0.9455
014COMPRESSIA 1.0bBS15 SE MC48625688.05 0.9564
015EPM r9c00439039300586566063231 -m91249166387.91 0.9670
016ASH 07/o9 /m91249900587.73 0.9815
017ENC 0.15ag -o554094086.70 1.0639
018BSC 2.2.5-m1 -b12 -cp54759586.54 1.0770
019GRZipII 0.2.4-m3 L10 -l -a54823186.52 1.0783
020FreeARC 0.666-m=grzip:d:a:l:m354846286.52 1.0787
021HOOK 1.46054961286.49 1.0810
022PIMPLE2(none)55900686.26 1.0995
023M1 0.3btext profile56071786.21 1.1028
024HIPP 0.5819/o556226586.18 1.1059
025PPMY SSE (9A9)/o5 /m91256472586.12 1.1107
026RINGS 1.61056635286.08 1.1139
027TarsaLZP 21Aug2007(none)57799685.79 1.1368
028SZIP 1.12-b41o457858485.78 1.1380
029UHARC 0.6b-mx -md-58810285.54 1.1567
030TC 5.2 dev2(none)58885685.52 1.1582
031RKC 1.02-M912m -o4 -ft -mx59068285.48 1.1618
032BZP 0.3(none)59123585.46 1.1629
033BEE 0.7.9-m3 -d659366985.40 1.1677
034GRZIP 0.7.3-a -f59710985.32 1.1744
035RZM 0.07h(none)61139984.97 1.2025
036Quark 0.95r-l962122384.73 1.2218
037NNTCP664443284.16 1.2675
038SR3ac264987684.02 1.2782
039CTW 0.1-n16M -d772701382.13 1.4299
040LZXQ 0.4normal73664781.89 1.4489
041BruteCM 0.1d(none)73843881.85 1.4524
042FlashZIP 0.99b8-m3 -c774961281.57 1.4744
043UFA 0.04b1-m5 -mu475185681.52 1.4788
044777 0.04b1-m5 -mu475185681.52 1.4788
045BALZ 1.15ex76655481.15 1.5077
046PIM 2.90(none)76657481.15 1.5077
047SQUEEZ 5.63-ux77010381.07 1.5147
048WinTurtle 1.6.0(none)77272281.00 1.5198
0497-Zip 9.15-m0=ppmd:mem=488k:o=477920080.84 1.5326
050PPMN 1.00b1 km-O4 -M:2577923480.84 1.5326
051PPMVC 1.2-o4 -m1 -u77954780.83 1.5332
052PPMd rev J-o4 -m178104880.80 1.5362
053QC 0.050-078211080.77 1.5383
054WinRAR 3.91-ep -m5 -mdE -mc4:1t+78618380.67 1.5463
055ICEOWS 4.20bVery less78705480.65 1.5480
056YZX 0.04(none)79054480.56 1.5549
057CTXf 0.75 b1-mf79172180.54 1.5572
058Ultra7z Opt 0.05(none)79211380.53 1.5580
059QAZAR 0.0pre5-x0 -l079213780.52 1.5580
060STUFFIT 14PPM L4 M28 no opt79215080.52 1.5580
061LZTurbo 0.95-5980359780.24 1.5805
062PPMX 0.05(none)81034980.08 1.5938
063TURTLE 0.07(none)82108779.81 1.6149
064Quad 1.12x82801879.64 1.6286
065WINZIP 14Best Method84575379.21 1.6635
066X1 0.95aam#84679979.18 1.6655
067ARHANGEL 1.40-mc1600085123979.07 1.6743
068SYMBRA 0.2-m0 -c4 -p285306079.03 1.6778
069LZPX(J) 1.2h-885797578.91 1.6875
070HA 0.999ba2187087078.59 1.7129
071LGHA 1.1g-287087078.59 1.7129
072CODEC 3.21-c1087203078.56 1.7151
073QUANTUM 0.97-c7 -t1688559778.23 1.7418
074LZPM 0.16ex88575978.22 1.7421
075RK 1.04.1-mx1 -M52 -B2000088650478.20 1.7436
076KZIP 14-APR-2007/b29089068078.10 1.7518
077CABARC 1.00.0106-m LZX:2189093878.10 1.7523
078ShipInBottle 1.0 b17alg:ppm len:489114578.09 1.7527
079RKUC 1.04-x -o1689142878.08 1.7533
080BSSC 0.95a-et89584577.98 1.7620
081BIX 1.00b7-mdD90040577.86 1.7710
082WinHKI 1.74HKI2 Fastest92564177.24 1.8206
083LZAP 0.20.0b(none)94900376.67 1.8665
084BigCrunch 0.4a1(none)94971776.65 1.8679
085ACB 2.00cu96713176.22 1.9022
086RKIVE 1.92-mt2 -mm196783676.21 1.9036
087THOR 0.96e496977276.16 1.9074
088BJWFLATE 1.54-s51298520275.78 1.9377
089PSA 0.91a-o24 -m1152K99601775.51 1.9590
090PPMZ2 0.81-t99678575.49 1.9605
091ARI 2.2-t9100115175.39 1.9691
092HAP 3.06(none)100377675.32 1.9743
093SRANK 1.0c7100530775.28 1.9773
094LZDS v2.1-s1 -m5100551175.28 1.9777
095ACE 2.6-m5 -d32100983175.17 1.9862
096WinACE 2.69Max 32Kb100983175.17 1.9862
097WINIMP 1.21M1, Block 200,SUS 16 Mb101139175.13 1.9892
098DST 0.91b-1101558775.03 1.9975
099UHBC 1.0-m3 -d -b9k101708674.99 2.0004
100BioArc 1.9Fast Standard101764574.98 2.0015
101Blizzard 0.24b10000102046074.91 2.0071
102DARK 0.51-b10k102739374.74 2.0207
103LZ2ASzd8102913574.70 2.0241
104LHA32 1.88.3.14-e0 -je32768102953974.69 2.0249
105CSC 3.2a6-m2 -dk32 -fo102989974.68 2.0256
106BVI 1.70-m4103052074.66 2.0269
107ARQ 3.2(none)103061874.66 2.0271
108IMP 1.12-1 -m3103105774.65 2.0279
109BOA 0.58b-m1103117674.65 2.0282
110LHARK 0.4d-ta1 -c5103233974.62 2.0304
111SEMONE 0.6-mf103440274.57 2.0345
112DACT 0.8.41-b11000103944674.44 2.0444
113DeepFreezer 1.06(none)103959174.44 2.0447
114AKT 0.62b(none)104031774.42 2.0461
115AR 1.0(none)104164374.39 2.0487
116SAR 1.0(none)104164374.39 2.0487
117LHA 2.55(none)104164374.39 2.0487
118PUT 3.47(none)104168774.39 2.0488
119ZOO 2.1ah104177574.39 2.0490
120CPAC 1.35+S format=binary104194774.38 2.0493
121PKZIP 2.50(none)104373274.34 2.0529
122PAC 17apr2004comp1104475774.31 2.0549
123LIMIT 1.2-ms104549274.30 2.0563
124Tornado 0.4a-h8104591874.29 2.0572
125HIT 2.10(none)104680874.26 2.0589
126BMA 1.35b-m8k104821574.23 2.0617
127Windows XP built-in(none)104823074.23 2.0617
128DZIP 2.90-5104856674.22 2.0624
129QLFC 6.6w12288104863374.22 2.0625
130GZIP 1.3.5-5104925074.20 2.0637
131ZIP 2.2-5104934574.20 2.0639
132WIN-GZ 1.2(None)104945574.20 2.0641
133vuZIP 1.8Fastest104955274.20 2.0643
134File2Pack 2.0(none)104957774.20 2.0643
135EAZEL 1.0(normal)104958974.20 2.0644
136LHA 2.67-e0104973874.19 2.0647
137 BCArchive 1.08.7(none)104987274.19 2.0649
138WINZIP 8.0(Max Compress)105058574.17 2.0663
139BSA 2.00-+0105211874.13 2.0693
140LZA 1.01(none)105321374.11 2.0715
141DC 0.99.307b-b23 -fb105361174.10 2.0723
142ARJ 2.85-jm -e -jh21000105423774.08 2.0735
143ASD 0.2.0-m1 -mda105539274.05 2.0758
144ESP 1.92/M2105753174.00 2.0800
145AIN 2.32/m2105754874.00 2.0800
146ZET 0.10b-ex105903173.96 2.0829
147CHILE 0.5b16106110773.91 2.0870
148UC II v3.05b-TT106180873.89 2.0884
149QUARK 1.00b/p106198673.89 2.0888
150DCA 1.0.1bFaster106632173.78 2.0973
151AMG 2.2Max compression106913873.71 2.1028
152BWTZIP9000106968373.70 2.1039
153AKT 1.00a3(none)107096373.67 2.1064
154SLUG X(none)107143273.66 2.1073
155SQUEEZE 1.08.4/p1 /q4 /m2107632773.54 2.1170
156MAR-g107991473.45 2.1240
157ZZIP 0.36c-mx -k1108302073.37 2.1301
158BA 1.01-1108474273.33 2.1335
159PACKET 0.91a-m6 -s0108635173.29 2.1367
160M03BS=512Kb109022973.20 2.1443
161EXTREME 1.06-t1109177773.16 2.1474
162SQWEZ 2.3/s109288673.13 2.1495
163ALZip 7.0Normal110219872.90 2.1678
164ZPack(none)111071672.69 2.1846
165Archiver 1.0Dict=2M111515272.58 2.1933
166BZIP 0.21-1112161372.42 2.2060
167Chaos Comp 3.0(none)112442272.36 2.2116
168RAX 1.02-m3112450372.35 2.2117
169BCM 0.11-b1112496972.34 2.2126
170BZIP2 1.0.5-1112895872.24 2.2205
171RZIP 2.1-1113015472.21 2.2228
172BWMonstr 0.02(none)113440272.11 2.2312
173ZAP32 0.15.0b(none)114866071.76 2.2592
174M99 2.2.1-m -1m115325971.65 2.2683
175GCA 0.9k(none)115417771.62 2.2701
176MNZIP0115515171.60 2.2720
177AI 1.1-m2115943571.49 2.2804
178DCGA b8(none)115995271.48 2.2814
179YBS 0.03f-m2m -r116092071.46 2.2833
180LZHUF(none)116415771.38 2.2897
181ABCOMP 2.06(none)116420571.38 2.2898
182ARX 1.0(none)116427171.38 2.2899
183ELI 5750(none)116428871.38 2.2900
184DATAC 1.03-f116429071.38 2.2900
185YAC 1.02(none)116683871.31 2.2950
186SBC 0.970 rev3-b1116816871.28 2.2976
187HYPER 2.5(none)116849571.27 2.2982
188BBB ver1(none)117048971.22 2.3022
189SQUISH 1.0(none)117182171.19 2.3048
190BICOM 1.01(none)117738671.05 2.3157
191PPMT 0.1(none)117760671.05 2.3162
192JAR 1.02-m3117917771.01 2.3193
193ABC 2.4-cv118118070.96 2.3232
194CA-ZIP 3.4-a118963670.75 2.3398
195BWIC(none)119458270.63 2.3496
196OrangeArchiver 1.05(none)119532170.61 2.3510
197ERI 5.1fre(none)120092870.47 2.3620
198DGCA 1.10(none)120260870.43 2.3653
199TRANSFORM 1.02Very Low121285270.18 2.3855
200EXP 1.0(none)122176369.96 2.4030
201PAR 2.00(none)122185669.96 2.4032
20212Ghosts 7.0(none)126016269.02 2.4785
203JCALG1 5.32-2126324268.94 2.4846
204RDMC 0.06c(none)126414268.92 2.4864
205BAR 1.1.2(none)126973068.78 2.4974
206HiP beta 11126979568.78 2.4975
207aPLib 0.43(none)127255068.71 2.5029
208XPA 1.0.2(none)128699868.36 2.5313
209Etincelle RC2(none)135236366.75 2.6599
210BriefLZ 1.04(none)137653366.16 2.7074
211ULZ 0.0.2c6139959765.59 2.7528
212Secura 1.7(none)140457265.47 2.7626
213CODER 1.1-e7 4194304142830164.88 2.8092
214Zhuff 0.2(none)143747764.66 2.8273
215LZC 0.081144132564.56 2.8349
216QuickLZ 1.40b9mode3144623564.44 2.8445
217QPress 0.38b-L3145293464.28 2.8577
218LZOP 1.02rc1-9146026864.10 2.8721
219HuffComp 1.3(none)149250663.31 2.9355
220DLC 0.6.1(none)152100462.61 2.9916
221LCSSR 0.2(none)167291258.87 3.2903
222LZRW1(none)168018958.69 3.3047
223LZP2 0.7d(none)168313158.62 3.3104
224LZ 1.0(none)168428558.59 3.3127
225LCW 0.2(none)171511057.83 3.3733
226Shindlet(none)211841647.92 4.1666
227SHcodec 1.0.1(none)221259345.60 4.3518
228LZBW1 0.8(none)268198734.06 5.2750
229english.dic40674390.00 8.0000

Show historic data

Lossless data compression ratio's of the best and some well know compression programs for an alphabetically sorted word-list file Next Test Home Previous Test


©2003-2009 MaximumCompression (lossless data compression software benchmarks)