Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 49 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Number of different compressors/archivers listed in this test: 224


Pos. Name Compressor Best switches combination Compressed Compress Bits per
  Size Ratio Byte
  (bytes) (%) (b/B)
001PAQ8PX-738603290.51 0.7593
002WinRK 3.1.2MAX (PWCM)39370490.32 0.7744
003LPAQ8841198089.87 0.8103
004PAQAR 4.5-843567489.29 0.8569
005NanoZip 0.07a-cc -m640m44368289.09 0.8727
006ZPAQ 1.09max.cfg 244992388.94 0.8849
007Ocamyd 1.66test1-m8 -s045000488.94 0.8851
008CMM4 0.2b2645597388.79 0.8968
009SLIM 0.23d-o8 -m91246369288.60 0.9120
010BIT 0.7-p=547240288.39 0.9291
011DURILCA 0.5-o547378488.35 0.9319
012PPMonstr J rev.1-o547747288.26 0.9391
013CCM 1.30cCCMx -648072188.18 0.9455
014COMPRESSIA 1.0bBS15 SE MC48625688.05 0.9564
015EPM r9c00439039300586566063231 -m91249166387.91 0.9670
016ASH 07/o9 /m91249900587.73 0.9815
017ENC 0.15ag -o554094086.70 1.0639
018GRZipII 0.2.4-m3 L10 -l -a54823186.52 1.0783
019FreeARC 0.60-m=grzip:d:a:l:m354846186.52 1.0787
020HOOK 1.46054961286.49 1.0810
021PIMPLE2(none)55900686.26 1.0995
022M1 0.3btext profile56071786.21 1.1028
023HIPP 0.5819/o556226586.18 1.1059
024PPMY SSE (9A9)/o5 /m91256472586.12 1.1107
025RINGS 1.61056635286.08 1.1139
026TarsaLZP 21Aug2007(none)57799685.79 1.1368
027SZIP 1.12-b41o457858485.78 1.1380
028UHARC 0.6b-mx -md-58810285.54 1.1567
029TC 5.2 dev2(none)58885685.52 1.1582
030RKC 1.02-M912m -o4 -ft -mx59068285.48 1.1618
031BZP 0.3(none)59123585.46 1.1629
032BEE 0.7.9-m3 -d659366985.40 1.1677
033GRZIP 0.7.3-a -f59710985.32 1.1744
034RZM 0.07h(none)61139984.97 1.2025
035Quark 0.95r-l962122384.73 1.2218
036NNTCP664443284.16 1.2675
037SR3ac264987684.02 1.2782
038FlashZIP 0.99b5-m2 -c771429382.44 1.4049
039CTW 0.1-n16M -d772701382.13 1.4299
040LZXQ 0.4normal73664781.89 1.4489
041BruteCM 0.1d(none)73843881.85 1.4524
042UFA 0.04b1-m5 -mu475185681.52 1.4788
043777 0.04b1-m5 -mu475185681.52 1.4788
044BALZ 1.15ex76655481.15 1.5077
045PIM 2.90(none)76657481.15 1.5077
046SQUEEZ 5.63-ux77010381.07 1.5147
047WinTurtle 1.6.0(none)77272281.00 1.5198
0487-Zip 9.10-m0=ppmd:mem=499807b:o=477908580.85 1.5323
049PPMN 1.00b1 km-O4 -M:2577923480.84 1.5326
050PPMVC 1.2-o4 -m1 -u77954780.83 1.5332
051PPMd rev J-o4 -m178104880.80 1.5362
052QC 0.050-078211080.77 1.5383
053WinRAR 3.91-ep -m5 -mdE -mc4:1t+78618380.67 1.5463
054ICEOWS 4.20bVery less78705480.65 1.5480
055CTXf 0.75 b1-mf79172180.54 1.5572
056Ultra7z Opt 0.05(none)79211380.53 1.5580
057QAZAR 0.0pre5-x0 -l079213780.52 1.5580
058STUFFIT 14PPM L4 M28 no opt79215080.52 1.5580
059LZTurbo 0.95-5980359780.24 1.5805
060PPMX 0.05(none)81034980.08 1.5938
061TURTLE 0.07(none)82108779.81 1.6149
062Quad 1.12x82801879.64 1.6286
063WINZIP 14Best Method84575379.21 1.6635
064X1 0.95aam#84679979.18 1.6655
065ARHANGEL 1.40-mc1600085123979.07 1.6743
066SYMBRA 0.2-m0 -c4 -p285306079.03 1.6778
067LZPX(J) 1.2h-885797578.91 1.6875
068HA 0.999ba2187087078.59 1.7129
069LGHA 1.1g-287087078.59 1.7129
070CODEC 3.21-c1087203078.56 1.7151
071QUANTUM 0.97-c7 -t1688559778.23 1.7418
072LZPM 0.16ex88575978.22 1.7421
073RK 1.04.1-mx1 -M52 -B2000088650478.20 1.7436
074KZIP 14-APR-2007/b29089068078.10 1.7518
075CABARC 1.00.0106-m LZX:2189093878.10 1.7523
076ShipInBottle 1.0 b17alg:ppm len:489114578.09 1.7527
077RKUC 1.04-x -o1689142878.08 1.7533
078BSSC 0.95a-et89584577.98 1.7620
079BIX 1.00b7-mdD90040577.86 1.7710
080WinHKI 1.74HKI2 Fastest92564177.24 1.8206
081LZAP 0.20.0b(none)94900376.67 1.8665
082BigCrunch 0.4a1(none)94971776.65 1.8679
083ACB 2.00cu96713176.22 1.9022
084RKIVE 1.92-mt2 -mm196783676.21 1.9036
085THOR 0.96e496977276.16 1.9074
086BJWFLATE 1.54-s51298520275.78 1.9377
087PSA 0.91a-o24 -m1152K99601775.51 1.9590
088PPMZ2 0.81-t99678575.49 1.9605
089ARI 2.2-t9100115175.39 1.9691
090HAP 3.06(none)100377675.32 1.9743
091SRANK 1.0c7100530775.28 1.9773
092LZDS v2.1-s1 -m5100551175.28 1.9777
093WinACE 2.69Max 32Kb100983175.17 1.9862
094ACE 2.6-m5 -d32100983175.17 1.9862
095WINIMP 1.21M1, Block 200,SUS 16 Mb101139175.13 1.9892
096DST 0.91b-1101558775.03 1.9975
097UHBC 1.0-m3 -d -b9k101708674.99 2.0004
098BioArc 1.9Fast Standard101764574.98 2.0015
099Blizzard 0.24b10000102046074.91 2.0071
100DARK 0.51-b10k102739374.74 2.0207
101LZ2ASzd8102913574.70 2.0241
102LHA32 1.88.3.14-e0 -je32768102953974.69 2.0249
103BVI 1.70-m4103052074.66 2.0269
104ARQ 3.2(none)103061874.66 2.0271
105IMP 1.12-1 -m3103105774.65 2.0279
106BOA 0.58b-m1103117674.65 2.0282
107LHARK 0.4d-ta1 -c5103233974.62 2.0304
108SEMONE 0.6-mf103440274.57 2.0345
109DACT 0.8.41-b11000103944674.44 2.0444
110DeepFreezer 1.06(none)103959174.44 2.0447
111AKT 0.62b(none)104031774.42 2.0461
112AR 1.0(none)104164374.39 2.0487
113SAR 1.0(none)104164374.39 2.0487
114LHA 2.55(none)104164374.39 2.0487
115PUT 3.47(none)104168774.39 2.0488
116ZOO 2.1ah104177574.39 2.0490
117CPAC 1.35+S format=binary104194774.38 2.0493
118PKZIP 2.50(none)104373274.34 2.0529
119PAC 17apr2004comp1104475774.31 2.0549
120LIMIT 1.2-ms104549274.30 2.0563
121Tornado 0.4a-h8104591874.29 2.0572
122HIT 2.10(none)104680874.26 2.0589
123BMA 1.35b-m8k104821574.23 2.0617
124Windows XP built-in(none)104823074.23 2.0617
125DZIP 2.90-5104856674.22 2.0624
126QLFC 6.6w12288104863374.22 2.0625
127GZIP 1.3.5-5104925074.20 2.0637
128ZIP 2.2-5104934574.20 2.0639
129WIN-GZ 1.2(None)104945574.20 2.0641
130vuZIP 1.8Fastest104955274.20 2.0643
131File2Pack 2.0(none)104957774.20 2.0643
132EAZEL 1.0(normal)104958974.20 2.0644
133LHA 2.67-e0104973874.19 2.0647
134 BCArchive 1.08.7(none)104987274.19 2.0649
135WINZIP 8.0(Max Compress)105058574.17 2.0663
136BSA 2.00-+0105211874.13 2.0693
137LZA 1.01(none)105321374.11 2.0715
138DC 0.99.307b-b23 -fb105361174.10 2.0723
139ARJ 2.85-jm -e -jh21000105423774.08 2.0735
140ASD 0.2.0-m1 -mda105539274.05 2.0758
141ESP 1.92/M2105753174.00 2.0800
142AIN 2.32/m2105754874.00 2.0800
143CSC 3.1-m0 -d1105782573.99 2.0806
144ZET 0.10b-ex105903173.96 2.0829
145CHILE 0.5b16106110773.91 2.0870
146UC II v3.05b-TT106180873.89 2.0884
147QUARK 1.00b/p106198673.89 2.0888
148DCA 1.0.1bFaster106632173.78 2.0973
149AMG 2.2Max compression106913873.71 2.1028
150BWTZIP9000106968373.70 2.1039
151AKT 1.00a3(none)107096373.67 2.1064
152SLUG X(none)107143273.66 2.1073
153SQUEEZE 1.08.4/p1 /q4 /m2107632773.54 2.1170
154MAR-g107991473.45 2.1240
155ZZIP 0.36c-mx -k1108302073.37 2.1301
156BA 1.01-1108474273.33 2.1335
157PACKET 0.91a-m6 -s0108635173.29 2.1367
158M03BS=512Kb109022973.20 2.1443
159EXTREME 1.06-t1109177773.16 2.1474
160SQWEZ 2.3/s109288673.13 2.1495
161ALZip 7.0Normal110219872.90 2.1678
162ZPack(none)111071672.69 2.1846
163Archiver 1.0Dict=2M111515272.58 2.1933
164BZIP 0.21-1112161372.42 2.2060
165Chaos Comp 3.0(none)112442272.36 2.2116
166RAX 1.02-m3112450372.35 2.2117
167BCM 0.10-b1112684172.30 2.2163
168BZIP2 1.0.5-1112895872.24 2.2205
169RZIP 2.1-1113015472.21 2.2228
170BWMonstr 0.02(none)113440272.11 2.2312
171ZAP32 0.15.0b(none)114866071.76 2.2592
172M99 2.2.1-m -1m115325971.65 2.2683
173GCA 0.9k(none)115417771.62 2.2701
174MNZIP0115515171.60 2.2720
175AI 1.1-m2115943571.49 2.2804
176DCGA b8(none)115995271.48 2.2814
177YBS 0.03f-m2m -r116092071.46 2.2833
178LZHUF(none)116415771.38 2.2897
179ABCOMP 2.06(none)116420571.38 2.2898
180ARX 1.0(none)116427171.38 2.2899
181ELI 5750(none)116428871.38 2.2900
182DATAC 1.03-f116429071.38 2.2900
183YAC 1.02(none)116683871.31 2.2950
184SBC 0.970 rev3-b1116816871.28 2.2976
185HYPER 2.5(none)116849571.27 2.2982
186BBB ver1(none)117048971.22 2.3022
187SQUISH 1.0(none)117182171.19 2.3048
188BICOM 1.01(none)117738671.05 2.3157
189PPMT 0.1(none)117760671.05 2.3162
190JAR 1.02-m3117917771.01 2.3193
191ABC 2.4-cv118118070.96 2.3232
192CA-ZIP 3.4-a118963670.75 2.3398
193BWIC(none)119458270.63 2.3496
194OrangeArchiver 1.05(none)119532170.61 2.3510
195ERI 5.1fre(none)120092870.47 2.3620
196DGCA 1.10(none)120260870.43 2.3653
197TRANSFORM 1.02Very Low121285270.18 2.3855
198EXP 1.0(none)122176369.96 2.4030
199PAR 2.00(none)122185669.96 2.4032
20012Ghosts 7.0(none)126016269.02 2.4785
201JCALG1 5.32-2126324268.94 2.4846
202RDMC 0.06c(none)126414268.92 2.4864
203BAR 1.1.2(none)126973068.78 2.4974
204HiP beta 11126979568.78 2.4975
205aPLib 0.43(none)127255068.71 2.5029
206XPA 1.0.2(none)128699868.36 2.5313
207BriefLZ 1.04(none)137653366.16 2.7074
208Secura 1.7(none)140457265.47 2.7626
209CODER 1.1-e7 4194304142830164.88 2.8092
210Zhuff 0.2(none)143747764.66 2.8273
211LZC 0.081144132564.56 2.8349
212QuickLZ 1.40b9mode3144623564.44 2.8445
213QPress 0.38b-L3145293464.28 2.8577
214LZOP 1.02rc1-9146026864.10 2.8721
215HuffComp 1.3(none)149250663.31 2.9355
216DLC 0.6.1(none)152100462.61 2.9916
217LCSSR 0.2(none)167291258.87 3.2903
218LZRW1(none)168018958.69 3.3047
219LZP2 0.7d(none)168313158.62 3.3104
220LZ 1.0(none)168428558.59 3.3127
221LCW 0.2(none)171511057.83 3.3733
222Shindlet(none)211841647.92 4.1666
223SHcodec 1.0.1(none)221259345.60 4.3518
224LZBW1 0.8(none)268198734.06 5.2750
225english.dic40674390.00 8.0000

Show historic data

Lossless data compression ratio's of the best and some well know compression programs for an alphabetically sorted word-list file Next Test Home Previous Test


©2003-2009 MaximumCompression (lossless data compression software benchmarks)