Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 49 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Number of different compressors/archivers listed in this test: 228


Current Version Previous Version
Pos. Name Compressor Best switch combination Size
(bytes)
Ratio
(%)
Bits per Byte (b/B) Name Size
(bytes)
Delta
(bytes)
001PAQ8PX-738603290.51 0.7593 PAQ8P385620-412
002WinRK 3.1.2MAX (PWCM)39370490.32 0.7744 WinRK 3.0.3392161-1543
003LPAQ8841198089.87 0.8103 LPAQ742502413044
004PAQAR 4.5-843567489.29 0.8569 PAQAR 4.1434827-847
005NanoZip 0.08a-cc -m640m44424389.08 0.8738 NanoZip 0.07a443682-561
006ZPAQ 1.10max.cfg 244992388.94 0.8849 ZPAQ 1.00450658735
007Ocamyd 1.66test1-m8 -s045000488.94 0.8851 Ocamyd 1.66 final449263-741
008CMM4 0.2b2645597388.79 0.8968 CMM4 0.1e453648-2325
009SLIM 0.23d-o8 -m91246369288.60 0.9120 SLIM 00224648491157
010BIT 0.7-p=547240288.39 0.9291 BIT 0.3579376106974
011DURILCA 0.5-o547378488.35 0.9319 DURILCA 0.4b4804376653
012PPMonstr J rev.1-o547747288.26 0.9391 PPMonstr Ir150043722965
013CCM 1.30cCCMx -648072188.18 0.9455 CCM 1.26b475539-5182
014COMPRESSIA 1.0bBS15 SE MC48625688.05 0.9564    
015EPM r9c00439039300586566063231 -m91249166387.91 0.9670 EPM r850735115688
016ASH 07/o9 /m91249900587.73 0.9815 ASH 06a49904944
017ENC 0.15ag -o554094086.70 1.0639    
018BSC 2.2.5-m1 -b12 -cp54759586.54 1.0770 BSC 1.0.256585618261
019GRZipII 0.2.4-m3 L10 -l -a54823186.52 1.0783    
020FreeARC 0.666-m=grzip:d:a:l:m354846286.52 1.0787 FreeARC 0.60548461-1
021HOOK 1.46054961286.49 1.0810 HOOK 1.35539154303
022PIMPLE2(none)55900686.26 1.0995 PIMPLE 1.43b58271123705
023M1 0.3btext profile56071786.21 1.1028 M1 0.1a58328222565
024HIPP 0.5819/o556226586.18 1.1059    
025PPMY SSE (9A9)/o5 /m91256472586.12 1.1107    
026RINGS 1.61056635286.08 1.1139 RINGS 1.5c566326-26
027TarsaLZP 21Aug2007(none)57799685.79 1.1368 TarsaLZP 07Aug2007578865869
028SZIP 1.12-b41o457858485.78 1.1380 SZIP 1.05f572928-5656
029UHARC 0.6b-mx -md-58810285.54 1.1567 UHARC 0.55913203218
030TC 5.2 dev2(none)58885685.52 1.1582 TC 5.1 dev7825510236654
031RKC 1.02-M912m -o4 -ft -mx59068285.48 1.1618    
032BZP 0.3(none)59123585.46 1.1629    
033BEE 0.7.9-m3 -d659366985.40 1.1677 BEE 0.7.7b943817350148
034GRZIP 0.7.3-a -f59710985.32 1.1744    
035RZM 0.07h(none)61139984.97 1.2025 RZM 0.0467384662447
036Quark 0.95r-l962122384.73 1.2218 Quark 0.9369499373770
037NNTCP664443284.16 1.2675    
038SR3ac264987684.02 1.2782 SR371260162725
039CTW 0.1-n16M -d772701382.13 1.4299    
040LZXQ 0.4normal73664781.89 1.4489 LZXQ 0.1703997-32650
041BruteCM 0.1d(none)73843881.85 1.4524 BruteCM 0.1a875257136819
042FlashZIP 0.99b8-m3 -c774961281.57 1.4744 FlashZIP 0.94714266-35346
043UFA 0.04b1-m5 -mu475185681.52 1.4788    
044777 0.04b1-m5 -mu475185681.52 1.4788    
045BALZ 1.15ex76655481.15 1.5077 BALZ 1.13739207-27347
046PIM 2.90(none)76657481.15 1.5077 PIM 2.101095615329041
047SQUEEZ 5.63-ux77010381.07 1.5147 SQUEEZ 4.2894252124149
048WinTurtle 1.6.0(none)77272281.00 1.5198 WinTurtle 1.3.07748042082
0497-Zip 9.15-m0=ppmd:mem=488k:o=477920080.84 1.5326 7-Zip 9.10779085-115
050PPMN 1.00b1 km-O4 -M:2577923480.84 1.5326    
051PPMVC 1.2-o4 -m1 -u77954780.83 1.5332 PPMVC 1.17795470
052PPMd rev J-o4 -m178104880.80 1.5362 PPMd I rev 17853134265
053QC 0.050-078211080.77 1.5383 QC 0.03386188579775
054WinRAR 3.91-ep -m5 -mdE -mc4:1t+78618380.67 1.5463 WinRAR 3.627861830
055ICEOWS 4.20bVery less78705480.65 1.5480    
056YZX 0.04(none)79054480.56 1.5549    
057CTXf 0.75 b1-mf79172180.54 1.5572    
058Ultra7z Opt 0.05(none)79211380.53 1.5580    
059QAZAR 0.0pre5-x0 -l079213780.52 1.5580 QAZAR 0.0pre4d88165989522
060STUFFIT 14PPM L4 M28 no opt79215080.52 1.5580 STUFFIT 12.0785464-6686
061LZTurbo 0.95-5980359780.24 1.5805 LZTurbo 0.92687644-115953
062PPMX 0.05(none)81034980.08 1.5938    
063TURTLE 0.07(none)82108779.81 1.6149    
064Quad 1.12x82801879.64 1.6286 Quad 1.07b823666-4352
065WINZIP 14Best Method84575379.21 1.6635 WINZIP 12846310557
066X1 0.95aam#84679979.18 1.6655 X1 0.94h8467990
067ARHANGEL 1.40-mc1600085123979.07 1.6743    
068SYMBRA 0.2-m0 -c4 -p285306079.03 1.6778    
069LZPX(J) 1.2h-885797578.91 1.6875 LZPX(J) 1.2g87466616691
070HA 0.999ba2187087078.59 1.7129    
071LGHA 1.1g-287087078.59 1.7129    
072CODEC 3.21-c1087203078.56 1.7151    
073QUANTUM 0.97-c7 -t1688559778.23 1.7418 QUANTUM 0.968895513954
074LZPM 0.16ex88575978.22 1.7421 LZPM 0.1591182326064
075RK 1.04.1-mx1 -M52 -B2000088650478.20 1.7436    
076KZIP 14-APR-2007/b29089068078.10 1.7518 KZIP 11-OCT-200689073050
077CABARC 1.00.0106-m LZX:2189093878.10 1.7523    
078ShipInBottle 1.0 b17alg:ppm len:489114578.09 1.7527 ShipInBottle 1.0 b16891143-2
079RKUC 1.04-x -o1689142878.08 1.7533    
080BSSC 0.95a-et89584577.98 1.7620 BSSC 0.93a8989733128
081BIX 1.00b7-mdD90040577.86 1.7710    
082WinHKI 1.74HKI2 Fastest92564177.24 1.8206 WinHKI 1.3g9256410
083LZAP 0.20.0b(none)94900376.67 1.8665    
084BigCrunch 0.4a1(none)94971776.65 1.8679    
085ACB 2.00cu96713176.22 1.9022    
086RKIVE 1.92-mt2 -mm196783676.21 1.9036    
087THOR 0.96e496977276.16 1.9074 THOR 0.959697720
088BJWFLATE 1.54-s51298520275.78 1.9377    
089PSA 0.91a-o24 -m1152K99601775.51 1.9590    
090PPMZ2 0.81-t99678575.49 1.9605 PPMZ2 0.8997574789
091ARI 2.2-t9100115175.39 1.9691    
092HAP 3.06(none)100377675.32 1.9743    
093SRANK 1.0c7100530775.28 1.9773    
094LZDS v2.1-s1 -m5100551175.28 1.9777    
095ACE 2.6-m5 -d32100983175.17 1.9862 ACE 2.0410098310
096WinACE 2.69Max 32Kb100983175.17 1.9862 WINACE 2.6b110098310
097WINIMP 1.21M1, Block 200,SUS 16 Mb101139175.13 1.9892    
098DST 0.91b-1101558775.03 1.9975    
099UHBC 1.0-m3 -d -b9k101708674.99 2.0004    
100BioArc 1.9Fast Standard101764574.98 2.0015    
101Blizzard 0.24b10000102046074.91 2.0071    
102DARK 0.51-b10k102739374.74 2.0207 DARK 0.50c1027345-48
103LZ2ASzd8102913574.70 2.0241    
104LHA32 1.88.3.14-e0 -je32768102953974.69 2.0249    
105CSC 3.2a6-m2 -dk32 -fo102989974.68 2.0256 CSC 3.1105782527926
106BVI 1.70-m4103052074.66 2.0269    
107ARQ 3.2(none)103061874.66 2.0271    
108IMP 1.12-1 -m3103105774.65 2.0279    
109BOA 0.58b-m1103117674.65 2.0282    
110LHARK 0.4d-ta1 -c5103233974.62 2.0304    
111SEMONE 0.6-mf103440274.57 2.0345    
112DACT 0.8.41-b11000103944674.44 2.0444    
113DeepFreezer 1.06(none)103959174.44 2.0447    
114AKT 0.62b(none)104031774.42 2.0461    
115AR 1.0(none)104164374.39 2.0487    
116SAR 1.0(none)104164374.39 2.0487    
117LHA 2.55(none)104164374.39 2.0487    
118PUT 3.47(none)104168774.39 2.0488    
119ZOO 2.1ah104177574.39 2.0490    
120CPAC 1.35+S format=binary104194774.38 2.0493    
121PKZIP 2.50(none)104373274.34 2.0529 PKZIP 2.0610453031571
122PAC 17apr2004comp1104475774.31 2.0549    
123LIMIT 1.2-ms104549274.30 2.0563    
124Tornado 0.4a-h8104591874.29 2.0572 Tornado 0.410459180
125HIT 2.10(none)104680874.26 2.0589    
126BMA 1.35b-m8k104821574.23 2.0617 BMA 1.34b1048827612
127Windows XP built-in(none)104823074.23 2.0617    
128DZIP 2.90-5104856674.22 2.0624    
129QLFC 6.6w12288104863374.22 2.0625    
130GZIP 1.3.5-5104925074.20 2.0637 GZIP 1.2.41046946-2304
131ZIP 2.2-5104934574.20 2.0639    
132WIN-GZ 1.2(None)104945574.20 2.0641    
133vuZIP 1.8Fastest104955274.20 2.0643    
134File2Pack 2.0(none)104957774.20 2.0643    
135EAZEL 1.0(normal)104958974.20 2.0644    
136LHA 2.67-e0104973874.19 2.0647    
137 BCArchive 1.08.7(none)104987274.19 2.0649 BCArchive 1.00b10498720
138WINZIP 8.0(Max Compress)105058574.17 2.0663    
139BSA 2.00-+0105211874.13 2.0693    
140LZA 1.01(none)105321374.11 2.0715    
141DC 0.99.307b-b23 -fb105361174.10 2.0723    
142ARJ 2.85-jm -e -jh21000105423774.08 2.0735 ARJ 2.82b11054217-20
143ASD 0.2.0-m1 -mda105539274.05 2.0758 ASD 0.1.51429408374016
144ESP 1.92/M2105753174.00 2.0800 ESP 1.910575310
145AIN 2.32/m2105754874.00 2.0800    
146ZET 0.10b-ex105903173.96 2.0829    
147CHILE 0.5b16106110773.91 2.0870 CHILE 0.3d107230111194
148UC II v3.05b-TT106180873.89 2.0884    
149QUARK 1.00b/p106198673.89 2.0888    
150DCA 1.0.1bFaster106632173.78 2.0973    
151AMG 2.2Max compression106913873.71 2.1028    
152BWTZIP9000106968373.70 2.1039    
153AKT 1.00a3(none)107096373.67 2.1064    
154SLUG X(none)107143273.66 2.1073 SLUG 1.27b111239440962
155SQUEEZE 1.08.4/p1 /q4 /m2107632773.54 2.1170    
156MAR-g107991473.45 2.1240    
157ZZIP 0.36c-mx -k1108302073.37 2.1301    
158BA 1.01-1108474273.33 2.1335    
159PACKET 0.91a-m6 -s0108635173.29 2.1367 PACKET 0.90b10882571906
160M03BS=512Kb109022973.20 2.1443    
161EXTREME 1.06-t1109177773.16 2.1474    
162SQWEZ 2.3/s109288673.13 2.1495    
163ALZip 7.0Normal110219872.90 2.1678 ALZip 6.3211021980
164ZPack(none)111071672.69 2.1846    
165Archiver 1.0Dict=2M111515272.58 2.1933    
166BZIP 0.21-1112161372.42 2.2060    
167Chaos Comp 3.0(none)112442272.36 2.2116    
168RAX 1.02-m3112450372.35 2.2117    
169BCM 0.11-b1112496972.34 2.2126 BCM 0.1011268411872
170BZIP2 1.0.5-1112895872.24 2.2205 BZIP2 1.0.311289580
171RZIP 2.1-1113015472.21 2.2228 RZIP 2.0113023985
172BWMonstr 0.02(none)113440272.11 2.2312 BWMonstr 0.0111423057903
173ZAP32 0.15.0b(none)114866071.76 2.2592    
174M99 2.2.1-m -1m115325971.65 2.2683 M99 2.11153258-1
175GCA 0.9k(none)115417771.62 2.2701    
176MNZIP0115515171.60 2.2720    
177AI 1.1-m2115943571.49 2.2804    
178DCGA b8(none)115995271.48 2.2814    
179YBS 0.03f-m2m -r116092071.46 2.2833    
180LZHUF(none)116415771.38 2.2897    
181ABCOMP 2.06(none)116420571.38 2.2898    
182ARX 1.0(none)116427171.38 2.2899    
183ELI 5750(none)116428871.38 2.2900    
184DATAC 1.03-f116429071.38 2.2900    
185YAC 1.02(none)116683871.31 2.2950    
186SBC 0.970 rev3-b1116816871.28 2.2976    
187HYPER 2.5(none)116849571.27 2.2982    
188BBB ver1(none)117048971.22 2.3022    
189SQUISH 1.0(none)117182171.19 2.3048    
190BICOM 1.01(none)117738671.05 2.3157    
191PPMT 0.1(none)117760671.05 2.3162    
192JAR 1.02-m3117917771.01 2.3193    
193ABC 2.4-cv118118070.96 2.3232    
194CA-ZIP 3.4-a118963670.75 2.3398    
195BWIC(none)119458270.63 2.3496    
196OrangeArchiver 1.05(none)119532170.61 2.3510    
197ERI 5.1fre(none)120092870.47 2.3620    
198DGCA 1.10(none)120260870.43 2.3653 DGCA 1.0812026080
199TRANSFORM 1.02Very Low121285270.18 2.3855    
200EXP 1.0(none)122176369.96 2.4030    
201PAR 2.00(none)122185669.96 2.4032    
20212Ghosts 7.0(none)126016269.02 2.4785    
203JCALG1 5.32-2126324268.94 2.4846    
204RDMC 0.06c(none)126414268.92 2.4864    
205BAR 1.1.2(none)126973068.78 2.4974    
206HiP beta 11126979568.78 2.4975    
207aPLib 0.43(none)127255068.71 2.5029    
208XPA 1.0.2(none)128699868.36 2.5313    
209Etincelle RC2(none)135236366.75 2.6599 Etincelle beta41352691328
210BriefLZ 1.04(none)137653366.16 2.7074    
211ULZ 0.0.2c6139959765.59 2.7528    
212Secura 1.7(none)140457265.47 2.7626    
213CODER 1.1-e7 4194304142830164.88 2.8092    
214Zhuff 0.2(none)143747764.66 2.8273    
215LZC 0.081144132564.56 2.8349 LZC 0.061424160-17165
216QuickLZ 1.40b9mode3144623564.44 2.8445 QuickLZ 1.40b61446099-136
217QPress 0.38b-L3145293464.28 2.8577 QPress 0.35b14529340
218LZOP 1.02rc1-9146026864.10 2.8721    
219HuffComp 1.3(none)149250663.31 2.9355    
220DLC 0.6.1(none)152100462.61 2.9916    
221LCSSR 0.2(none)167291258.87 3.2903    
222LZRW1(none)168018958.69 3.3047    
223LZP2 0.7d(none)168313158.62 3.3104 LZP22303227620096
224LZ 1.0(none)168428558.59 3.3127    
225LCW 0.2(none)171511057.83 3.3733    
226Shindlet(none)211841647.92 4.1666    
227SHcodec 1.0.1(none)221259345.60 4.3518    
228LZBW1 0.8(none)268198734.06 5.2750    
229english.dic40674390.00 8.0000    

Hide historic data

Lossless data compression ratio's of the best and some well know compression programs for an alphabetically sorted word-list file Next Test Home Previous Test


©2003-2009 MaximumCompression (lossless data compression software benchmarks)