Alphabetically sorted Dictionary compression test


File type : Alphabetically sorted English word-list (350,000 words)
# of files to compress in this test : 1
Total File Size (bytes) : 4,067,439
Sample of data :

galvanomagnetism
galvanometer
galvanometers
galvanometric
galvanometrical
galvanometrically
galvanometry
galvanoplastic
galvanoplastical
galvanoplastically
galvanoplastics
galvanoplasty
galvanopsychic
galvanopuncture

Conclusion: This test first puzzled me a bit. I would expect compression ratio of this alphabetically sorted English wordlist to be much better then normal text compression as similar words are already grouped and punctuation is totally absent. But the best program (PAQ8) 'only' achieves a compression ratio of 90% (versus 89% in text compression). Guess this can be explained by the fact that there isn't a single repeating word in the file.

Similar to text compression, the difference between the programs is relatively big. The #1 program PAQ8P 'out-compresses' #4 with not less then 49 KB (!), the gap to #6 is a huge 65 KB. The top 8 programs all compress to less then half the size of the resulting WinZip 8.0 archive. WinRK and PAQ8 are again the big winners here.

Note: The #4 of the text compression test (RKC) is not even in the top 28 in this test!.

Fatal error: Uncaught Error: Call to undefined function mysql_connect() in /var/www/vhosts/maximumcompression.com/httpdocs/data/connect.php:3 Stack trace: #0 /var/www/vhosts/maximumcompression.com/httpdocs/data/fill_table.php(2): include() #1 /var/www/vhosts/maximumcompression.com/httpdocs/data/dict.php(56): include('/var/www/vhosts...') #2 {main} thrown in /var/www/vhosts/maximumcompression.com/httpdocs/data/connect.php on line 3