RT Journal Article
JF 2008 Data Compression Conference
TI Improving HTML Compression
A1 Przemyslaw Skibinski
AB In this work, we describe a lossless HTML transform that, combined with the commonly used LZ77 and PPM compression algorithms, attains high compression ratios. Its core is a fully reversible transform featuring substitution of words in an HTML document using a static or semi-static dictionary, together with effective encoding of dictionary indices and numbers. Test results show that the proposed transform improves the HTML compression efficiency of general-purpose compressors on average by 17% for Deflate and 8% for PPMVC.
PB IEEE Computer Society, [URL:http://www.computer.org]