CIPR / RPI Homepage of Amir Said


Compression of Compound Images and Video

Compound images are those like the example below, which contains a combination of natural images (photos), text, and graphics. This type of image occurs in many important applications, like document imaging and printing [1-6]. Compound video is the equivalent for moving images, and is commonly found in applications like remote visualization [5].
The standard image and video compression methods that use linear transforms (e.g., discrete cosine or wavelet) effectively exploit the properties of the natural images and of the human visual system, and are able to change the natural images considerably, while keeping an excellent visual quality. The images containing text and graphics, on the other hand, have edges that are harder to represent with the linear transforms, and consequently many noticeable artifacts tend to appear around edges.
What motivates the development of special compression for compound content is the severity of this problem in the graphics and text regions. The complex structure of the many text edges produces an accumulation of artifacts around text, and the problem is made worse by the fact that the visual quality expectations for text are very different from those for natural images. People expect text to be crisp, and easily perceive how artifacts reduce legibility and reading speed, and lower the overall document quality.
What seems to be the obvious solution to the problem is to use high-level graphics languages. In reference [5] we explain why, for a variety of practical reasons, in remote visualization applications the current graphics languages are de facto much less useful than expected, and compound image and video compression is a better solution.
It also seems that we can use the large variety of general image and video segmentation methods for this application. However, the sophisticated segmentation methods have a computational complexity much higher than that needed for this problem. The segmentation needed for compression can be much simpler [1,5,6], with a slight increase in bit rate, but still maintaining good visual quality.
An alternative to segmentation for compression is the decomposition ot he image in layers, as in the MRC format and standards [2,4].
For applications in which the image must fit a memory buffer, an interesting problem is how to guarantee, in a single pass, a minimum compression ratio [3].

References

[1] Amir Said and Alexander I. Drukarev,
"Simplified segmentation for compound image compression," Proc. IEEE Int. Conf. Image Processing, vol. 1, pp. 24-28, Oct. 1999. (PDF)
[2] Debargha Mukherjee, Nasir Memon, and Amir Said,
"JPEG-Matched MRC compression of compound documents," Proc. IEEE Int. Conf. Image Processing, vol. 3, pp. 434-437, Oct. 2001. (PDF)
[3] Debargha Mukherjee, Christos Chrysafis, and Amir Said,
"Low complexity guaranteed fit compound document compression," Proc. IEEE Int. Conf. Image Processing, vol. 1, pp. 225-228, Sept. 2002. (PDF)
[4] Debargha Mukherjee, Christos Chrysafis, and Amir Said,
"JPEG2000-Matched MRC compression of compound documents," Proc. IEEE Int. Conf. Image Processing, vol. 3, pp. 73-76, Sept. 2002. (PDF)
[5] Amir Said,
"Compression of compound images and video for enabling rich media in embedded systems," SPIE/IS&T Visual Communications and Image Processing Conference, SPIE Proc. vol. 5308, pp. 69-82, Jan. 2004. (Also HP Labs Report HPL-2004-89). (PDF)
[6] Amir Said,
"Efficient and reliable dynamic quality control for compression of compound document images," Proc. IEEE Int. Conf. Image Processing, Oct. 2004. (PDF)
The material downloadable from the links above is made avaiable to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders (IEEE, SPIE, and Hewlett-Packard Co.). All persons copying this information are expected to adhere to the terms and constraints invoked by each copyright holder. These works should not be reposted without the explicit permission of the copyright holder.


Site Index

Bio Page
Journal Page
Arithmetic and Huffman Coding
Structured Scalable Metaformats
Virtual Environment Design Automation
Set Partitioning in Hierarchical Trees
SPIE/IS&T Conf. on Image and Video Communications and Processing
Other Links