WO2002032147A1 - Scalable coding of multi-media objects - Google Patents

Publication number
WO2002032147A1
Authority
WO
WIPO (PCT)
Prior art keywords
bit
stream
quality
quality information
given
Prior art date
Application number
PCT/EP2001/011565
Other languages
French (fr)
Inventor
Renatus J. Van Der Vleuten
Mihaela Van Der Schaar
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V.
Priority to AU2002220595A (AU2002220595A1)
Priority to JP2002535403A (JP4063666B2)
Priority to EP01986831A (EP1327360A1)
Priority to BR0107307-9A (BR0107307A)
Publication of WO2002032147A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25 Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266 Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662 Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/34 Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/115 Selection of the code volume for a coding unit prior to coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132 Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154 Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • H04N19/29 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding involving scalability at the object level, e.g. video object layer [VOL]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness

Definitions

  • the invention relates to coding multi-media objects.
  • Scalable compression e.g. fine-granularity scalable compression of objects such as multi-media objects has the useful feature that the encoded bit stream may be truncated at a given point, while the remaining stream can still be decoded (although at a lower object quality).
  • a standard of such a scalable coding i.e. MPEG-4 Fine Granularity Scalability, is currently being defined, see ISO/IEC 14496-2 / AMD 4, document ISO/IEC JTC1/SC29/WG11 N3315, March 2000 (further called N3315), which is incorporated by reference herein.
  • a further scalable coding method is described in non pre-published European Patent Application 00201037.9, filed 2000.03.23 (our reference PHNL000153), which is also incorporated by reference herein.
  • bit stream consisted of a few large layers, i.e. a base layer and e.g. one or two enhancement layers.
  • Such scalability is defined e.g. in the JPEG standard (hierarchical coding) as well as in the MPEG2 standard (SNR scalability, spatial scalability, temporal scalability).
  • An object of the invention is to provide advantageous coding.
  • the invention provides coding of a multi-media object to obtain a bit-stream, controlling a bit-stream, transmitting a bit-stream, receiving a bit-stream, a multiplexer or network node, a (scalable) bit-stream representing a multi-media object, a storage medium, a computer program, and a signal carrying a computer program as defined in the independent claims.
  • Advantageous embodiments are defined in the dependent claims.
  • a multi-media object is coded to obtain a bit-stream, and quality information is added to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream.
  • the bit stream syntax is enhanced by adding quality (distortion) information. This can be done at no or a negligible increase in bit rate and extends the range of applications for several coding schemes.
  • the multi-media object may be an audio and/or video object or any other reproducible object for which a quality is relevant.
  • the multi-media object may also be a picture or a sequence of pictures such as a program.
  • the coding is a scalable coding and the resulting bit-stream is a scalable bit-stream.
  • quality information is advantageous because these bit-streams are suitable for truncating.
  • the quality information represents object reproduction quality.
  • Information on object reproduction quality versus number of bits is then easily determined.
  • SNR: signal-to-noise ratio
  • PSNR: peak signal-to-noise ratio
  • the encoded stream of a single object may be truncated optimally by just fitting it to the available bandwidth/storage, this is not the case when simultaneously dealing with multiple objects.
  • quality tags added to the scalable bit-stream represent the quality of the reproduction of the encoded object when the bit-stream is truncated at a point related to a given tag.
  • this overhead can be kept small.
  • An important advantage is that the quality information makes it easy to jointly optimally truncate the bit-streams of multiple objects. Such a multiple truncation problem occurs for example in an elastic memory as described in non pre-published European Patent Application 00200890.2, filed 2000.03.13 (our reference PHNL000110), which is incorporated by reference herein.
  • Another application is a multiplexer or a network node in which the outgoing bandwidth is temporarily lower than the incoming bandwidth and consequently the incoming scalably compressed bit-streams need to be truncated.
  • the input data is usually compressed in multiple units (such as e.g. parts of DCT blocks, parts of frequency bands of a wavelet transformed image or layers).
  • Each coded part usually contains some headers with various parameters or tags.
  • a parameter is added indicating the quality of the object when it is truncated just after (or alternatively just before) the current encoded data part.
  • a quality parameter is to add a number related to the mean squared error (or PSNR or SNR) of the reproduction; the number might also represent a visually weighted (P)SNR.
  • the type (or multiple types) of quality indication might be standardized, so all encoders will use the same or a limited number of different quality indicators.
  • the quality could also be relative (for example a percentage), so an encoder would not have to disclose its quality measure.
  • the relative quality might then range from 0 to 100% of a certain scale for each individual object, with an additional scale/weight parameter for each object to enable different weighting of the various objects.
  • the quality tags may be placed at approximately equal distances (number of bits) throughout the encoded stream or they may be used more frequently when the bit rate versus quality curve is quickly changing.
  • the quality for each truncation point not corresponding to a tag location can be approximated by interpolation (linear or more complex) of the quality tag values.
  • the quality information is added to the encoded bit stream of MPEG-4 FGS using a tag that has already been defined in the standard, see the above-mentioned reference N3315.
  • the quality information may be used for adaptive post-processing or for scalable video processing algorithms etc for non-scalable compression methods.
  • the quality information can help to determine the 'strength' or amount or type (blocking artifacts/ ringing reduction) of post-processing required.
  • the quality information can help to better estimate the number of CPU cycles required to achieve a certain desired processing quality level using a certain selected video processing algorithm.
  • the quality information may be added as side information to the bit-stream, i.e. not included in the bit-stream itself.
  • the quality information is unencrypted.
  • the quality of a given part of the bit-stream (e.g. layer) can then be determined in a decoder without decrypting the bit-stream.
  • Quality information can also be advantageously applied for applications in which source coding and channel coding are not carried out at the same time or location.
  • the quality information is then used in the channel coding, e.g. to determine the protection rates.
  • Fig. 1 shows a system according to an embodiment of the invention
  • FIG. 2 shows more advantageous embodiments of the invention.
  • the drawings only show those elements that are necessary to understand the invention.
  • Fig. 1 shows a system according to an embodiment of the invention, the system comprising a transmitter 11 having an input unit or object generation unit 110 and an encoder 12.
  • the encoder 12 comprises a scalable encoder 120 and a quality information generation unit 121.
  • the scalable encoder 120 codes objects obtained from the input unit 110 to provide one or more scalable bit-streams.
  • the quality information generation unit 121 extracts the object quality from the signals obtained from the input unit 110 as well as the signals and/or parameters provided by the encoder 120.
  • the quality information from the generation unit 121 is provided to the encoder unit 120, which generates the quality information tags and inserts them in the scalable bit-stream.
  • the system further comprises a truncator 3 for truncating the one or more scalable bit-streams, and a truncator control unit 4.
  • the truncator control unit 4 extracts quality information from the scalable bit-stream provided by the encoder 12 and controls the truncator 3 in dependence on the received quality information/tags. In the case of only one scalable bit-stream, the scalable bit-stream is truncated when the desired quality has been reached.
  • Truncator 3 and control unit 4 together may constitute part of a multiplexer, bit-rate control unit, network node, etc. and may be present in a channel, but also in a receiver.
  • Unit 5 may alternatively be a reproduction unit and/or decoder, e.g. being present together with truncator 3 and control unit 4 in a receiver according to an embodiment of the invention.
  • multiple scalable bit-streams are provided by transmitters 21, 31, 41, wherein at least some of the multiple scalable bit-streams have quality tags included in them.
  • the transmitters 21,31,41 and their components are similar to transmitter 11 shown in FIG. 1.
  • the scalable bit-streams are more or less truncated, under dependence of the quality information/tags that are present in the scalable bit-streams.
  • Such a multiple truncation can be done using the principle of elastic memory described in non pre-published European Patent Application 00200890.2, filed 2000.03.13 (our reference PHNL000110), which is incorporated by reference herein.
  • Multiplexer 16 combines the streams from the transmitters.
  • Truncator 13 and control unit 14 together may constitute part of a multiplexer (e.g. 16), bit-rate control unit, network node, etc. and may be present in a channel, but also in a receiver.
  • Unit 15 may alternatively be a reproduction unit and/or decoder, e.g. being present together with truncator 13 and control unit 14 in a receiver according to an embodiment of the invention.
  • the distortion is a significant parameter for the MPEG-4 FGS scheme. If distortion information is not available, the usability of FGS is limited, as is demonstrated below by giving various applications that do need this information.
  • the FGS bit stream syntax is enhanced by adding quality (distortion) information. This can be done at no or a negligible increase in bit rate and extends the range of applications for FGS.
  • a first application of the invention is the coding for a constant-quality (thus variable bit rate) output.
  • This can be used, for example, for recording video data with constant quality on a storage medium that allows for a variable bit rate.
  • with the quality information, the final bit stream does not need to be produced during the initial encoding; it can be obtained by processing the encoded bit stream at a later time.
  • Selling the same content at different qualities can be efficiently implemented using a scalable (fine granularity or layered) compression method such as described above followed by encryption of one or more of the layers: a property of many scalable compression methods is that when the lowest scalability layer is not available, the higher scalability layers are useless, i.e. cannot be used to increase the quality.
  • the scalably compressed content is encrypted, it is still possible to use it for elastic storage, i.e. to reduce the amount of storage space by throwing away some of the enhancement layer(s).
  • For elastic storage, reference is made to PHNL000110 as mentioned before. To decide how much data to remove, some information about the associated quality loss should be available, since this information can no longer be derived from the compressed bit stream without decrypting it.
  • the quality information is sent as unencrypted information, e.g. as side information.
  • the business model of selling the same content at different quality levels is closely related to elastic storage, since there too the same content is stored at multiple quality levels using scalable compression.
  • the quality levels that are offered for purchase to the consumer preferably directly correspond to the quality levels used in the elastic storage system. This implies that when the elastic storage device wants to lower the quality of a certain content item, it can remove the highest encrypted quality layer, without needing to decrypt it. Since the device does thus not decrypt any data, there is no security or theft risk.
  • the content is preferably compressed (using a scalable compression method) and encrypted at the desired quality levels by the content owner and then distributed in encrypted form to the elastic storage device (either directly transmitted or downloaded or indirectly via e.g. intermediate storage on an optical disk).
  • the user may optionally select a certain desired minimum quality level. I.e. content that is currently available at a higher quality level than minimally desired by the user may be reduced in quality, to make room for more different content, until it reaches the lowest quality acceptable to the user. When the content is still available in a higher quality, however, the user still has the option to purchase the higher quality.
  • the user may also preset different desired minimum quality levels for different types of content (like sports, talk shows, or movies).
  • it is also possible to let the service provider manage the storage space and determine which quality levels should be removed (in that case, the service provider keeps track of the qualities and may carry out the elastic storage functions).
  • a storage function e.g. hard disk
  • the quality information also allows source encoding and channel coding to be carried out at a different time or location. This is useful because at the time of encoding the channel characteristics may not yet be known. Also, the same encoded bit stream may serve different channels with different error characteristics. Finally, no storage space is wasted for storing the error correction overhead. It can be generated when needed, since the quality fields give the required information for adding the channel coding (using unequal error protection). Then there are applications where multiple encoded FGS frames have to be jointly processed. This can occur, for example, in a congested network node, where temporarily less bandwidth is available. The network node can then use the quality information to optimally truncate the bit streams with the minimal loss of quality. Because multiple objects are involved, with possibly very different rate-distortion curves, the truncation cannot be satisfactorily done without the quality information.
  • the quality information can provide the server with a good tool to perform the rate-control at transmission time and also the tradeoff between SNR and temporal enhancements (FGS versus FGST, see for definitions N3315).
  • Solution A: The start of a bit plane is a good point for adding quality information/fields, because it makes the information easy to retrieve and also provides sufficient samples to accurately describe the rate-distortion curve.
  • the "quality code" would be similar to the current fgs_bp_start_code, whose last 5 bits indicate the ID of the bit plane. Instead of the bit plane ID, the quality information can be inserted in these 5 bits.
  • a bit is added to the syntax to indicate whether the fgs_bp_start_code contains the bit plane ID or the quality information.
  • Solution B: Alternatively, a new code could be defined for the quality information and be inserted after the fgs_bp_start_code. In this case, the quality tag can have 8 bits.
  • the information we propose to store in the quality field is the distortion after completely decoding the bit plane following the quality field.
  • the approximate quality may be obtained by interpolation. This is easier than the extrapolation that would be required if the quality field were to contain information about the distortion before decoding the current bit plane.
  • let Q1 be the quality before decoding a bit plane
  • a more accurate approximation can be made by also taking into account the number of decoded DCT blocks. For example, if the enhancement information for the current bit plane has been received for n out of the N total blocks for a frame, the true quality can be approximated as Q ≈ Q1 + (Q2 - Q1) * n/N, where Q2 is the quality after completely decoding the bit plane.
  • a first quality field for an enhancement VOP i.e. the field for the most significant (MSB) bit plane
  • MSB most significant
  • additional fields contain quality improvements (distortion reductions) relative to the previous quality.
  • the absolute quality can be used to compare different objects. Putting quality improvements in the additional fields allows these improvements to be represented with a higher accuracy than when absolute qualities would be used. This is particularly important when only 5 bits are available for each field.
  • PSNR values are used.
  • the first quality field contains the absolute quality.
  • This range covers all practically relevant PSNR values: when the PSNR is above 49 dB, the base layer already contains a near-lossless representation of the object.
  • a PSNR that is lower than 18 dB would mean the base layer provides an extremely low quality, which is not very likely.
  • if values outside the range do occur, they will be clipped to either 18 or 49 dB, depending on whether they fall below or above the allowed range.
  • the next quality fields will then contain the improvement in quality for completely decoding the following bit plane, relative to the quality after decoding the previous bit plane.
  • the 5 bits are used for giving these quality improvements the range of 0...6.2 dB in steps of 0.2 dB. Since a single bit is added, the improvement cannot exceed 6.02 dB, so this range is sufficient.
  • Solution B: when 8 bits are used for the quality tag, the quality values (both absolute and differential) could be represented with even finer grain. However, Solution B would also allow us to simply use only absolute (i.e. non-differential) quality values. The quality range would then be 18...60.5 dB in steps of 1/6 ≈ 0.167 dB (or 18...49.875 dB in steps of 0.125 dB).
  • the invention can also be advantageously applied for applications in which source coding and channel coding are not carried out at the same time or location.
  • the quality tags then give the required information for adding the channel coding (unequal error protection, e.g. more protection for parts of the bit-stream that represent higher quality, or more protection for parts of the bit-stream with a high quality to number of bits ratio).
  • the invention may also be advantageously applied in the context of scalable image processing schemes such as JPEG2000, see document ISO/IEC JTC 1/SC 29/WG 1 N1646, dated 16 March 2000, which is incorporated by reference herein.
  • the quality information may conveniently be included in JPEG2000, because a Comment and Extension Marker (CME) has already been defined (see page 51 of document N1646), which allows unstructured data in the header.
  • Quality information is advantageously included in a given CME.
  • a separate Rcme type is defined for quality tags.
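
The 5-bit quality fields of Solution A and the n/N approximation listed above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the 1 dB step of the absolute field is inferred from fitting the 18...49 dB range into 5 bits, and tracking the decoder-side reconstructed value while encoding is a design choice of this sketch, not something the text prescribes.

```python
ABS_MIN_DB = 18      # lower clip of the absolute PSNR field (from the text)
ABS_MAX_DB = 49      # upper clip of the absolute PSNR field (from the text)
DIFF_STEP_DB = 0.2   # step of the differential fields (from the text)
FIELD_MAX = 31       # largest value that fits in a 5-bit field


def encode_quality_fields(psnr_after_each_plane):
    """Map PSNR values (dB) to 5-bit codes: first absolute, rest differential."""
    if not psnr_after_each_plane:
        return []
    # Absolute field: assumed 1 dB steps, clipped to 18...49 dB.
    first = min(max(round(psnr_after_each_plane[0]), ABS_MIN_DB), ABS_MAX_DB)
    codes = [first - ABS_MIN_DB]
    reconstructed = float(first)  # what a decoder would compute so far
    for quality in psnr_after_each_plane[1:]:
        # Differential field: improvement in 0.2 dB steps, clipped to 0...6.2 dB.
        steps = round((quality - reconstructed) / DIFF_STEP_DB)
        steps = min(max(steps, 0), FIELD_MAX)
        codes.append(steps)
        reconstructed += steps * DIFF_STEP_DB
    return codes


def decode_quality_fields(codes):
    """Rebuild the PSNR values (dB) from the 5-bit codes."""
    qualities = []
    for index, code in enumerate(codes):
        if not 0 <= code <= FIELD_MAX:
            raise ValueError("quality code does not fit in 5 bits")
        if index == 0:
            qualities.append(float(ABS_MIN_DB + code))
        else:
            qualities.append(qualities[-1] + code * DIFF_STEP_DB)
    return qualities


def partial_plane_quality(q1, q2, n, total_blocks):
    """Approximate quality when n of total_blocks blocks of a plane arrived."""
    return q1 + (q2 - q1) * n / total_blocks
```

For example, `encode_quality_fields([30.2, 33.4, 36.0])` yields one absolute code followed by two differential codes, and `partial_plane_quality(30.0, 36.0, 25, 100)` estimates the quality when a quarter of the blocks of a bit plane have been received.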

Abstract

Coding is provided in which a multi-media object is coded to obtain a bit-stream, and quality information is added to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream. By adding quality information to the bit-stream, jointly storing or transmitting multiple coded objects can be optimized in that the quality of the object can be easily taken into account.

Description

SCALABLE CODING OF MULTI-MEDIA OBJECTS
The invention relates to coding multi-media objects.
Scalable compression, e.g. fine-granularity scalable compression of objects such as multi-media objects, has the useful feature that the encoded bit stream may be truncated at a given point, while the remaining stream can still be decoded (although at a lower object quality). A standard of such a scalable coding, i.e. MPEG-4 Fine Granularity Scalability, is currently being defined, see ISO/IEC 14496-2 / AMD 4, document ISO/IEC JTC1/SC29/WG11 N3315, March 2000 (further called N3315), which is incorporated by reference herein. A further scalable coding method is described in non pre-published European Patent Application 00201037.9, filed 2000.03.23 (our reference PHNL000153), which is also incorporated by reference herein.
The availability of such a scalable bit stream considerably simplifies system designs by practically eliminating the need for a buffer control method when fitting the encoded bit stream to a certain given bit rate or memory size. In particular, the same single bit stream simultaneously serves different channels with different capacities, without the need to re-encode the original data. Thus, real-time adaptation to varying channel capacities (with application to the Internet or wireless communication channels) is very much simplified.
Before fine granularity scalability, some forms of limited scalability already existed. There, the bit stream consisted of a few large layers, i.e. a base layer and e.g. one or two enhancement layers. Such scalability is defined e.g. in the JPEG standard (hierarchical coding) as well as in the MPEG-2 standard (SNR scalability, spatial scalability, temporal scalability).
An object of the invention is to provide advantageous coding. To this end, the invention provides coding of a multi-media object to obtain a bit-stream, controlling a bit-stream, transmitting a bit-stream, receiving a bit-stream, a multiplexer or network node, a (scalable) bit-stream representing a multi-media object, a storage medium, a computer program, and a signal carrying a computer program as defined in the independent claims. Advantageous embodiments are defined in the dependent claims.
According to a first aspect of the invention, a multi-media object is coded to obtain a bit-stream, and quality information is added to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream. By adding quality information to the bit-stream, jointly storing or transmitting multiple coded objects can be optimized in that the quality of the object can be easily taken into account. This aspect of the invention is based on the insight that it is easy to determine the rate of a compressed object, but that another important parameter, a quality measure, e.g. distortion, is not so easily determined. In fact, the distortion can only be accurately obtained at the time of coding, when the complete source information is still available. According to this aspect of the invention, the bit stream syntax is enhanced by adding quality (distortion) information. This can be done at no or a negligible increase in bit rate and extends the range of applications for several coding schemes.
The multi-media object may be an audio and/or video object or any other reproducible object for which a quality is relevant. The multi-media object may also be a picture or a sequence of pictures such as a program.
Preferably, the coding is a scalable coding and the resulting bit-stream is a scalable bit-stream. Especially for scalable coding schemes, quality information is advantageous because these bit-streams are suitable for truncating. For a lot of applications, wherein scalable bit-streams are truncated, it is important to have a quality indication of the bit-stream resulting after truncation, which is easily provided by the quality information included in the scalable bit-stream.
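
As a rough illustration of how quality information could guide truncation of a single scalable bit-stream, the Python sketch below picks the shortest tagged prefix that reaches a desired quality. The representation of the quality information as a list of (bit position, quality) pairs and the function name are assumptions made for this example only.

```python
def truncation_point(tags, target_quality):
    """Return the smallest tagged bit position whose quality meets the target.

    tags: list of (bit_position, quality) pairs, sorted by bit_position,
    with quality assumed to be non-decreasing along the stream.
    If the target is never reached, the last tagged position is returned,
    i.e. the stream is kept as far as it is tagged.
    """
    if not tags:
        raise ValueError("no quality information available")
    for position, quality in tags:
        if quality >= target_quality:
            return position
    return tags[-1][0]


# Hypothetical stream with three quality tags (positions in bits, quality in dB).
example_tags = [(1000, 28.0), (2500, 33.0), (4000, 36.5)]
cut = truncation_point(example_tags, 32.0)
```

Here the stream would be cut at the second tag, because that is the first point at which the tagged quality is at least the requested 32 dB.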
Preferably, the quality information represents object reproduction quality. Information on object reproduction quality versus number of bits is then easily determined. To quantify the quality, preferably signal-to-noise ratio (SNR) or peak signal-to-noise ratio (PSNR) values are used.
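
For reference, PSNR is commonly computed from the mean squared error between the original and the reproduced samples. The Python sketch below uses the standard definition with a peak value of 255 for 8-bit samples; the peak value and the function names are assumptions for the example, since the text does not fix them.

```python
import math


def mean_squared_error(original, reconstructed):
    """Mean squared error between two equally long sample sequences."""
    if len(original) != len(reconstructed) or not original:
        raise ValueError("sequences must be non-empty and of equal length")
    total = sum((a - b) ** 2 for a, b in zip(original, reconstructed))
    return total / len(original)


def psnr_db(original, reconstructed, peak=255):
    """Peak signal-to-noise ratio in dB; infinite for a perfect reproduction."""
    error = mean_squared_error(original, reconstructed)
    if error == 0:
        return math.inf
    return 10.0 * math.log10(peak * peak / error)
```

An encoder has the original samples available, so it can evaluate such a measure at each candidate truncation point while coding.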
Whereas the encoded stream of a single object may be truncated optimally by just fitting it to the available bandwidth/storage, this is not the case when simultaneously dealing with multiple objects. To optimally allocate a certain bandwidth or storage space to multiple objects simultaneously, one has to know the differential rate-distortion curve for each encoded object. While this curve is relatively easily obtained during the encoding of an object (when the original is available), it is non-trivial to obtain (by estimation) later, when only a truncated version of the encoded bit stream is available. Estimation of the quality at a later time requires detailed knowledge of the compression method as well as at least partial decoding of the encoded bit stream. In a practical embodiment, quality tags added to the scalable bit-stream represent the quality of the reproduction of the encoded object when the bit-stream is truncated at a point related to a given tag. Although the addition of the quality information may require a given overhead, this overhead can be kept small. An important advantage is that the quality information makes it easy to jointly optimally truncate the bit-streams of multiple objects. Such a multiple truncation problem occurs for example in an elastic memory as described in non pre-published European Patent Application 00200890.2, filed 2000.03.13 (our reference PHNL000110), which is incorporated by reference herein. Another application is a multiplexer or a network node in which the outgoing bandwidth is temporarily lower than the incoming bandwidth and consequently the incoming scalably compressed bit-streams need to be truncated.
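
One simple way to use the quality tags for joint truncation is a greedy allocation: starting from the mandatory part of each stream, repeatedly extend the stream whose next tagged segment offers the largest quality gain per bit, until the budget is exhausted. The Python sketch below is a minimal illustration of that idea and is not the method of the referenced elastic memory application. It assumes that the qualities of different objects are expressed on a comparable scale, and the greedy choice is only close to optimal when each rate-quality curve has diminishing returns. All names are hypothetical.

```python
def joint_truncate(streams, budget_bits):
    """Choose one truncation point per stream within a shared bit budget.

    streams: dict mapping a stream name to a list of (cumulative_bits, quality)
    tag points sorted by cumulative_bits; the first point of each list is
    treated as the part that must always be kept.
    Returns a dict mapping each stream name to its chosen length in bits.
    """
    cut = {name: 0 for name in streams}  # index of the chosen tag per stream
    used = sum(points[0][0] for points in streams.values())
    if used > budget_bits:
        raise ValueError("budget is too small for the mandatory parts")
    while True:
        best_name, best_slope = None, 0.0
        for name, points in streams.items():
            i = cut[name]
            if i + 1 >= len(points):
                continue  # stream is already kept in full
            extra_bits = points[i + 1][0] - points[i][0]
            gain = points[i + 1][1] - points[i][1]
            if extra_bits <= 0 or used + extra_bits > budget_bits:
                continue  # malformed segment or it does not fit any more
            slope = gain / extra_bits  # quality gain per additional bit
            if best_name is None or slope > best_slope:
                best_name, best_slope = name, slope
        if best_name is None:
            break  # nothing more fits in the budget
        i = cut[best_name]
        used += streams[best_name][i + 1][0] - streams[best_name][i][0]
        cut[best_name] = i + 1
    return {name: streams[name][i][0] for name, i in cut.items()}


# Two hypothetical objects with different rate-quality behaviour.
example_streams = {
    "a": [(100, 30.0), (200, 36.0), (300, 38.0)],
    "b": [(100, 25.0), (200, 26.0), (300, 26.5)],
}
allocation = joint_truncate(example_streams, 400)
```

In this example the extra bits all go to object "a", because each of its segments improves quality more per bit than the segments of object "b".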
When doing compression, the input data is usually compressed in multiple units (such as e.g. parts of DCT blocks, parts of frequency bands of a wavelet transformed image or layers). Each coded part usually contains some headers with various parameters or tags. In an advantageous embodiment of the invention, in such a header, a parameter is added indicating the quality of the object when it is truncated just after (or alternatively just before) the current encoded data part. One example of a quality parameter is to add a number related to the mean squared error (or PSNR or SNR) of the reproduction; the number might also represent a visually weighted (P)SNR. The type (or multiple types) of quality indication might be standardized, so all encoders will use the same or a limited number of different quality indicators. The quality could also be relative (for example a percentage), so an encoder would not have to disclose its quality measure. The relative quality might then range from 0 to 100% of a certain scale for each individual object, with an additional scale/weight parameter for each object to enable different weighting of the various objects. The quality tags may be placed at approximately equal distances (number of bits) throughout the encoded stream or they may be used more frequently when the bit rate versus quality curve is quickly changing. When an encoded bit stream has to be truncated, the quality for each truncation point not corresponding to a tag location can be approximated by interpolation (linear or more complex) of the quality tag values. In an embodiment of the invention, the quality information is added to the encoded bit stream of MPEG-4 FGS using a tag that has already been defined in the standard, see the above-mentioned reference N3315. In this way, the quality information can be added without having to change the proposed standard, which is a tremendous advantage. 
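The linear interpolation between quality tags mentioned above might look as follows in Python. This is a sketch under stated assumptions: tags are (bit position, quality) pairs with strictly increasing positions, and a truncation point outside the tagged range is simply clamped to the nearest tag value. The function name `quality_at` is hypothetical.

```python
from bisect import bisect_right
from typing import List, Tuple


def quality_at(tags: List[Tuple[int, float]], cut_bits: int) -> float:
    """Estimate the quality at a truncation point by linear interpolation."""
    positions = [position for position, _ in tags]
    if cut_bits <= positions[0]:
        return tags[0][1]  # assumption: clamp before the first tag
    if cut_bits >= positions[-1]:
        return tags[-1][1]  # assumption: clamp after the last tag
    hi = bisect_right(positions, cut_bits)  # first tag strictly after the cut
    (p0, q0), (p1, q1) = tags[hi - 1], tags[hi]
    return q0 + (q1 - q0) * (cut_bits - p0) / (p1 - p0)
```

Placing tags more densely where the curve changes quickly, as suggested above, directly reduces the error of this linear estimate.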
For decoded multi-media objects, the quality information may be used for adaptive post-processing or for scalable video processing algorithms etc for non-scalable compression methods. For example, for post-processing of MPEG-compressed video, the quality information can help to determine the 'strength' or amount or type (blocking artifacts/ ringing reduction) of post-processing required. For scalable video algorithms, the quality information can help to better estimate the number of CPU cycles required to achieve a certain desired processing quality level using a certain selected video processing algorithm. The quality information may be added as side information to the bit-stream, i.e. not included in the bit-stream itself. For encrypted bit-streams, it is advantageous that the quality information is unencrypted. The quality of a given part of the bit-stream (e.g. layer) can then be determined in a decoder without decrypting the bit-stream.
Quality information can also be advantageously applied for applications in which source coding and channel coding are not carried out at the same time or location. The quality information is then used in the channel coding, e.g. to determine the protection rates. The aforementioned and other aspects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
In the drawings: Fig. 1 shows a system according to an embodiment of the invention, and
Fig. 2 shows more advantageous embodiments of the invention. The drawings only show those elements that are necessary to understand the invention.
Fig. 1 shows a system according to an embodiment of the invention, the system comprising a transmitter 11 having an input unit or object generation unit 110 and an encoder 12. The encoder 12 comprises a scalable encoder 120 and a quality information generation unit 121. The scalable encoder 120 codes objects obtained from the input unit 110 to provide one or more scalable bit-streams. The quality information generation unit 121 extracts the object quality from the signals obtained from the input unit 110 as well as the signals and/or parameters provided by the encoder 120. The quality information from the generation unit 121 is provided to the encoder unit 120, which generates the quality information tags and inserts them in the scalable bit-stream. The system further comprises a truncator 3 for truncating the one or more scalable bit-streams, and a truncator control unit 4. The truncator control unit 4 extracts quality information from the scalable bit-stream provided by the encoder 12 and controls the truncator 3 in dependence on the received quality information/tags. In the case of only one scalable bit-stream, the scalable bit-stream is truncated when the desired quality has been reached. Truncator 3 and control unit 4 together may constitute part of a multiplexer, bit-rate control unit, network node, etc. and may be present in a channel, but also in a receiver. Unit 5 may alternatively be a reproduction unit and/or decoder, e.g. being present together with truncator 3 and control unit 4 in a receiver according to an embodiment of the invention.
In more advantageous applications, as shown in Fig. 2, multiple scalable bit-streams are provided by transmitters 21, 31, 41, wherein at least some of the multiple scalable bit-streams have quality tags included in them. The transmitters 21, 31, 41 and their components are similar to transmitter 11 shown in Fig. 1. Depending on the available bandwidth or storage capacity on a channel or storage medium 15, the scalable bit-streams are truncated to a greater or lesser extent, in dependence on the quality information/tags that are present in the scalable bit-streams. Such a multiple truncation can be done using the principle of elastic memory described in non pre-published European Patent Application 00200890.2, filed 2000.03.13 (our reference PHNL000110), which is incorporated by reference herein. Multiplexer 16 combines the streams from the transmitters. Truncator 13 and control unit 14 together may constitute part of a multiplexer (e.g. 16), bit-rate control unit, network node, etc. and may be present in a channel, but also in a receiver. Unit 15 may alternatively be a reproduction unit and/or decoder, e.g. being present together with truncator 13 and control unit 14 in a receiver according to an embodiment of the invention.
In the following, some examples of applications of MPEG-4 FGS that need the quality information are given. Although the following is addressed in particular to MPEG-4 FGS, it will be clear to a person skilled in the art that the invention can be advantageously applied to any scalable coding scheme. From an application point of view, the distortion is a significant parameter for the MPEG-4 FGS scheme. If distortion information is not available, the usability of FGS is limited, as is demonstrated below by giving various applications that do need this information. According to an embodiment of the invention, the FGS bit stream syntax is enhanced by adding quality (distortion) information. This can be done at no or a negligible increase in bit rate and extends the range of applications for FGS.
A first application of the invention is the coding for a constant-quality (thus variable bit rate) output. This can be used, for example, for recording video data with constant quality on a storage medium that allows for a variable bit rate. Using the quality information, the final bit stream does not need to be produced during the initial encoding but it can be obtained by processing the encoded bit stream at a later time.
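As an illustration of the constant-quality application, the following Python sketch post-processes an already encoded sequence: for each frame it selects the earliest tag whose quality reaches a target. The data layout (one list of (bit position, quality) tags per frame) and the function names are assumptions made for this example.

```python
from typing import List, Tuple

Tag = Tuple[int, float]  # (bit position of the tag, quality in dB at that point)


def truncation_point(tags: List[Tag], target_db: float) -> int:
    """Return the first tagged position whose quality meets the target.

    If the target is never reached, the whole frame is kept.
    """
    for position, quality in tags:
        if quality >= target_db:
            return position
    return tags[-1][0]


def constant_quality(frames: List[List[Tag]], target_db: float) -> List[int]:
    """Per-frame truncation points giving roughly constant quality."""
    return [truncation_point(tags, target_db) for tags in frames]
```

The resulting bit count varies from frame to frame, which is the variable bit rate behaviour described above.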
Selling the same content at different qualities can be efficiently implemented using a scalable (fine granularity or layered) compression method such as described above followed by encryption of one or more of the layers: a property of many scalable compression methods is that when the lowest scalability layer is not available, the higher scalability layers are useless, i.e. cannot be used to increase the quality. When the scalably compressed content is encrypted, it is still possible to use it for elastic storage, i.e. to reduce the amount of storage space by throwing away some of the enhancement layer(s). For elastic storage, reference is made to PHNL000110 as mentioned before. To decide how much data to remove, some information about the associated quality loss should be available, since this information can no longer be derived from the compressed bit stream without decrypting it. In the current embodiment of the invention, the quality information is sent as unencrypted information, e.g. as side information. The business model of selling the same content at different quality levels is closely related to elastic storage, since there too the same content is stored at multiple quality levels using scalable compression. The quality levels that are offered for purchase to the consumer preferably directly correspond to the quality levels used in the elastic storage system. This implies that when the elastic storage device wants to lower the quality of a certain content item, it can remove the highest encrypted quality layer, without needing to decrypt it. Since the device does thus not decrypt any data, there is no security or theft risk.
To maintain security in the whole chain from content owner or service provider to consumer, the content is preferably compressed (using a scalable compression method) and encrypted at the desired quality levels by the content owner and then distributed in encrypted form to the elastic storage device (either directly transmitted or downloaded or indirectly via e.g. intermediate storage on an optical disk).
In an elastic storage application, the user (or the device based on what it knows about the preferences of the user) may optionally select a certain desired minimum quality level. I.e. content that is currently available at a higher quality level than minimally desired by the user may be reduced in quality, to make room for more different content, until it reaches the lowest quality acceptable to the user. When the content is still available in a higher quality, however, the user still has the option to purchase the higher quality. Of course the user (or device) may also preset different desired minimum quality levels for different types of content (like sports, talk shows, or movies). Alternatively, it is also possible to let the service provider manage the storage space and determine which quality levels should be removed (in that case, the service provider keeps track of the qualities and may carry out the elastic storage functions). This could be useful e.g. when content is put on a set-top box containing a storage function (e.g. hard disk) by the service provider. Initially, the content could be offered to the user at a high quality. When the user does not watch/buy the content within a certain time, the quality level stored on the set-top box could be lowered to make room for different content.
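The elastic storage behaviour described above can be sketched in Python as follows. The sketch assumes that each stored item exposes, as unencrypted side information, the size of each layer and the quality reached once that layer is included, plus a user-selected minimum quality. The selection rule (drop the layer that costs the least quality per byte freed) and all names are assumptions of this example, not part of the source.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Item:
    name: str
    # (layer size in bytes, quality in dB once this layer is included);
    # index 0 is the base layer, later entries are enhancement layers.
    layers: List[Tuple[int, float]]
    min_quality: float  # lowest quality the user accepts for this item


def free_space(items: List[Item], bytes_needed: int) -> int:
    """Drop top enhancement layers until enough space is freed.

    Only the quality side information is consulted; the (possibly encrypted)
    layer payloads are never inspected. Returns the number of bytes freed.
    """
    freed = 0
    while freed < bytes_needed:
        best, best_cost = None, None
        for item in items:
            if len(item.layers) < 2:
                continue  # never drop the base layer
            size, quality = item.layers[-1]
            remaining_quality = item.layers[-2][1]
            if remaining_quality < item.min_quality:
                continue  # would fall below the user's minimum
            cost = (quality - remaining_quality) / size  # dB lost per byte
            if best is None or cost < best_cost:
                best, best_cost = item, cost
        if best is None:
            break  # nothing more can be removed
        size, _ = best.layers.pop()
        freed += size
    return freed
```

A service-provider-managed variant would run the same loop remotely and send only the resulting removal decisions to the set-top box.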
The quality information also allows source encoding and channel coding to be carried out at a different time or location. This is useful because at the time of encoding the channel characteristics may not yet be known. Also, the same encoded bit stream may serve different channels with different error characteristics. Finally, no storage space is wasted for storing the error correction overhead. It can be generated when needed, since the quality fields give the required information for adding the channel coding (using unequal error protection). Then there are applications where multiple encoded FGS frames have to be jointly processed. This can occur, for example, in a congested network node, where temporarily less bandwidth is available. The network node can then use the quality information to optimally truncate the bit streams with the minimal loss of quality. Because multiple objects are involved, with possibly very different rate-distortion curves, the truncation cannot be satisfactorily done without the quality information.
Additionally, for streaming applications the quality information can provide the server with a good tool to perform the rate-control at transmission time and also the tradeoff between SNR and temporal enhancements (FGS versus FGST; for definitions see N3315).
In MPEG-4 FGS, the quality information that is needed is actually the rate-distortion curve for the scalable enhancement layer. Since the rate is obvious, only the distortion information has to be added. Two solutions are proposed that allow adding this information with no or minimal modifications of the current FGS bit stream syntax.
Solution A. The start of a bit plane is a good point for adding quality information/fields, because it allows the information to be retrieved easily and also provides sufficient samples to accurately describe the rate-distortion curve. The "quality code" would be similar to the current fgs_bp_start_code, whose last 5 bits indicate the ID of the bit plane. Instead of the bit plane ID, the quality information can be inserted in these 5 bits. In a preferred embodiment, a bit is added to the syntax to indicate whether the fgs_bp_start_code contains the bit plane ID or the quality information.
Solution B. Alternatively, a new code could be defined for the quality information and be inserted after the fgs_bp_start_code. In this case, the quality tag can have 8 bits.
The information we propose to store in the quality field is the distortion after completely decoding the bit plane following the quality field. In this way, when the stream is truncated inside a bit plane, the approximate quality may be obtained by interpolation. This is easier than the extrapolation that would be required if the quality field were to contain information about the distortion before decoding the current bit plane. For example, let Q1 be the quality before decoding a bit plane and Q2 the quality after decoding it (as proposed above, Q2 is known to the decoder already at the start of the bit plane). If the stream is truncated inside the bit plane, it is thus known that the true quality Q lies inside the interval <Q1,Q2>. It can therefore simply be approximated as Q = (Q1+Q2)/2. A more accurate approximation can be made by also taking into account the number of decoded DCT blocks. For example, if the enhancement information for the current bit plane has been received for n out of the N total blocks for a frame, the true quality can be approximated as Q ≈ Q1 + (Q2 - Q1) * n/N.
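The two approximations for a truncation inside a bit plane can be written out as a short Python sketch. The function names are hypothetical; Q1, Q2, n and N have the meaning given in the text.

```python
def midpoint_quality(q1: float, q2: float) -> float:
    """Simple estimate: the middle of the interval <Q1,Q2>."""
    return (q1 + q2) / 2


def block_weighted_quality(q1: float, q2: float, n: int, total: int) -> float:
    """Refined estimate: Q1 + (Q2 - Q1) * n / N.

    n is the number of blocks for which the current bit plane was received,
    total is the number of blocks N in the frame.
    """
    if total <= 0 or not 0 <= n <= total:
        raise ValueError("need 0 <= n <= N with N > 0")
    return q1 + (q2 - q1) * n / total
```

For instance, with Q1 = 30 dB, Q2 = 34 dB and a quarter of the blocks received, the refined estimate is 31 dB, whereas the midpoint estimate is 32 dB regardless of where the truncation falls.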
In a preferred embodiment for Solution A, a first quality field for an enhancement VOP, i.e. the field for the most significant (MSB) bit plane, contains an absolute quality (distortion), whereas the additional fields contain quality improvements (distortion reductions) relative to the previous quality. The absolute quality can be used to compare different objects. Putting quality improvements in the additional fields allows these improvements to be represented with a higher accuracy than if absolute qualities were used. This is particularly important when only 5 bits are available for each field.
To quantify the quality, preferably PSNR values are used. As stated above, the first quality field contains the absolute quality. We propose to use the 5 bits to give the PSNR after decoding the first (MSB) bit plane, with a range of 18...49 dB in steps of 1 dB. This range covers all practically relevant PSNR values: when the PSNR is above 49 dB, the base layer already contains a near-lossless representation of the object. A PSNR that is lower than 18 dB would mean the base layer provides an extremely low quality, which is not very likely. When values outside the range do occur, they will be clipped to either 18 or 49 dB, depending on whether they fall below or above the allowed range.
The next quality fields will then contain the improvement in quality for completely decoding the following bit plane, relative to the quality after decoding the previous bit plane. Preferably, the 5 bits are used for giving these quality improvements the range of 0...6.2 dB in steps of 0.2 dB. Since a single bit is added, the improvement cannot exceed 6.02 dB, so this range is sufficient.
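The 5-bit fields of Solution A can be illustrated with the following Python sketch: one absolute field (18...49 dB in 1 dB steps, clipped) followed by differential fields (0...6.2 dB in 0.2 dB steps). Computing each difference against the previously reconstructed value, so that rounding errors do not accumulate, is a design choice of this sketch rather than something stated in the text; all function names are hypothetical.

```python
from typing import List

ABS_MIN_DB = 18.0  # lowest representable absolute PSNR
DELTA_STEP_DB = 0.2  # resolution of the differential fields
MAX_CODE = 31  # largest value of a 5-bit field


def encode_absolute(psnr_db: float) -> int:
    """Absolute PSNR -> 5-bit code, 1 dB steps, clipped to 18...49 dB."""
    return min(MAX_CODE, max(0, round(psnr_db - ABS_MIN_DB)))


def decode_absolute(code: int) -> float:
    return ABS_MIN_DB + code


def encode_delta(gain_db: float) -> int:
    """PSNR improvement -> 5-bit code, 0.2 dB steps, clipped to 0...6.2 dB."""
    return min(MAX_CODE, max(0, round(gain_db / DELTA_STEP_DB)))


def decode_delta(code: int) -> float:
    return DELTA_STEP_DB * code


def encode_fields(psnrs_db: List[float]) -> List[int]:
    """Codes for successive bit planes, most significant plane first."""
    codes = [encode_absolute(psnrs_db[0])]
    reconstructed = decode_absolute(codes[0])
    for psnr in psnrs_db[1:]:
        code = encode_delta(psnr - reconstructed)
        codes.append(code)
        reconstructed += decode_delta(code)
    return codes


def decode_fields(codes: List[int]) -> List[float]:
    """Recover the PSNR after each bit plane from the field codes."""
    values = [decode_absolute(codes[0])]
    for code in codes[1:]:
        values.append(values[-1] + decode_delta(code))
    return values
```

With 32 code values, the differential range ends at 31 × 0.2 = 6.2 dB, which covers the stated maximum improvement of 6.02 dB per bit plane.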
For Solution B, when 8 bits are used for the quality tag, the quality values (both absolute and differential) could be represented with even finer grain. However, Solution B would also allow us to simply use only absolute (i.e. non-differential) quality values. The quality range would then be 18...60.5 dB in steps of 1/6 ≈ 0.167 dB (or 18...49.875 dB in steps of 0.125 dB).
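The two 8-bit absolute ranges quoted for Solution B follow from 255 steps above 18 dB, as this small Python sketch shows. The function names and the clipping of out-of-range values (by analogy with the 5-bit case) are assumptions of the example.

```python
def encode_q8(psnr_db: float, step_db: float = 1 / 6) -> int:
    """Absolute PSNR -> 8-bit code starting at 18 dB, clipped to 0...255."""
    code = round((psnr_db - 18.0) / step_db)
    return min(255, max(0, code))


def decode_q8(code: int, step_db: float = 1 / 6) -> float:
    """8-bit code -> absolute PSNR in dB."""
    return 18.0 + code * step_db
```

Here `decode_q8(255)` gives the upper limit of about 60.5 dB for the 1/6 dB step, and `decode_q8(255, 0.125)` gives 49.875 dB for the 0.125 dB step.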
Various applications of MPEG-4 FGS that need quality information have been discussed above. Since this information is only completely available when the original encoding takes place, it is added to the bit stream to make it available for later use. This can be done at no or a negligible increase in bit rate with minimal modifications of the current bit stream syntax. Two detailed solutions have been presented for adding the PSNR quality values. Solution B using absolute quality values is preferred. The invention applies to all cases where multiple scalably compressed multi-media objects have to be jointly stored or transmitted and some of these objects have been compressed by MPEG-4 FGS incorporating the invention. Particular applications are the elastic memory applications as well as transmission channels or networks dealing with multiple objects/users. When the memory/channel/network has to be shared by few objects/users they get a high quality. The quality is automatically reduced to accommodate more objects/users. This can be done efficiently, i.e. with low overhead, because of the presence of the quality tags according to embodiments of this invention.
The invention can also be advantageously applied for applications in which source coding and channel coding are not carried out at the same time or location. The quality tags then give the required information for adding the channel coding (unequal error protection, e.g. more protection for parts of the bit-stream that represent higher quality, or more protection for parts of the bit-stream with a high quality to number of bits ratio).
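One way to derive unequal error protection from the quality tags is sketched below in Python. Parts of the bit-stream are ranked by their ratio of quality gain to number of bits, and the highest-ranked parts receive the strongest (lowest-rate) channel code. The code rates 1/2, 2/3 and 3/4 and the function name are purely illustrative assumptions; the source does not specify any particular channel code.

```python
from typing import List, Sequence, Tuple


def assign_protection(
    parts: List[Tuple[int, float]],
    code_rates: Sequence[float] = (1 / 2, 2 / 3, 3 / 4),
) -> List[float]:
    """Assign a channel code rate to each part of a bit-stream.

    parts: (size in bits, quality gain in dB) per part, in stream order.
    code_rates: available rates, strongest protection first (hypothetical).
    Returns one code rate per part, in the original stream order.
    """
    # Rank parts by quality gained per bit, most valuable first.
    order = sorted(
        range(len(parts)),
        key=lambda i: parts[i][1] / parts[i][0],
        reverse=True,
    )
    rates = [code_rates[-1]] * len(parts)
    for rank, index in enumerate(order):
        # Spread the ranks evenly over the available protection levels.
        level = min(rank * len(code_rates) // len(parts), len(code_rates) - 1)
        rates[index] = code_rates[level]
    return rates
```

Since the ranking uses only the tags, the channel coder can be placed at a different time or location from the source encoder, as described above.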
The invention may also be advantageously applied in the context of scalable image processing schemes such as JPEG2000, see document ISO/IEC JTC 1/SC 29/WG 1 N1646, dated 16 March 2000, which is incorporated by reference herein. The quality information may conveniently be included in JPEG2000, because a Comment and Extension Marker (CME) has already been defined (see page 51 of document N1646), which allows unstructured data in the header. Quality information is advantageously included in a given CME. E.g. binary data can be included (Rcme=0). Further, according to an embodiment of the invention, a separate Rcme type is defined for quality tags.
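A comment marker segment carrying quality tags as binary data (Rcme=0) might be assembled as in the following Python sketch. Several details are assumptions: the marker code 0xFF64 and the length convention are taken from the comment marker segment of the published JPEG 2000 Part 1 standard and are assumed to match the CME of the cited draft, and the payload layout (a 4-byte offset followed by a 1-byte quality code per tag) is invented purely for illustration.

```python
import struct
from typing import List, Tuple

COMMENT_MARKER = 0xFF64  # assumed to correspond to the CME marker of the draft
RCME_BINARY = 0  # registration value for binary data, as mentioned in the text


def build_quality_comment(tags: List[Tuple[int, int]]) -> bytes:
    """Build a comment marker segment holding quality tags.

    tags: (byte offset in the codestream, quality code 0...255) pairs.
    The payload layout is hypothetical.
    """
    payload = b"".join(struct.pack(">IB", offset, code) for offset, code in tags)
    # Segment length counts the length field, the Rcme field and the payload,
    # but not the two marker bytes themselves.
    length = 2 + 2 + len(payload)
    return struct.pack(">HHH", COMMENT_MARKER, length, RCME_BINARY) + payload
```

A dedicated Rcme type for quality tags, as proposed above, would replace `RCME_BINARY` with a newly assigned value so that readers can recognise the payload without guessing.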
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word 'comprising' does not exclude the presence of other elements or steps than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.

Claims

CLAIMS:
1. A method of coding a multi-media object, the method comprising the steps of: coding the object to obtain a bit-stream, and adding quality information to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream.
2. A method as claimed in claim 1, wherein the coding step is a scalable coding step to obtain a scalable bit-stream.
3. A method as claimed in claim 1 or 2, wherein the quality information relates to an object reproduction quality.
4. A method as claimed in claim 3, wherein the quality information is based on a signal to noise ratio value.
5. A method as claimed in any of the preceding claims, wherein quality tags are added at given locations in the bit-stream, indicating a quality of the object when the bit-stream is truncated just after (or alternatively just before) the given location in the bit-stream.
6. A method as claimed in claim 1, wherein the quality information is incorporated in existing fields of a given scalable coding standard.
7. A method as claimed in claim 2, wherein the scalable bit-stream includes several layers and wherein respective layers include respective quality information.
8. A method as claimed in claim 1, wherein the bit-stream is encrypted and the quality information is unencrypted.
9. A method of controlling at least one bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream, the method comprising the steps of: receiving the at least one bit-stream, extracting the quality information from the bit-stream, transcoding or truncating the at least one bit-stream in the case a desired combination of bit-rate and quality of the at least one bit-stream differs from a current combination of bit-rate and quality of the at least one received bit-stream, providing the at least one bit-stream at the desired combination of bit-rate and quality.
10. A method of transmitting at least one multi-media object, the method comprising the steps of: coding the object to obtain a bit-stream, adding quality information to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream, and transmitting the bit-stream in which the quality information has been added.
11. A method of receiving at least one bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream, the method comprising the steps of: extracting the quality information from the bit-stream, transcoding or truncating the at least one bit-stream in the case a desired combination of bit-rate and quality of the at least one bit-stream differs from a current combination of bit-rate and quality of the at least one received bit-stream, providing the at least one bit-stream at the desired combination of bit-rate and quality, and decoding the at least one bit-stream at the desired combination of bit-rate and quality.
12. A method of receiving at least one bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream, the method comprising the steps of: extracting the quality information from the bit-stream; decoding the bit-stream to obtain a decoded multi-media object; and processing the multi-media object in dependence on the extracted quality.
13. A device of coding a multi-media object, the device comprising: means for coding the object to obtain a bit-stream, and means for adding quality information to the bit-stream, which quality information indicates a quality of the object in relation to a given position in (or a given part of) the bit-stream.
14. A transmitter comprising a device as claimed in claim 13.
15. A controller for controlling at least one bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream, the controller comprising: means for receiving the at least one bit-stream, means for extracting the quality information from the bit-stream, means for truncating the at least one bit-stream in the case a desired combination of bit-rate and quality of the at least one bit-stream differs from a current combination of bit-rate and quality of the at least one received bit-stream, means for providing the at least one bit-stream at the desired combination of bit-rate and quality.
16. A receiver comprising a controller as claimed in claim 15.
17. A receiver for receiving at least one bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream, the receiver comprising: means for extracting the quality information from the bit-stream; means for decoding the bit-stream to obtain a decoded multi-media object; and means for processing the multi-media object in dependence on the extracted quality.
18. A multiplexer or network node comprising a controller as claimed in claim 15.
19. A bit-stream representing a multi-media object in which bit-stream quality information has been added, the quality information indicating a quality of the object in relation to a given position in (or a given part of) the bit-stream.
20. A storage medium on which a signal as claimed in claim 19 has been stored.
PCT/EP2001/011565 2000-10-11 2001-10-04 Scalable coding of multi-media objects WO2002032147A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU2002220595A AU2002220595A1 (en) 2000-10-11 2001-10-04 Scalable coding of multi-media objects
JP2002535403A JP4063666B2 (en) 2000-10-11 2001-10-04 Scalable coding of multimedia objects
EP01986831A EP1327360A1 (en) 2000-10-11 2001-10-04 Scalable coding of multi-media objects
BR0107307-9A BR0107307A (en) 2000-10-11 2001-10-04 Methods to encode a multimedia object, to control at least one bit stream, to transmit at least one multimedia object, and to receive at least one bit stream, device to encode a multimedia objects, transmitter, controller, receiver, multiplexer or network node, bit stream, and storage medium

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US23934500P 2000-10-11 2000-10-11
US60/239,345 2000-10-11
US23965900P 2000-10-12 2000-10-12
US60/239,659 2000-10-12

Publications (1)

Publication Number Publication Date
WO2002032147A1 (en) 2002-04-18

Family

ID=26932482

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2001/011565 WO2002032147A1 (en) 2000-10-11 2001-10-04 Scalable coding of multi-media objects

Country Status (10)

Country Link
US (2) US20020076043A1 (en)
EP (1) EP1327360A1 (en)
JP (1) JP4063666B2 (en)
KR (1) KR100887165B1 (en)
CN (1) CN100499817C (en)
AU (1) AU2002220595A1 (en)
BR (1) BR0107307A (en)
PL (1) PL356718A1 (en)
RU (1) RU2284671C2 (en)
WO (1) WO2002032147A1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1351506A2 (en) * 2002-03-27 2003-10-08 Eastman Kodak Company Encoding rate-distortion information to facilitate optimal transcoding of a compressed digital image
JP2005538651A (en) * 2002-09-10 2005-12-15 トムソン ライセンシング Video on demand server system and method
EP1665799A1 (en) * 2003-08-26 2006-06-07 Samsung Electronics Co., Ltd. Scalable video coding method and apparatus using pre-decoder
WO2008047257A2 (en) 2006-10-20 2008-04-24 Nokia Corporation System and method for providing picture output indications in video coding
FR2933837A1 (en) * 2008-12-10 2010-01-15 Thomson Licensing Video images sequence coding method for e.g. satellite distribution network, involves coding auxiliary transcoding aid data e.g. coding parameters, into supplemental enhancement information message of coded image data stream
US7844992B2 (en) 2003-09-10 2010-11-30 Thomson Licensing Video on demand server system and method
DE102004029872B4 (en) * 2004-06-16 2011-05-05 Deutsche Telekom Ag Method and device for improving the quality of transmission of coded audio / video signals
US8270496B2 (en) 2005-10-12 2012-09-18 Thomson Licensing Region of interest H.264 scalable video coding
US8275051B2 (en) 2007-04-17 2012-09-25 Nokia Corporation Feedback based scalable video coding
US8677504B2 (en) 2005-07-14 2014-03-18 Qualcomm Incorporated Method and apparatus for encrypting/decrypting multimedia content to allow random access
US9501817B2 (en) 2011-04-08 2016-11-22 Dolby Laboratories Licensing Corporation Image range expansion control methods and apparatus

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8391482B2 (en) * 2001-05-04 2013-03-05 Hewlett-Packard Development Company, L.P. Signal format that facilitates easy scalability of data streams
US20030041257A1 (en) * 2001-05-04 2003-02-27 Wee Susie J. Systems, methods and storage devices for scalable data streaming
US7155010B2 (en) * 2001-05-04 2006-12-26 Hewlett-Packard Development Company, L.P. Signal format that facilitates easy scalability of encrypted streams
US7124442B2 (en) * 2001-07-25 2006-10-17 440 Pammel, Inc. System and method for insertion and retrieval of microthreads in transmitted data
US6925120B2 (en) * 2001-09-24 2005-08-02 Mitsubishi Electric Research Labs, Inc. Transcoder for scalable multi-layer constant quality video bitstreams
US20030076858A1 (en) * 2001-10-19 2003-04-24 Sharp Laboratories Of America, Inc. Multi-layer data transmission system
WO2003077457A1 (en) 2002-03-08 2003-09-18 Aware, Inc. Systems and methods for high rate ofdm communications
JP3952459B2 (en) * 2002-11-15 2007-08-01 株式会社リコー Image processing apparatus, program, storage medium, and image processing method
US7406176B2 (en) * 2003-04-01 2008-07-29 Microsoft Corporation Fully scalable encryption for scalable multimedia
FR2857204A1 (en) * 2003-07-03 2005-01-07 Thomson Licensing Sa Digital images compressing and decompressing device for telecommunication network e.g. Internet, has determining unit to determine compression step relative to image encoded by base and improvement layers
US20050076214A1 (en) * 2003-10-03 2005-04-07 Thomas David Andrew Method and system for file downloads to portable computing devices
US7430336B2 (en) 2004-05-06 2008-09-30 Qualcomm Incorporated Method and apparatus for image enhancement for low bit rate video compression
CA2583603A1 (en) * 2004-10-12 2006-04-20 Droplet Technology, Inc. Mobile imaging application, device architecture, and service platform architecture
KR20060059782A (en) * 2004-11-29 2006-06-02 엘지전자 주식회사 Method for supporting scalable progressive downloading of video signal
US7668205B2 (en) * 2005-09-20 2010-02-23 Celo Data, Inc. Method, system and program product for the insertion and retrieval of identifying artifacts in transmitted lossy and lossless data
US8566857B2 (en) * 2005-09-20 2013-10-22 Forefront Assets Limited Liability Company Method, system and program product for broadcast advertising and other broadcast content performance verification utilizing digital artifacts
US8566858B2 (en) * 2005-09-20 2013-10-22 Forefront Assets Limited Liability Company Method, system and program product for broadcast error protection of content elements utilizing digital artifacts
US8966517B2 (en) 2005-09-20 2015-02-24 Forefront Assets Limited Liability Company Method, system and program product for broadcast operations utilizing internet protocol and digital artifacts
KR100733261B1 (en) * 2005-09-21 2007-06-27 에스케이 텔레콤주식회사 Systems and Methods for Objective Video Quality Measurement Using Embedded Video Quality Scores, and Encoding Apparatuses and Methods therefor
MY148628A (en) * 2006-01-11 2013-05-15 Nokia Corp Backward-compatible aggregation of pictures in scalable video coding
CA2657267C (en) 2006-07-13 2013-07-16 Qualcomm Incorporated Video coding with fine granularity scalability using cycle-aligned fragments
KR100937590B1 (en) * 2007-10-23 2010-01-20 한국전자통신연구원 Multiple quality image contents service system and upgrade method thereof
EP2081381A1 (en) * 2008-01-17 2009-07-22 Thomson Licensing Method and apparatus for selective data encryption
CN101552913B (en) * 2009-05-12 2011-07-06 腾讯科技(深圳)有限公司 Multi-channel video communication system and processing method
WO2011028175A1 (en) * 2009-09-01 2011-03-10 Agency For Science, Technology And Research Terminal device and method for processing an encrypted bit stream
US9986252B2 (en) * 2010-04-21 2018-05-29 Mykhaylo Sabelkin Method and apparatus for efficient data communications
US9602841B2 (en) * 2012-10-30 2017-03-21 Texas Instruments Incorporated System and method for decoding scalable video coding
JP6977422B2 (en) * 2017-09-13 2021-12-08 株式会社Jvcケンウッド Transcoding device, transcoding method and transcoding program
RU198678U1 (en) * 2019-12-31 2020-07-22 Федеральное государственное казенное военное образовательное учреждение высшего образования "Краснодарское высшее военное орденов Жукова и Октябрьской Революции Краснознаменное училище имени генерала армии С.М.Штеменко" Министерства обороны Российской Федерации Device for adaptive compression and selective encryption of digital image data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997001934A1 (en) * 1995-06-29 1997-01-16 Thomson Multimedia S.A. System for encoding and decoding layered compressed video data
WO1997011934A1 (en) 1995-09-28 1997-04-03 Nps Pharmaceuticals, Inc. Chiral reductions of imines leading to the syntheses of optically active amines
EP0853407A2 (en) * 1997-01-08 1998-07-15 Digital Vision Laboratories Corporation Data transmission system and method
EP0939545A2 (en) * 1998-02-27 1999-09-01 Hitachi, Ltd. Video service system

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4671849A (en) 1985-05-06 1987-06-09 International Business Machines Corporation Method for control of etch profile
US4918523A (en) * 1987-10-05 1990-04-17 Intel Corporation Digital video formatting and transmission system and method
US5506623A (en) * 1991-12-19 1996-04-09 Sony Corporation Data compression methods and systems with quantization distortion measurement means
EP0731614B1 (en) * 1995-03-10 2002-02-06 Kabushiki Kaisha Toshiba Video coding/decoding apparatus
US6415398B1 (en) * 1995-09-29 2002-07-02 Kabushiki Kaisha Toshiba Coding system and decoding system
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
SE515256C2 (en) 1996-02-12 2001-07-02 Telia Ab LAN network and method of providing uninterrupted transmission between two switches in a LAN
ATE240579T1 (en) * 1996-03-21 2003-05-15 Toshiba Kk DATA ARRANGEMENT METHOD AND MEDIUM FOR DATA RECORDING OR TRANSFER, AND SIGNAL PROCESSING DEVICE FOR THE METHOD AND MEDIUM
US5894478A (en) 1996-07-24 1999-04-13 Lucent Technologies Inc. Protocol converter and router for multi-mode wireless data communications
US5809139A (en) * 1996-09-13 1998-09-15 Vivo Software, Inc. Watermarking method and apparatus for compressed digital video
KR100261253B1 (en) * 1997-04-02 2000-07-01 윤종용 Scalable audio encoder/decoder and audio encoding/decoding method
KR100261254B1 (en) * 1997-04-02 2000-07-01 윤종용 Scalable audio data encoding/decoding method and apparatus
JPH10333698A (en) 1997-05-27 1998-12-18 Matsushita Electric Ind Co Ltd Voice encoding method, voice decoding method, voice encoder, and recording medium
JPH11331852A (en) 1998-05-14 1999-11-30 Matsushita Electric Ind Co Ltd Reversible coding method and reversible coder
US6845159B1 (en) * 1998-10-07 2005-01-18 Protego Information Ab Processing method and apparatus for converting information from a first format into a second format
US6570922B1 (en) * 1998-11-24 2003-05-27 General Instrument Corporation Rate control for an MPEG transcoder without a priori knowledge of picture type
US6915319B1 (en) * 1999-10-08 2005-07-05 Kabushiki Kaisha Kenwood Method and apparatus for interpolating digital signal
ATE417660T1 (en) * 1999-10-29 2009-01-15 Hollingsworth & Vose Co FILTER MATERIAL
US6493387B1 (en) * 2000-04-10 2002-12-10 Samsung Electronics Co., Ltd. Moving picture coding/decoding method and apparatus having spatially scalable architecture and signal-to-noise ratio scalable architecture together
US6925249B1 (en) * 2000-07-28 2005-08-02 Microsoft Corporation System and method for storing compressed data onto a storage medium
US6658057B1 (en) * 2000-08-31 2003-12-02 General Instrument Corporation Digital transcoder with logo insertion
US7586631B2 (en) * 2000-10-06 2009-09-08 Sony Corporation Image quality correction method, image data processing device, data storing/reproducing method, data batch-processing system, data processing method, and data processing system
US7155010B2 (en) * 2001-05-04 2006-12-26 Hewlett-Packard Development Company, L.P. Signal format that facilitates easy scalability of encrypted streams
US7231587B2 (en) * 2004-03-29 2007-06-12 Lsi Corporation Embedded picture PSNR/CRC data in compressed video bitstream

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
"INFORMATION TECHNOLOGY - CODING OF AUDIO-VISUAL OBJECTS - PART 2 : VISUAL AMENDMENT 4: STREAMING VIDEO PROFILE", ISO/IEC JTC1/SC29/WG11 N3315, XX, XX, 1 March 2000 (2000-03-01), XX, pages 01 - 55, XP001014369 *
"JPEG 2000 PART I FINAL COMMITTEE DRAFT VERSION 1.0", ISO/IEC JTC1/SC29/WG1 N1646, XX, XX, 16 March 2000 (2000-03-16), pages A,B,I - XII,1-190, XP001004858 *
LI W: "FINE GRANULARITY SCALABILITY IN MPEG-4 FOR STREAMING VIDEO", ISCAS 2000. PROCEEDINGS OF THE 2000 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS. GENEVA, SWITZERLAND, MAY 28-31, 2000, IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, NEW YORK, NY: IEEE, US, vol. 5 OF 5, 2000, pages 299 - 302, XP000965729, ISBN: 0-7803-5483-4 *
RAMANUJAN R S ET AL: "Adaptive streaming of MPEG video over IP networks", LOCAL COMPUTER NETWORKS, 1997. PROCEEDINGS., 22ND ANNUAL CONFERENCE ON MINNEAPOLIS, MN, USA 2-5 NOV. 1997, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 2 November 1997 (1997-11-02), pages 398 - 409, XP010252445, ISBN: 0-8186-8141-1 *
See also references of EP1327360A1 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1351506A3 (en) * 2002-03-27 2005-04-06 Eastman Kodak Company Encoding rate-distortion information to facilitate optimal transcoding of a compressed digital image
US6987890B2 (en) 2002-03-27 2006-01-17 Eastman Kodak Company Producing and encoding rate-distortion information allowing optimal transcoding of compressed digital image
EP1351506A2 (en) * 2002-03-27 2003-10-08 Eastman Kodak Company Encoding rate-distortion information to facilitate optimal transcoding of a compressed digital image
JP4643988B2 (en) * 2002-09-10 2011-03-02 トムソン ライセンシング Video on demand server system and method
JP2005538651A (en) * 2002-09-10 2005-12-15 トムソン ライセンシング Video on demand server system and method
KR101014451B1 (en) * 2002-09-10 2011-02-14 톰슨 라이센싱 Video on demand server system and method
EP1665799A1 (en) * 2003-08-26 2006-06-07 Samsung Electronics Co., Ltd. Scalable video coding method and apparatus using pre-decoder
EP1665799A4 (en) * 2003-08-26 2010-03-31 Samsung Electronics Co Ltd Scalable video coding method and apparatus using pre-decoder
US7844992B2 (en) 2003-09-10 2010-11-30 Thomson Licensing Video on demand server system and method
DE102004029872B4 (en) * 2004-06-16 2011-05-05 Deutsche Telekom Ag Method and device for improving the quality of transmission of coded audio / video signals
US9167280B2 (en) 2005-07-14 2015-10-20 Qualcomm Incorporated Method and apparatus for encrypting/decrypting multimedia content to allow random access
US8677504B2 (en) 2005-07-14 2014-03-18 Qualcomm Incorporated Method and apparatus for encrypting/decrypting multimedia content to allow random access
US8270496B2 (en) 2005-10-12 2012-09-18 Thomson Licensing Region of interest H.264 scalable video coding
AU2007311526B2 (en) * 2006-10-20 2011-12-15 Nokia Technologies Oy System and method for providing picture output indications in video coding
EP2080375A4 (en) * 2006-10-20 2009-12-02 Nokia Corp System and method for providing picture output indications in video coding
EP2080375A2 (en) * 2006-10-20 2009-07-22 Nokia Corporation System and method for providing picture output indications in video coding
WO2008047257A2 (en) 2006-10-20 2008-04-24 Nokia Corporation System and method for providing picture output indications in video coding
US8275051B2 (en) 2007-04-17 2012-09-25 Nokia Corporation Feedback based scalable video coding
FR2933837A1 (en) * 2008-12-10 2010-01-15 Thomson Licensing Video images sequence coding method for e.g. satellite distribution network, involves coding auxiliary transcoding aid data e.g. coding parameters, into supplemental enhancement information message of coded image data stream
US9501817B2 (en) 2011-04-08 2016-11-22 Dolby Laboratories Licensing Corporation Image range expansion control methods and apparatus
US10395351B2 (en) 2011-04-08 2019-08-27 Dolby Laboratories Licensing Corporation Image range expansion control methods and apparatus

Also Published As

Publication number Publication date
AU2002220595A1 (en) 2002-04-22
KR20020064930A (en) 2002-08-10
KR100887165B1 (en) 2009-03-10
US20110019729A1 (en) 2011-01-27
US8374344B2 (en) 2013-02-12
CN1398489A (en) 2003-02-19
EP1327360A1 (en) 2003-07-16
JP4063666B2 (en) 2008-03-19
RU2284671C2 (en) 2006-09-27
CN100499817C (en) 2009-06-10
JP2004511979A (en) 2004-04-15
US20020076043A1 (en) 2002-06-20
BR0107307A (en) 2002-08-13
PL356718A1 (en) 2004-06-28

Similar Documents

Publication Publication Date Title
US8374344B2 (en) Coding
US9781431B2 (en) Image coding and decoding method and apparatus considering human visual characteristics
US6542546B1 (en) Adaptable compressed bitstream transcoder
JP5042856B2 (en) Apparatus and method for scalable encoding and decoding of an image data stream, signal, computer program, and image quality adaptation module
US6493386B1 (en) Object based bitstream transcoder
US20060072667A1 (en) Transcoder for a variable length coded data stream
US20040179606A1 (en) Method for transcoding fine-granular-scalability enhancement layer of video to minimized spatial variations
WO1998053613A1 (en) Apparatus, method and computer readable medium for scalable coding of video information
Zhao et al. Highly scalable differential JPEG 2000 wavelet video codec for Internet video streaming
EP1639828A1 (en) Method for transcoding a jpeg2000 compressed image

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

ENP Entry into the national phase

Ref document number: 2002 535403

Country of ref document: JP

Kind code of ref document: A

WWE WIPO information: entry into national phase

Ref document number: IN/PCT/2002/874/CHE

Country of ref document: IN

WWE WIPO information: entry into national phase

Ref document number: 1020027007446

Country of ref document: KR

WWE WIPO information: entry into national phase

Ref document number: 2001986831

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 018047726

Country of ref document: CN

WWP WIPO information: published in national office

Ref document number: 1020027007446

Country of ref document: KR

121 EP: the EPO has been informed by WIPO that EP was designated in this application

ENP Entry into the national phase

Ref country code: RU

Ref document number: RU A

Ref document number: 2003113331

Country of ref document: RU

Kind code of ref document: A

WWP WIPO information: published in national office

Ref document number: 2001986831

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642