Qualcomm Patent | Reducing Seam Artifacts in 360-Degree Video

Patent: Reducing Seam Artifacts in 360-Degree Video

Publication Number: 20190007683

Publication Date: 2019-01-03

Applicants: Qualcomm

Abstract

Provided are systems and methods for processing 360-degree video data by obtaining a 360-degree rectangular formatted projected picture, the 360-degree rectangular formatted projected picture including a first region with a region boundary and a first region area; identifying coding tree units (CTUs) within the first region area; selectively identifying a first coding unit (CU) and a second CU from within the CTUs; determining an initial QP value for the first CU; determining an initial QP value for the second CU; identifying the first CU as a region boundary CU; identifying the second CU as a non-region boundary CU; reducing the initial QP value for the first CU to a final first QP value in response to identifying the first CU as a region boundary CU, and generating an encoded video bitstream comprising: the final QP value for the first CU, and the initial QP value for the second CU.

Background

Video coding standards include Joint Video Exploration Team (JVET) of MPEG and VCEG, ITU-T H.261, ISO/IEC MPEG-1 Visual, ITU-T H.262 or ISO/IEC MPEG-2 Visual, ITU-T H.263, ISO/IEC MPEG-4 Visual, ITU-T H.264 (also known as ISO/IEC MPEG-4 AVC), including its Scalable Video Coding (SVC) and Multiview Video Coding (MVC) extensions and ITU-T H.265 (also known as ISO/IEC MPEG-4 HEVC) with its extensions.

Summary

In various implementations, methods, devices, apparatus, and computer-readable media for processing 360-degree video data to obtain better coding efficiency are described herein.

A method can include obtaining a 360-degree rectangular formatted projected picture, wherein the 360-degree rectangular formatted projected picture comprises at least a first region, wherein the at least first region comprises at least one region boundary, and wherein the at least first region comprises a first region area; identifying a plurality of coding tree units (CTUs) within the first region area; selectively identifying a first coding unit (CU) and a second CU from within the plurality of CTUs; determining an initial QP value for the first CU; determining an initial QP value for the second CU; identifying the first CU as a region boundary CU; identifying the second CU as a non-region boundary CU; reducing the initial QP value for the first CU to a final first QP value in response to identifying the first CU as a region boundary CU, and generating an encoded video bitstream comprising: the final QP value for the first CU, and the initial QP value for the second CU.

In some embodiments, reducing the initial QP value to a final first QP value in response to identifying the first CU as a region boundary CU, further comprises reducing the initial QP value for the first CU below a threshold value.

In further embodiments, the threshold value can be 15.

In additional embodiments, identifying the first CU as a region boundary CU, further comprises: determining a CU size based on a residual, wherein the CU size is one of: 8.times.8, 16.times.16, 32.times.32 and 64.times.64.

In some implementations, identifying the first CU as a region boundary CU, further comprises: identifying the region boundary CU as located adjacent to at least one region boundary.

In further implementations, identifying the first CU as a region boundary CU, further comprises: identifying the region boundary CU as located in a QP adjustment area, wherein the QP adjustment area is located adjacent to the at least one region boundary.

In additional implementations, the QP adjustment area has a QP adjustment area width, and wherein the QP adjustment area width is based on delta-QP coding.

In some embodiments, the a 360-degree rectangular formatted projected picture is a picture type comprising one of: intra-coded picture type, P picture type or B picture type.

In further embodiments, the 360-degree rectangular formatted projected picture is a picture type that is only intra-coded.

In additional embodiments, the 360-degree rectangular formatted projected picture is at or above a first level of a picture prediction hierarchy in a random-access (RA) configuration.

In some implementations, the first level of the picture prediction hierarchy in an RA configuration further includes intra-coded pictures and first level B pictures.

In further implementations, reducing the initial QP value for the first CU to a lower final first QP value in response to identifying the first CU as a region boundary CU, further comprises: reducing a QP value for the first CU based on a QP value of the 360-degree rectangular formatted projected picture.

In additional implementations, the method further comprises identifying a third CU as a region boundary CU; and selectively not reducing a QP value for the third CU when a QP value of the 360-degree rectangular formatted picture is below a threshold value.

A device can include a memory and a processor configured to perform a method that includes obtaining a 360-degree rectangular formatted projected picture, wherein the 360-degree rectangular formatted projected picture comprises at least a first region, wherein the at least first region comprises at least one region boundary, and wherein the at least first region comprises a first region area; identifying a plurality of coding tree units (CTUs) within the first region area; selectively identifying a first coding unit (CU) and a second CU from within the plurality of CTUs; determining an initial QP value for the first CU; determining an initial QP value for the second CU; identifying the first CU as a region boundary CU; identifying the second CU as a non-region boundary CU; reducing the initial QP value for the first CU to a final first QP value in response to identifying the first CU as a region boundary CU, and generating an encoded video bitstream comprising: the final QP value for the first CU, and the initial QP value for the second CU.

A computer-readable medium can a non-transitory computer-readable medium storing a program containing instructions that, when executed by a processor of a device, cause the device to perform a method that includes obtaining a 360-degree rectangular formatted projected picture, wherein the 360-degree rectangular formatted projected picture comprises at least a first region, wherein the at least first region comprises at least one region boundary, and wherein the at least first region comprises a first region area; identifying a plurality of coding tree units (CTUs) within the first region area; selectively identifying a first coding unit (CU) and a second CU from within the plurality of CTUs; determining an initial QP value for the first CU; determining an initial QP value for the second CU; identifying the first CU as a region boundary CU; identifying the second CU as a non-region boundary CU; reducing the initial QP value for the first CU to a final first QP value in response to identifying the first CU as a region boundary CU, and generating an encoded video bitstream comprising: the final QP value for the first CU, and the initial QP value for the second CU.

An apparatus can include means for obtaining a 360-degree rectangular formatted projected picture, wherein the 360-degree rectangular formatted projected picture comprises at least a first region, wherein the at least first region comprises at least one region boundary, and wherein the at least first region comprises a first region area; means for identifying a plurality of coding tree units (CTUs) within the first region area; means for selectively identifying a first coding unit (CU) and a second CU from within the plurality of CTUs; means for determining an initial QP value for the first CU; means for determining an initial QP value for the second CU; means for identifying the first CU as a region boundary CU; means for identifying the second CU as a non-region boundary CU; means for reducing the initial QP value for the first CU to a final first QP value in response to identifying the first CU as a region boundary CU, and means for generating an encoded video bitstream comprising: the final QP value for the first CU, and the initial QP value for the second CU.

发表评论

电子邮件地址不会被公开。 必填项已用*标注