Skip to content

Architectural analysis of a baseline ISP pipeline #21

@standing-o

Description

@standing-o

Architectural analysis of a baseline ISP pipeline

Abstract

  • A number of functions are incorporated in an ISP
  • ISP functions are divided into pixel-based and frame-based ones, and are dedicated to one of three color domains in Bayer, RGB, or YCbCr.

Introduction

  • Pixel-based Functions
    ➔ Utilizing an input pixel and its surrounding pixels
    ➔ Exploiting spatial information: spatial filter

  • Frame-based Functions
    ➔ Requiring the whole pixels of an image.
    ➔ Divided by how many images are exploited to get the outcome.

  • Global features
    ➔ Dynamic range extension, Auto-white balance, Auto-exposure, Contrast enhancement

  • Temporal correlation
    ➔ Noise reduction, Rolling-shutter removal, Image stabilization

  • Traditional ISPs
    ➔ Limited frame-based functions (Auto-exposure, Auto-white balance, Auto-focus (3A))

  • Proposed baseline ISP pipeline


1. Embedded ISP Inside an AP (Application Processor)

  • The AP provides abundant memory space as well as bandwidth.
  • So the pixel-based functions can be processed with a legacy baseline ISP, while the frame-based functions can be processed by programming GPU/GPGPU.
  • ISP implementation consumes much energy since it uses the power-hungry memory device and the hot computing units.
    ➔ It can provide the best quaility of an image for end-user satisfaction.

2. Primary ISP Architecture for Bayer Image Sensors

  • The ISP contains three components: quantization, color space conversion and data formatter

  • The image sensor is assumed to produce analog R, G, and B signals at every pixel position.

    • Y, CR, and CB signals are calculated trom these digital R. G and B signals.
  • While ISP itself isn't standardized, the standardization of digital video, particularly in Rec. ITU-R BT.601 and Rec. ITU-R BT.656 since 1982, includes basic components of an ISP.

  • Rec. ITU-R BT.601: This standard focuses on studio encoding parameters for digital television, defining regulations for digitizing SDTV video with a resolution of 720 × 480 or 720 × 576 at a 13.5 MHz sampling frequency.

  • Quantization Formula: The 8-bit quantization formula, specified in Rec. ITU-R BT.601, converts analog R, G, and B signals to digital RGB signals, ensuring consistency in calculation results across different implementations.

  • Color Space Conversion: Rec. ITU-R BT.601 provides a specific formula for converting digital R-G-B signals to Y-CR-CB signals, emphasizing the importance of adhering to the recommended formula for color compatibility among different implementations.

    • ITU-R BT.601 regulates Y-CR-CB subsampling formats like 4:4:4 and 4:2:2, chosen for effective data reduction without significant visual quality loss.
  • The ISP's Data Formatter handles subsampling and interleaving of Y-CR-CB signals, supporting the standardized 4:2:2 chroma subsampling format.

    • Timing Reference in Rec. ITU-R BT.656: Timing reference signals in video data, derived from reserved codewords (SAV and EAV), ensure proper synchronization between transmitter and receiver.
  • Color Filter Arrays (CFAs) and Bayer Patterns
    ➔ Image sensors, often using Bayer patterns for spatial color subsampling, necessitate interpolation (demosaicing) to restore deficient color components.

  • Edge-Directed Interpolation
    ➔ Adaptive interpolation techniques, like edge-directed interpolation, help reduce artifacts like pseudo-color and zipper noise in the demosaicing process.

  • Anti-Aliasing Noise Filter in ISP
    ➔ An ISP evolved for Bayer sensors addresses artifacts with functions like anti-aliasing noise filter, compensating for challenges posed by Bayer array sensors.

  • ISP architecture to recover artifacts from a Bayer image sensor

    • Anti-aliasing Noise Filter: Salt-and-pepper noise produced
      during the manufacturing of image sensor has to be removed before color interpolation.
    • Color Filter Array Interpolation: Restore the original color components from the sampled ones.
      ➔ It results in zipper noise and pseudo-color. The zipper noise can be suppressed considering edge direction during color interpolation process.
    • Noise Filter for Luma: In an anti-aliasing noise filter, it is not possible to exploit correlation with the
      adjacent pixels because they are of different color attributes.
    • Noise Filter for Chrominance: Removing pseudo-color caused by subsampling and interpolation
      process.
      ➔ Because human eyes are very sensitive to rapid color changes, it is necessary to build a natural image by suppressing excessive color changes.

3. ISP Architecture for Color Reproduction

  • Color Perception Difference
    ➔ Due to the varying responses of silicon sensors and human eyes to light, a process for restoring natural color is essential.

    • Color spaces like CIERGB, CIEXYZ, and sRGB are used to represent colors in a 3D system, with sRGB being widely used in consumer electronics.
  • Gamma Correction
    ➔ Nonlinear gamma correction is crucial for adapting the linear response of image sensors to the nonlinear perception of human eyes.

  • Defining an RGB color space involves specifying red, green, blue primaries, a white point, and a gamma correction curve.

    • An ISP pipeline includes both color correction (linear) and gamma correction (nonlinear), supporting a specific RGB color space.
  • Auto-White Balance (AWB)
    ➔ AWB compensates for color distortion due to varying light spectra, adjusting color temperature to match D65 illumination.

  • Chromaticity, represented by hue and saturation, is crucial for color perception. ISP performs hue/saturation control in the Y-CR-CB domain.

  • Color Domain Functions in ISP
    ➔ AWB in Bayer, gamma/color correction in RGB, and hue/saturation control in Y-CR-CB domains collectively reproduce accurate colors perceived by human eyes.

  • ISP architecture for color reproduction

4. ISP Architecture with Pre-/Post-processing

  • Pre-Processing Functions
    ➔ Additional pre-processing compensates for sensor distortions, including dead pixel concealment (DPC) to handle defective pixels.

  • Black Level Compensation (BLC)
    ➔ BLC corrects non-linear sensor responses, estimating the sensor response in no-light conditions by subtracting optical black area averages.

  • Lens-Shading Correction (LSC)
    ➔ LSC compensates for shading effects caused by lens systems, ensuring consistent light-to-voltage gain for all pixels.

  • Flat Field Compensation (FFC)
    ➔ FFC, a type of LSC, compensates for shading by correction gain estimated and stored in advance, improving image uniformity.

  • Noise Reduction
    ➔ Noise reduction, crucial for high-resolution sensors, is performed after achieving consistent linearity and is a key factor determining camera system performance.

  • Visual Quality Enhancement
    ➔ Techniques like edge enhancement and contrast control are applied post-processing for subjective visual quality improvement, with considerations for potential artifacts.

  • ISP architecture for handling sensor derating factors

5. Further Works on ISP

  • For a legacy ISP pipeline, addressing color-related functions to ensure robust color quality across ambient color temperatures is crucial.

    • Global information might require significant memory, which can be achieved with external SDRAM for more sophisticated functions.
  • Ongoing improvements in color interpolation and noise reduction are key focus areas for ISP development. Additionally, suppressing false colors or removing pseudo-colors has become a significant function to prevent distortion in human perception.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions