Introduction to Digital Images

An overview of digital images using the C++ Utils graphics classes.

Setup

You can follow this guide from your own machine, or try from lab.cs50.io. If you are not using CS50, you will see some markdown for cs50 using curly braces and percentage signs, which you can ignore.

We'll need a reference to the graphics::Image library:

git clone https://github.com/ILXL/cpputils.git

We can now #include "cpputils/graphics/image.h" to use the classes graphics::Image and graphics::Color.

Create main.cc with a basic main() function and include image.h.

Now you are ready to start creating images in C++!

{% spoiler Example %}

#include "cpputils/graphics/image.h"

int main() {
  return 0;
}

{% endspoiler %}

{% next %}

What is a pixel?

A pixel is a set of three tiny lights on a display: red, green and blue. Below you can see a zoomed in photograph of an LCD display showing a pixel grid.

https://commons.wikimedia.org/wiki/File:LCD_RGB.jpg

Images are made from two-dimensional grids of pixels. Pixels are so small that our eyes cannot make out the individual red, green and blue components. Instead, our brains merge the three colors together, creating an illusion of multi-colored light.

Color

To specify a color, you can specify the brightness of the red, green and blue components of each pixel. Brightnesses may range between 0 and 255 (256 options, 256 is 2^8).

By convention, we list red, then green, then blue.

Here are some examples:

Red: 0, Green: 0, Blue: 0

Red: 255, Green: 255, Blue: 255

Red: 255, Green: 0, Blue: 0

Red: 0, Green: 255, Blue: 0

Red: 0, Green: 0, Blue: 255

Red: 255, Green: 191, Blue: 0

Red: 138, Green: 84, Blue: 179

Try making more colors with Google's HTML color picker.

Aside: Unlike mixing paint, where combining red, green and blue would make dark brown, mixing red, green and blue light creates White light. Why? Consider: white light from the sun goes through a prism to create all the colors of the rainbow. Re-combining those colored lights can produce white again! However, because computers only have three pixel colors it is physically impossible for monitors to reproduce every color your eye can see!

Image coordinates

Images are made from a grid of pixels. We can reference individual pixels by their coordinates within the grid by specifying their X location and their Y location.

Unlike the Cartesian coordinate system used in math classes, an image coordinate system begins in the top left corner at (0, 0), and then X increases to the right, while Y increases going down.

https://processing.org/tutorials/drawing/imgs/drawing-03.svg

In the image above, the top left pixel is (0, 0) while the bottom right pixel is (6, 6).

{% next %}

`graphics::Image` class

You can get the graphics::Image class for drawing and displaying images when you #include "cpputils/graphics/image.h".

To create a new image which is 100x100 pixels::

const int size = 100;
graphics::Image image;
image.Initialize(size, size);

This creates a new, all-white image size 100 by 100 pixels.

You can display an image using the ShowUntilClosed function:

image.ShowUntilClosed();

Your turn: Open main.cc and create an image that's 200x150 pixels, then show it. Make sure you've initialized cpputils using the git clone command from the Setup step (check: when you type ls you should see that cpputils/ directory exists).

{% spoiler Example %}

#include "cpputils/graphics/image.h"

int main() {
  const int width = 200;
  const int height = 150;
  graphics::Image image;
  image.Initialize(width, height);
  image.ShowUntilClosed();
  return 0;
}

{% endspoiler %}

Now we can compile and run main.cc. There's a few extra arguments we need for clang++ to properly build a graphical module. Try running the clang++ command below and then executing main, you should get a blank image to show up!

clang++ -std=c++17 main.cc cpputils/graphics/image.cc -o main -lm -lX11 -lpthread
./main

Note: If you are running this on a Mac you will need to add some compile commands instead of -lm -lX11 -lpthread

-lm -I/opt/X11/include -lpthread -lX11 -lstdc++ -I/usr/X11R6/include -L/usr/X11R6/lib

{% next %}

Interacting with pixels

You can set and get the red, green and blue channels of an individual pixel in a graphics::Image using its x and y coordinates. Here are the function prototypes to get the color channels:

/**
 * Returns the red component of the RGB pixel at position
 * (x, y) in the image. Returns -1 if (x, y) is out of bounds.
 */
int GetRed(int x, int y) const;

/**
 * Returns the green component of the RGB pixel at position
 * (x, y) in the image. Returns -1 if (x, y) is out of bounds.
 */
int GetGreen(int x, int y) const;

/**
 * Returns the blue component of the RGB pixel at position
 * (x, y) in the image. Returns -1 if (x, y) is out of bounds.
 */
int GetBlue(int x, int y) const;

And here are the function prototypes to set the color channels. Note these return false if the channel could not be set.

/**
 * Sets the red component of the RGB pixel at position (x, y)
 * in the image. Returns false if (x, y) is out of bounds or
 * |r| is out of range [0, 255].
 */
bool SetRed(int x, int y, int r);

/**
 * Sets the green component of the RGB pixel at position (x, y)
 * in the image. Returns false if (x, y) is out of bounds or
 * |g| is out of range [0, 255].
 */
bool SetGreen(int x, int y, int g);

/**
 * Sets the blue component of the RGB pixel at position (x, y)
 * in the image. Returns false if (x, y) is out of bounds or
 * |b| is out of range [0, 255].
 */
bool SetBlue(int x, int y, int b);

You can use these to manipulate images pixel by pixel. For example, the following code changes the colors of some pixels:

// Make the pixel at (10, 10) red.
image.SetRed(10, 10, 255);
image.SetGreen(10, 10, 0);
image.SetBlue(10, 10, 0);

// Get the red channel of the pixel at 10, 10.
int value = image.GetRed(10, 10);
std::cout << "The red channel is " << value << std::endl;

Your turn: Try drawing some colored pixels on your image, and then display the image. Can you see the recolored pixels on your screen? That's how big a pixel is!

{% next %}

Drawing shapes

You can draw circles, rectangles and lines on a graphics::Image.

Circles

Here is the function prototype to draw a circle:

/**
 * Draws a circle centered at (x, y) with radius |radius|, and color
 * specified by |red|, |green| and |blue| channels. Returns false if
 * params are out of bounds.
 */
bool DrawCircle(int x, int y, int radius, int red, int green, int blue);

Now we can draw circles on the graphics::Image in main.cc.

For example, to draw a red circle with radius 10 centered at x = 25, y = 25:

image.DrawCircle(25, 25, 10, 255, 0, 0);

To draw a black circle with radius 30 centered at x = 50, y = 60:

image.DrawCircle(50, 60, 30, 0, 0, 0);

Your turn: Try drawing a bunch of circles of different colors on your image. Can you make a giant teal circle? What about a small pink circle? Can you make something that looks like this emoji? 😮 How about the Japanese flag?

{% next %}

Rectangles

Drawing a rectangle is very similar to drawing a circle. Here is the function prototype to draw a rectangle:

/**
 * Draws a rectangle with upper left corner at (x, y) and size
 * |width| by |height|, colored by |red|, |green| and |blue|.
 * Returns false if params are out of bounds.
 */
bool DrawRectangle(int x, int y, int width, int height, int red, int green,
                   int blue);

You can specify the top left corner of the rectangle with x and y, and then the width and height.

For example, to draw an orange rectangle over the right of the screen:

const int size = 100;
image.DrawRectangle(size / 2, 0, size / 2 - 1, size, 255, 127, 0);

Note that you cannot draw rectangles or circles outside the image bounds, or a error message is printed.

Your turn: Try updating main.cc to draw some rectangles -- some tall, thin ones, some short, wide ones, and any others you'd like! Can you draw the Italian flag? The Colombian flag?

{% next %}

Lines

Drawing lines is similar to rectangles: You specify the (x, y) coordinates of the starting pixel and the (x, y) coordinates of the ending pixel, as well as red, green and blue components of the pixel color. Here is the prototype of the function to draw lines:

/**
 * Draws a line from (x0, y0) to (x1, y1) with color specified  by |red|, |green| and
 * |blue| channels, and optional width |thickness|. Returns false if params are out of bounds.
 */
bool DrawLine(int x0, int y0, int x1, int y1, int red, int green, int blue);

For example, to draw a purple line diagonally from top right to bottom left:

const int size = 100;
image.DrawLine(size - 1, 0, 0, size - 1, 171, 132, 232);

Using the DrawLine, DrawRect and DrawCircle function you can draw all kinds of images. Here's a tree!

$fractal tree$

Saving images

If you like your image art, go ahead and save it! You can save to a bitmap file using SaveImageBmp. For example, the snippet below saves the current image to a file named "my_image.bmp".

image.SaveImageBmp("my_image.bmp");

Note: If you are on CS50 you can use the file's menu in the filetree to save it to your computer.

{% next %}

Loading images

In addition to creating new images, graphics::Image objects can be loaded from a file. Here's how you can load the file "kitten.bmp" which is included in the resources/ directory (you can save this file to your machine):

graphics::Image image;
bool success = image.Load("resources/kitten.bmp");
// You'll need to #import <iostream> to cout.
std::cout << "Successfully loaded kitten.bmp? " << success << std::endl;

Note: Bitmap images work best on all platforms, but on Ubuntu you can usually load other formats like .jpg and .png.

We can get the width and height of an image using the GetWidth and GetHeight functions:

std::cout << "Image is " << image.GetWidth() << " x " << image.GetHeight() << " pixels." << std::endl;

This unlocks some really cool image manipulation!

Your turn: Can you invert the colors in an image programatically?

->

Hint: You'll need to loop through the width and height of the image and manipulate the color at each pixel location. Remember the valid values for each pixel color are between 0 and 255.

{% spoiler Example %}

#include "cpputils/graphics/image.h"

int main() {
  graphics::Image image;
  // Load the file.
  image.Load("resources/kitten.bmp");
  // Iterate through all the pixels using for loops.
  for (int i = 0; i < image.GetWidth(); i++) {
    for (int j = 0; j < image.GetHeight(); j++) {
      // Get the red, green and blue.
      int red = image.GetRed(i, j);
      int green = image.GetGreen(i, j);
      int blue = image.GetBlue(i, j);
      // Invert the red, green and blue.
      image.SetRed(i, j, 255 - red);
      image.SetBlue(i, j, 255 - blue);
      image.SetGreen(i, j, 255 - green);
    }
  }
  image.ShowUntilClosed();
  return 0;
}

{% endspoiler %}

Your turn: Try other pixel manipulations, like swapping red, green and blue, or setting one channel to 0 or 255. What's the funniest image you can create?

{% next %}

Advanced topics

Feel free to read the image.h header to see what other functionality is available.

`graphics::Color` class

Instead of specifying red, green and blue separately, it can often be easier to work with these channels grouped into a single object. The graphics::Color class is defined in image.h. graphics::Color object can be created as follows:

graphics::Color color(255, 0, 0);

You can get the red, green and blue channels as follows:

int red = color.Red();
int green = color.Green();
int blue = color.Blue();

Using `graphics::Color` with `graphics::Image`

All of the drawing functions introduced above can also work with a graphics::Color. In addition, you can get and set colors at pixel locations using the following functions prototypes on graphics::Image:

/**
 * Gets the color at pixel at position (x, y) in the image.
 * Returns (-1, -1, -1) if (x, y) is out of bounds.
 */
Color GetColor(int x, int y) const;

/**
 * Sets the color of the RGB pixel at position (x, y)
 * in the image. Returns false if (x, y) is out of bounds or
 * red, green or blue are out of range [0, 255].
 */
bool SetColor(int x, int y, const Color& color);

Some examples using graphics::Color:

graphics::Color green(20, 255, 100);
graphics::Color red(240, 20, 0);

// Draw a green circle at x = 25, y = 25, radius 10, color (20, 255, 100).
image.DrawCircle(25, 25, 10, green);

// Set a pixel within the image to a red color.
image.SetColor(30, 30, red);

// Get a pixel color from the image.
graphics::Color pixel = image.GetColor(42, 49);

{% next %}

Simple Animation

Instead of using Image::ShowUntilClosed, use the Image::ShowForMs function to show a graphics::Image for a fixed duration (in milliseconds). For example, the following code snippet shows an image for 10 milliseconds in a window titled "Animation". Note that the program will halt for the specified duration while the image is being displayed.

image.ShowForMs(10, "Animation");

For example, can you create a linear progress bar by drawing rectangles of increasing size and then updating the display?

{% spoiler Example %}

#include "cpputils/graphics/image.h"

int main() {
  const int kWidth = 280;
  const int kSteps = 100;
  const int kMsPerStep = 60;

  graphics::Image image(kWidth + 20, 100);

  // Draw the progres bar background.
  image.DrawRectangle(10, 40, kWidth, 20, graphics::Color(175, 175, 175));

  for (int i = 0; i <= kSteps; i++) {
    // Draw the progress bar contents based on i.
    image.DrawRectangle(10, 40, kWidth * 1.0 / kSteps * i, 20, graphics::Color(25, 50, 255));

    // Display for a few ms before going through the loop again.
    image.ShowForMs(kMsPerStep, "Progress");
  }

  // Keep the image up until the user is done.
  image.ShowUntilClosed("Complete");
  return 0;
}

{% endspoiler %}

{% next %}

Handling Mouse Events

Like animations, you can implement an abstract graphics interface to listen to mouse events. Images can send mouse events after display by using the graphics::MouseEventListener class, adding this class as an MouseEventListener on a graphics::Image, and providing an implementation for the virtual void OnMouseEvent(const MouseEvent& event) function. This function will be called every time a mouse action is detected.

You can get the X and Y coordinates from the MouseEvent using as well as the MouseAction:

int x = event.GetX();
int y = event.GetY();
graphics::MouseAction action = event.GetMouseAction();

Here's the definition of a MouseAction:

/**
 * Enum representing whether a button was pressed or released.
 */
enum class MouseAction {
  // Left button down.
  kPressed = 0,
  // Moved while left button was down.
  kDragged,
  // Left button up.
  kReleased,
  // Moved but the left button was not down.
  kMoved,
};

For example, when the user presses the mouse button down at the coordinate (10, 10), we would get a graphics::MouseEvent event with event.GetX() == 10, event.GetY() == 10, and event.GetMouseAction() == graphics::MouseAction::kPressed.

You need to add your class as a listener on the image in order to start receiving events, and remove it as a listener when you are done receiving events (don't worry about the *this syntax if you don't know what that means, just copy-paste this into your class):

image_.AddMouseEventListener(*this);

Then in the destructor you should remove your class from the image to clean up. Copy-paste the following into the destructor:

image_.RemoveMouseEventListener(*this);

Below is an example of drawing random colored circles whenever the mouse is clicked. Can you extend this to draw lines when the mouse is dragged too? What about creating a simple brush tool where it draws thick lines between each deteced mouse point?

{% spoiler Example %}

#include <random>

#include "cpputils/graphics/image.h"
#include "cpputils/graphics/image_event.h"

class TouchDotCreator : public graphics::MouseEventListener {
 public:
  ~TouchDotCreator() {
    image_.RemoveMouseEventListener(*this);
  }

  // Initialize and shows the image, as well as adds itself as an
  // MouseEventListener.
  void Start() {
    image_.Initialize(250, 250);
    image_.AddMouseEventListener(*this);
    image_.ShowUntilClosed();
  }

  // Overridden from graphics::MouseEventListener, this method contains all of
  // the logic to update the drawing whenever a mouse event happens.
  void OnMouseEvent(const graphics::MouseEvent& event) override {
    if (event.GetMouseAction() == graphics::MouseAction::kPressed) {
      // Draw a circle of random color wherever the mouse is clicked.
      int red = rand() % 256;
      int green = rand() % 256;
      int blue = rand() % 256;
      image_.DrawCircle(event.GetX(), event.GetY(), 10, red, green, blue);
    }
    // Tell the image to redraw.
    image_.Flush();
  }

 private:
  graphics::Image image_;
};

int main() {
  TouchDotCreator touch_dots;
  touch_dots.Start();
  return 0;
}

Give it a try! You can save this code into mouse.cc, and build and execute with:

clang++ mouse.cc cpputils/graphics/image.cc -o mouse -lm -lX11 -lpthread && ./mouse

{% endspoiler %}

{% next %}

Animation after display

Instead of simple animation, which basically halts the program while the image is being shown, it can sometimes make more sense to have code which updates the image every time a fixed interval elapses.

Images can be animated after display by using the graphics::AnimationEventListener class, adding this class as an AnimationEventListener on a graphics::Image, and providing an implementation for the virtual void OnAnimationStep function. This function will be called every 30 ms by default, or as frequently as specified in an optional parameter to Image::ShowUntilClosed:

// Shows image in a window titled "My animation window", with an animation duration of 60 ms.
image.ShowUntilClosed("My animation window", 60);

Thus if you are showing an animation, all the logic to update the animation should begin from the OnAnimationStep function, which is the only function in the AnimationEventListener interface. At the end of OnAnimationStep you should call Image::Flush() to ensure the drawing is updated.

You need to add your class as a listener on the image in order to start receiving events, and remove it as a listener when you are done receiving events (don't worry about the *this syntax if you don't know what that means, just copy-paste this into your class):

image_.AddAnimationEventListener(*this);

Then in the destructor you should remove your class from the image to clean up. Copy-paste the following into the destructor:

image_.RemoveAnimationEventListener(*this);

For example, let's create an animation of a ball bouncing around like a screensaver. We could make an RedBallAnimator class which inherits from graphics::AnimationEventListener. RedBallAnimator should have a private member variables for:

a graphics::Image to draw on,
integers x_ and y_ to track the position of the ball,
integers dx_ and dy_ to track the change in x_ and y_ with each step.

We need to create three methods for this class:

a Start() function which will create and show the image as well as add itself (the RedBallAnimator) as a listener.
a destructor, ~RedBallAnimator, which removes itself as an AnimationEventListener from the image, and
OnAnimationStep, overridden from superclass graphics::AnimationEventListener, which will perform the drawing and logic.

Finally, the main function will simply create the RedBallAnimator, ask it to Start(), and then return 0.

The entirety of the logic for drawing the ball goes into OnAnimationStep. In this case, we can update the x_ and y_ member variables of RedBallAnimator by adding a fixed delta. When an edge is reached, the delta is simply subtracted instead of added.

Below is a short program that runs this animation. Can you change this code so that you have two balls bouncing simultaneously (perhaps by adding another set of x_ and y_)? What about creating something which grows larger and smaller in radius instead of moving its location?

{% spoiler Example %}

#include "cpputils/graphics/image.h"
#include "cpputils/graphics/image_event.h"

class RedBallAnimator : public graphics::AnimationEventListener {
 public:
  ~RedBallAnimator() {
    image_.RemoveAnimationEventListener(*this);
  }

  // Initialize and shows the image, as well as adds itself as an
  // AnimationEventListener.
  void Start() {
    image_.Initialize(250, 250);
    image_.AddAnimationEventListener(*this);

    // Milliseconds between animation frames.
    int animation_duration = 60;
    image_.ShowUntilClosed("Animation", animation_duration);
  }

  // Overridden from graphics::AnimationEventListener, this method contains
  // all of the logic to animate the image after it is displayed.
  void OnAnimationStep() override {
    // Draw a white circle to erase the previous red dot.
    image_.DrawCircle(x_, y_, 10, 255, 255, 255);

    // Check if we've hit an edge and need to bounce.
    if (x_ + dx_ >= 250 || x_ + dx_ < 0) {
      dx_ = -1 * dx_;
    }
    if (y_ + dy_ >= 250 || y_ + dy_ < 0) {
      dy_ = -1 * dy_;
    }

    // Update the position based on dy_ and dx_.
    x_ += dx_;
    y_ += dy_;

    // Draw a red circle in the new location.
    image_.DrawCircle(x_, y_, 10, 255, 0, 0);

    // Tell the image to redraw.
    image_.Flush();
  }

 private:
  graphics::Image image_;
  int x_ = 49;
  int y_ = 199;
  int dx_ = 5;
  int dy_ = 10;
};

int main() {
  RedBallAnimator bouncy;
  bouncy.Start();
  return 0;
}

Give it a try! You can save this code into animation.cc, and build and execute with:

clang++ animation.cc cpputils/graphics/image.cc -o animation -lm -lX11 -lpthread && ./animation

{% endspoiler %}

Animation and Mouse Events in a single program

If you want to receive mouse events and do animation, for example to create a game, you can create a class that inherits from both graphics::AnimationEventListener and graphics::MouseEventListener. Register the class as a listener of on the image for both mouse and animation events, then use Image::ShowUntilClosed. The OnMouseEvent callback will occur any time there is a mouse event, while the OnAnimationStep will occur regularly.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
resources		resources
.cs50.yml		.cs50.yml
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction to Digital Images

Setup

What is a pixel?

Color

Image coordinates

`graphics::Image` class

Interacting with pixels

Drawing shapes

Circles

Rectangles

Lines

Saving images

Loading images

Advanced topics

`graphics::Color` class

Using `graphics::Color` with `graphics::Image`

Simple Animation

Handling Mouse Events

Animation after display

Animation and Mouse Events in a single program

About

Uh oh!

Releases

Packages

License

ILXL-guides/intro-to-graphics

Folders and files

Latest commit

History

Repository files navigation

Introduction to Digital Images

Setup

What is a pixel?

Color

Image coordinates

graphics::Image class

Interacting with pixels

Drawing shapes

Circles

Rectangles

Lines

Saving images

Loading images

Advanced topics

graphics::Color class

Using graphics::Color with graphics::Image

Simple Animation

Handling Mouse Events

Animation after display

Animation and Mouse Events in a single program

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

`graphics::Image` class

`graphics::Color` class

Using `graphics::Color` with `graphics::Image`

Packages