𓆏

Image Quantizing

Image quantization is the best term I can come up for. When explaining this script to others i describe it as "old computers could only display 8 colors at a time. what 8 colors would you pick to best describe this image." Only difference with the script is the number doesn't need to be 8.

The first step involves building the data structure that holds all the pixel information. I used a dict with an input of the rgb values and stores a value of how many times that color appears in the image (the left side of the video is a 3d object where each dot represent a pixel color that exists in the original image). This is one of the densest ways I could store all the information that I needed to keep and allows me time to calculate an average pixel color as its being built.

Next I create 8 random points (could be any numbter of points) and send them on a random walk, change the points slightly and see if it scores better, to try and find a roughly good answer. What they are trying to solve is what 8 points minimize the distance to all the pixels from the original image. A voronoi diagram could also be used to describe this space, but each pixel is scored only with the closest point. So there's a sweet spot between being close to high value pixel colors (colors that appearr often in the image) and colors that are lonely and far away from all other colors. This process only takes the first 20 frames of the video

Afterwards I use simulated annealing to try and break through any false peaks and find a better global minimum. This is done through the same random walk method, but sometimes the new points are kept even if they give a worse score. The hope is that this can hop over local minimums, and with chaning the probability of how often the worse answers are kept, a better answer is hopefully achieved. This process takes the majority of the time, taking 770 frames in the video.

Lastly another random walk is made with pixel precision. The previous 2 processes were a little noisy to help speed runtime, so a final step to find the true minimum. This process is quite quick, taking only 10 frames of the final video, but it is worth it as the points are obviously seating into their final position.

Original image

Output Image

I did scale the image down by 1/4th. runtime was over a day long and reducing the pixels reduces the dict size. This is ment to imitate a pixel art style so reducing the pixel counts only to rescale later is almost the style im looking for.