CocoNet: A deep neural network for mapping pixel coordinates to color values
Paul Andrei Bricman
Radu Tudor Ionescu
arXiv e-Print archive - 2018 via Local arXiv
First published: 2018/05/29 (5 years ago) Abstract: In this paper, we propose a deep neural network approach for mapping the 2D
pixel coordinates in an image to the corresponding Red-Green-Blue (RGB) color
values. The neural network is termed CocoNet, i.e. COordinates-to-COlor
NETwork. During the training process, the neural network learns to encode the
input image within its layers. More specifically, the network learns a
continuous function that approximates the discrete RGB values sampled over the
discrete 2D pixel locations. At test time, given a 2D pixel coordinate, the
neural network will output the approximate RGB values of the corresponding
pixel. By considering every 2D pixel location, the network can actually
reconstruct the entire learned image. It is important to note that we have to
train an individual neural network for each input image, i.e. one network
encodes a single image only. To the best of our knowledge, we are the first to
propose a neural approach for encoding images individually, by learning a
mapping from the 2D pixel coordinate space to the RGB color space. Our neural
image encoding approach has various low-level image processing applications
ranging from image encoding, image compression and image denoising to image
resampling and image completion. We conduct experiments that include both
quantitative and qualitative results, demonstrating the utility of our approach
and its superiority over standard baselines, e.g. bilateral filtering or
The experiment is nice.
Though I assume the net practically memorized data and not inferred it as it makes little sense to say something intelligent on the pixel color by its location.
What I wonder if this can be made into something more clever.
A net with memory (RNN?) that gets the pixel coordinate in addition to estimation of pixels in the neighborhood or something.
Anyhow, I wonder if there is a code to replicate results.