Distributed On-the-Fly Image Processing and Open Source at Vimeo

When you think of Vimeo, you probably think of video — after all, it’s what we do. However, we also have to handle creation and distribution a lot of images: thumbnails, user portraits, channel headers, and all the various awesome graphics around Vimeo, to name a few.

For a very long time, all of this content was static and served from the CDN as-is. If we wanted to introduce a new video resolution, we would have to run batch jobs to get new, higher resolution thumbnails from all of the videos on the site, where possible. This also means that if we ever wanted to tweak the quality of said images, we would be out of luck. It also meant on mobile, or on a high DPI screen, we had to serve the same size images as on our main site, unless we wanted to store higher and/or lower resolution versions of the same images.

Enter ViewMaster.

About two years ago, during one of our code jams, one of our mobile site developers brought the issue to us, the transcode team, in search of a backend solution. ViewMaster was born that night, but sat idle for a long time after, due to heavy workloads, before being picked up again a few months ago.

We’ll go into more detail below, but a quick summary of what ViewMaster is and does is:

Written in Go and C.
Resizes, filters, crops, and encodes (with optimizations such as pngout to different formats on-the-fly, and entirely in memory.
Can be scaled out; each server is ‘dumb’.
Reworked thumbnailing that picks one ‘good’ thumbnail per video, during upload, and stores a master for use with the on-the-fly processing later on.
Migrates our existing N versions of each image to one master, high quality image to be stored.

This allows us to:

Serve smaller or larger images to different screen types, depending on DPI, and OS.
Serve optimized images for each browser; e.g. WebP for Chrome, and JPEG-XR for IE11+.
Easily introduce new video resolutions and player sizes.
Scale thumbnail images to the size of an embed, for display.
Introduce new optimizations such as mozjpeg instantly, and without any significant migration problems.

Now for the slightly more technical bits.

General Architectural Overview and Migration

ViewMaster Flow

A general look at the process is given in the diagram above. If you’d like a more detailed look at the infrastructure and migration strategy (and a higher res diagram), including what some of those funny names mean, head over to the Making Vimeo Blog to check it out!

Open Source

The actual image processing happens entirely in memory â€” the disk is never hit. The main image processing service is written in Go, and making somewhat liberal use of its C FFI to call several libraries and a few tiny C routines, open source or otherwise. It is known that calling C functions from Go has an overhead, but in practice, this has been negligible compared to the time taken by much more intensive operations inside the libraries, such as decoding, encoding, resizing, etc.

The process is rather straight forward: The video frame is seeked to and decoded and converted to RGB (yes, JPEG is YCbCr, but it made more sense for the master to be stored as RGB to us) and/or the image is decoded, and various calculations are done to account for things like non-square pixels, cropping, resizing, aspect ratios, etc. The image is then resized, encoded, and optimized. All of this is done in-memory using buffered IO in Go (via bufio), and if need be piped to an external process and back to the service where libraries are not available, such as the case is with Gifsicle and pngout.

Plenty of tricks are used to speed things up, such as detecting the image type and resolution based on mime-type, libmagic, and the libraries listed below, so we don’t need to call avformat_find_stream_info, which does a full decode of the image to get this information.

A few of the notable open source libraries we leverage (and contribute to!), include:

FFmpeg & Libav – Base image decoding libraries (libavcodec), resizing (swscale), remote image access. Now supports CMYK JPEGs too!
FFMS2 – Frame accurate seeking using the above libraries.
libwebp – WebP encoding.
LCMS2 – ICC profile handling.

On top of those, we’ve written several Go packages to aid in this as well, some of which we have just open sourced:

go-util – General utility functions for Go.
go-iccjpeg – ICC profile extraction from a generic io.Reader.
go-magic – Idiomatic Go bindings for the libmagic C API using io.Reader.
go-imgparse – Resolution extraction from JPEG, PNG, and WebP images optimized for I/O and convenience, again using a standard io.Reader.
go-taglog – Extended logging package compatible with the standard Go logging package.
go-mediainfo – Very basic binding for MediaInfo.

Future

Although we are currently optimizing quite well for PNG, and WebP, there is still lots to be done. To that end, we have been involved with an contributing to a number open source projects to create a better and faster web experience. A few are discussed below. It may not have been obvious though, since we tend to use our standard email accounts to contribute, rather than our corporate ones… Sneaky!

mozjpeg – Very promising already, having added features such as scan optimization, trellis quantization, DHT/DQT table merging, and deringing via overshoot clipping, with future features such as optimized quantization tables for high DPI displays and globally optimal edge extension. We plan to roll this out after the plan for ABI compatibility is implemented in 3.0, and also we plan to then add support to ImageMagick to benefit the greater community, if someone else has not already.

jxrlib – Awesome of Microsoft to open source this, but it needs a bit of work API-wise (that is, an actual API). Until fairly recently, it could not even be built as a library.

jpeg-recompress – Alongside mozjpeg, something akin to this is very desirable for JPEG generation. Uses the excellent IQA with mozjpeg and some other metrics (one implemented poorly) by me!).

Open Source PNG optimization library – This was a bit of a sticking point with us. The current open source PNG optimization utils do not support any in-memory API at all, or in fact, even piping via stdin/stdout. pngout is the only tool which even supports piping. Long term, we’d like to be able to ditch the closed source tool and contribute an API to one of these projects.

Photoshop, GIMP, etc. plugins – I plan to implement these using the above-mentioned libraries, so designers can more easily reap the benefits of better image compression.

Transcoder dude at Vimeo, and DSP hacker. Really into coffee. Reallllllllllllllllly.

7 comments

Valérie Martin

Very strange that you stick with aging and closed source PNGOUT since ZopfliPNG appears to be better and open source.

ZopfliPNG is part of Google Zopfli:
https://code.google.com/p/zopfli/
Here is a comparison that shows it outperforms pngout (in resulting size not necessarily speed):
http://frdx.free.fr/png_optimization.htm

November 5th, 2014 at 11:31
1. Derek Buitenhuis
  
  Thanks for the heads up! It didn’t come up at all while I was searching for such a library. I was aware of Zopfli, but not that it had a PNG library. I will definitely check it out (as in I’ve already hooked it up and am testing now!). The only downside is that it only has C++ bindings, but a simple C API wrapper should be easy. Looks to be leaps and bounds ahead of the other PNG optimization tools, in terms of ease of integration.
  
  EDIT: Looks like it didn’t actually *exist* at the time.
  
  November 5th, 2014 at 13:12
Fabricio Zuardi

any plans of removing the dependency on the flash plugin for watching videos on your site and maybe adopting free codecs such as vp8 or vp9 / webm? i use firefox in debian without plugins and i know it can play videos because youtube works fine ;)

November 5th, 2014 at 14:50
1. Michael Doe
  
  I second this. I killed flash in my Firefox years ago and I have to use another browser just to use a single website hosting its videos on Vimeo.
  
  November 6th, 2014 at 05:51
  1. Ivan
    
    I left flash since I came to Debian SID and youtube html5 player looks (in fact, IS) much better and fluid…
    
    Sometimes, when a website doesn’t have another option but flash, I use that kind of ‘converter site’ to download and convert video from these sites…
    
    Using this approach in the last year, I realized most sites are migrating to html5 players…
    
    November 7th, 2014 at 04:58
Fabricio C Zuardi

Another great consequence of not using plugins in the browser is that you show companies that you are not willing to accept DRM.

November 10th, 2014 at 07:29
Simon

Hi,
We have developed something similiar to ViewMaster it’s called Bildero we are providing All-in-one solution for 2D zoom and 360 images, althought we Bildero Image Server enables to process images on the fly. You can easily itegrate it with your website and provide Responsive images on your website for your visitors.
Just check all mighty features of Bildero there: http://www.bildero.com

Best,
Simon.

November 12th, 2014 at 15:08

Hacks

By Derek Buitenhuis

General Architectural Overview and Migration

Open Source

Future

About Derek Buitenhuis

7 comments

Distributed On-the-Fly Image Processing and Open Source at Vimeo

By Derek Buitenhuis

General Architectural Overview and Migration

Open Source

Future

About Derek Buitenhuis

Discover great resources for web development

Thanks! Please check your inbox to confirm your subscription.