I love the reddit team's presentations. Does anyone have a video to the whole thing?
I'm guessing reddit uses BeautifulSoup just for crawling for images? I've always wondered how they decide which of the 100 images on a page to use for thumbnails.
If the thumbs were really cut off (hard to tell from the slides), you should use the Python OpenCV hooks and center the thumbnail on the likeliest face.
A couple months ago, Ricardo Galli - spanish hacker and creator of the open-source reddit-like "Meneame" - posted an explanation of his algorithm for doing that.
I love offbeat slides like these, but I only get 10% of the story from them. Mind you, making up the content might be more interesting than the real thing, in some cases!
These slides are hilarious. Definitely good to see a group of developers that don't take themselves too seriously and don't inject a macho attitude into their work. Refreshing for sure.