Layout analysis and content classification in digitized books