- new xmlobject class for a single ALTO file (each page of text)
- add fields / subclasses for `TextBlock` and `TextLine`
- `str` method on `TextLine` should return the `@CONTENT` of its `String`
- add a field to access the `@VPOS` attribute for both `TextBlock` and `TextLine`
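
A rough sketch of the shape these items describe, for discussion only. It uses the standard library's `xml.etree.ElementTree` as a stand-in so it runs without extra dependencies; it is not the xmlobject implementation itself. The names `AltoPage`, `blocks`, `lines`, and `vpos` are hypothetical. It also assumes the ALTO v4 namespace and a single `String` per `TextLine`.

```python
import xml.etree.ElementTree as ET

# Assumption: ALTO v4 namespace; other ALTO versions use a different URI.
ALTO_NS = {"alto": "http://www.loc.gov/standards/alto/ns-v4#"}


def _vpos(node):
    """Return @VPOS as a float, or None when the attribute is absent."""
    value = node.get("VPOS")
    return float(value) if value is not None else None


class TextLine:
    """Wrapper for one ALTO TextLine element."""

    def __init__(self, node):
        self.node = node

    @property
    def vpos(self):
        return _vpos(self.node)

    def __str__(self):
        # Assumption: one String child per line; return its @CONTENT.
        string = self.node.find("alto:String", ALTO_NS)
        return string.get("CONTENT", "") if string is not None else ""


class TextBlock:
    """Wrapper for one ALTO TextBlock element."""

    def __init__(self, node):
        self.node = node

    @property
    def vpos(self):
        return _vpos(self.node)

    @property
    def lines(self):
        # Direct TextLine children of this block, in document order.
        return [TextLine(n) for n in self.node.findall("alto:TextLine", ALTO_NS)]


class AltoPage:
    """Hypothetical wrapper for a single ALTO file (one page of text)."""

    def __init__(self, root):
        self.root = root

    @classmethod
    def from_string(cls, xml_text):
        return cls(ET.fromstring(xml_text))

    @property
    def blocks(self):
        # All TextBlock elements anywhere under the root, in document order.
        tag = "{%s}TextBlock" % ALTO_NS["alto"]
        return [TextBlock(n) for n in self.root.iter(tag)]


# Made-up sample document, only to exercise the classes above.
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v4#">
  <Layout><Page><PrintSpace>
    <TextBlock VPOS="100.0">
      <TextLine VPOS="105.5"><String CONTENT="first line"/></TextLine>
      <TextLine VPOS="130.0"><String CONTENT="second line"/></TextLine>
    </TextBlock>
  </PrintSpace></Page></Layout>
</alto>"""

page = AltoPage.from_string(SAMPLE)
for block in page.blocks:
    for line in block.lines:
        print(line.vpos, str(line))
```

In an xmlobject version, `vpos` would presumably be a declared field on each class and `lines` / `blocks` would be node-list fields, with `__str__` on `TextLine` delegating to a field mapped to `String/@CONTENT`.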