- Avoid parsing entirety of warc file - Don't parse http records inside Any improvements we can make to mean that large and gargantuan warc files can be read and processed speedily