An OCR pipeline built for my cloud engineering class in grad school. I was focused on OCR for 19th century documents.