Problem: Python's re module is slow. This slows down all Common Crawl processing and increases the cost of running cc-lambda.
Solution: Use re2.
The catch is that pywren does not "auto-install" re2, since the package has both Python and C parts. I asked how to solve this in the pywren repository and have not received an answer yet.
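Until the packaging question is settled, the processing code could be written so that it runs with either module. The following is only a minimal sketch: it assumes an re2 binding importable under the name `re2` that mirrors the `compile`/`finditer` interface of `re`, and `compile_pattern`, `LINK`, and `extract_links` are hypothetical names used for illustration, not part of cc-lambda or pywren.

```python
import re as stdlib_re

try:
    # Assumption: an re2 binding is installed and importable as `re2`.
    import re2 as fast_re
    USING_RE2 = True
except ImportError:
    # The C extension is missing on this worker; use the stdlib module.
    fast_re = stdlib_re
    USING_RE2 = False


def compile_pattern(pattern):
    """Compile with re2 when available, otherwise with the stdlib re."""
    if USING_RE2:
        try:
            return fast_re.compile(pattern)
        except Exception:
            # RE2 rejects some syntax (e.g. backreferences), so fall back.
            return stdlib_re.compile(pattern)
    return stdlib_re.compile(pattern)


# Hypothetical example pattern: pull href targets out of a crawled page.
LINK = compile_pattern(r'href="([^"]+)"')


def extract_links(html):
    """Return every href value found in the given HTML string."""
    return [match.group(1) for match in LINK.finditer(html)]
```

With this shape, workers that have re2 get the speedup, and workers that lack it still produce the same results, only more slowly.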
Potential solutions: