Add logic to skip files completely if anomalies were detected like extremely high percentage of vocab density for no reason.