Publishers often use alpha, i.e., [a-z], superscripts to denote affiliations for authors and when retrieving journal information from stored metadata the superscripts can appear as though they are part of the author's family names. When these superscripts are numeric, they do not pose a problem.
How to search for authors of the form XXXX versus XXXX[a-z] when both can be legitimate? For example,
Montresorc Marina
Montresor Marina
Gabele T.
Gabel Thomas
the first example represents an error and the second example represents two legitimate author names.
Ideas
- When family names are similar as in both of these examples, we could search the html for both names using curl and grep.
- Nelson created awk programs to split and organize names such as the list above.
- use orcid to check names
Publishers often use alpha, i.e., [a-z], superscripts to denote affiliations for authors and when retrieving journal information from stored metadata the superscripts can appear as though they are part of the author's family names. When these superscripts are numeric, they do not pose a problem.
How to search for authors of the form XXXX versus XXXX[a-z] when both can be legitimate? For example,
the first example represents an error and the second example represents two legitimate author names.
Ideas