Retrieve missing licenses for purl documented dependencies.
This cli utility takes one or more purl formatted urls from stdin and will try to find the license attached to each of them, by querying various package managers databases.
This is particularly useful to fill GitHub's Dependabot gap of missing 90% of licenses when working at scale with ghas-cli for instance.
- Github Actions ✔️
- Composer✔️
- Go✔️
- Maven✔️
- NPM: 🟠 wip - see issue
- Nuget✔️
- Pip: 🟠wip - see issue
- Rubygems✔️
- Rust: 🟠 wip - see issue
- Swift: 🟠 wip
Builds are available in the Releases tab and on Pypi
- Pypi:
pip install purl-license-checker- Manually:
python -m pip install /full/path/to/purl-license-checker-xxx.whl
# e.g: python3 -m pip install Downloads/purl-license-checker-0.5.0-none-any.whlTo show the help message for each command, run purl-license-checker -h:
Usage: purl-license-checker [OPTIONS] COMMAND [ARGS]...
Retrieve licenses for purl documented dependencies.
Get help: `@jboursier-mwb` on GitHub
Options:
--help Show this message and exit.
Commands:
get_license
load_file
merge_csv
get_license PURL GITHUB_TOKEN
e.g:
get_license pip:ghas-cli gh-123456789qwerty
load_file PATH GITHUB_TOKEN
e.g:
With a PATH csv file formatted as follow:
repo_name, purl, version, licenseWhere missing licenses are set to Unknown, for instance:
ghas-cli, ghas-cli, com.github.Malwarebytes/ghas-cli,, MIT
ghas-cli, pip:charset-normalizer,3.3.2, MIT
ghas-cli, pip:colorama,0.4.6, BSD-2-Clause AND BSD-3-Clause
ghas-cli, pip:click,8.1.7, BSD-2-Clause AND BSD-3-Clause
ghas-cli, pip:python-magic,0.4.27, MIT
ghas-cli, pip:urllib3,2.2.3, MIT
ghas-cli, pip:requests,2.32.3, Apache-2.0
ghas-cli, pip:configparser,7.1.0, MIT
ghas-cli, pip:certifi,2024.8.30, MPL-2.0
ghas-cli, pip:idna,3.10, BSD-2-Clause AND BSD-3-Clause
ghas-cli, actions:actions/checkout,4.*.*, Unknown
ghas-cli, actions:github/codeql-action/analyze,3.*.*, Unknown
ghas-cli, actions:github/codeql-action/init,3.*.*, Unknown
ghas-cli, actions:actions/dependency-review-action,4.*.*, Unknownload_file will do its best to find the licenses for all Unknown license fields and will output its results in output.csv.
The output format is as follow:
purl, licenseFor instance:
npm:unicode-match-property-ecmascript, MIT
npm:unicode-match-property-value-ecmascript, MIT
npm:unicode-property-aliases-ecmascript, MIT
npm:universalify, MIT
npm:unpipe, MIT
npm:use-sync-external-store, MIT
npm:util-deprecate, MIT
npm:utils-merge, MITmerge_csv LICENSES_INPUT_PATH DEPENDENCIES_OUTPUT_PATH GITHUB_TOKEN
Allows to fill the unknown dependencies in DEPENDENCIES_OUTPUT_PATH formatted as repo_name, purl, version, license from LICENSES_INPUT_PATH containing only purl, license.
Particularly useful with a workflow based on ghas-cli.
Install uv first, then:
make dev- Bump the version number:
uv version --minor - Update the
__version__field insrc/cli.pyaccordingly.
Requires syft to be installed to generate the sbom.
- Bump the version number as described above
make releaseto build the packagesgit commit -a -S Bump to version 1.1.2andgit tag -s v1.1.2 -m "1.1.2"- Upload
dist/*,checksums.sha512andchecksums.sha512.ascto a new release in GitHub.
This repository is provided as-is and isn't bound to Malwarebytes' SLA.