diff --git a/CHANGELOG.md b/CHANGELOG.md index c34fcd9..e22afb0 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -24,7 +24,7 @@ PUBLISHING PROCEDURE: 5. After publishing, the next PR author will add a new "## Unreleased" section --> -## Unreleased +## 0.6.1 (2026-05-20) ### Changed diff --git a/Cargo.lock b/Cargo.lock index 4187e88..d96d956 100644 --- a/Cargo.lock +++ b/Cargo.lock @@ -5097,7 +5097,7 @@ dependencies = [ [[package]] name = "monodex" -version = "0.6.0" +version = "0.6.1" dependencies = [ "anyhow", "arrow-array", diff --git a/Cargo.toml b/Cargo.toml index 9ceac75..781f257 100644 --- a/Cargo.toml +++ b/Cargo.toml @@ -1,6 +1,6 @@ [package] name = "monodex" -version = "0.6.0" +version = "0.6.1" edition = "2024" rust-version = "1.93" description = "Fast, accurate code search for large Rush monorepos" diff --git a/README.md b/README.md index 21f517d..cfdb281 100644 --- a/README.md +++ b/README.md @@ -361,7 +361,7 @@ monodex dump-chunks --file ./src/JsonFile.ts --debug monodex dump-chunks --file ./src/JsonFile.ts --target-size 4000 # Audit chunking quality across multiple files (AST-only mode) -monodex audit-chunks --count 20 --dir /path/to/project +monodex audit-chunks --count 20 --folder /path/to/project ``` **Chunk Quality Score**: 0-100%, higher is better. Scores below 95% may indicate chunking issues. Note: `dump-chunks` and `audit-chunks` use AST-only mode (fallback disabled) to accurately measure partitioner quality. @@ -460,7 +460,7 @@ RUST_LOG=debug ./target/release/monodex crawl --catalog sparo --label main --com The crawl behavior (which files to index and how to chunk them) can be customized via configuration files. -For the full inventory of files Monodex reads or writes (config-folder state, the database directory layout, repo-local config files), see [docs/design/monodex_files.md](https://github.com/microsoft/monodex/blob/main/docs/design/monodex_files.md). +For the full inventory of files Monodex reads or writes (config-folder state, the database folder layout, repo-local config files), see [docs/design/monodex_files.md](https://github.com/microsoft/monodex/blob/main/docs/design/monodex_files.md). ### Config Discovery @@ -474,7 +474,7 @@ No merging occurs. Exactly one config is used. ### Config Schema -JSON schemas are available in the `schemas/` directory for IDE autocomplete and validation. Reference the appropriate schema in your config file via the `$schema` field: +JSON schemas are available in the `schemas/` folder for IDE autocomplete and validation. Reference the appropriate schema in your config file via the `$schema` field: | Config File | Schema File | | ------------------------- | ----------------------------- | @@ -530,12 +530,12 @@ shouldCrawl = matchesFileType && (matchesPatternsToKeep || !matchesPatternsToExc - `fileTypes` is the primary filter. Unsupported file types are never crawled. - `patternsToKeep` overrides `patternsToExclude` (useful for keeping test files in `src/`) -- Directory patterns (ending in `/`) match anywhere in the path +- Folder patterns (ending in `/`) match anywhere in the path **Pattern syntax:** - Glob patterns use the standard syntax: `**` for recursive, `*` for wildcard -- Directory patterns end with `/` (e.g., `node_modules/`) +- Folder patterns end with `/` (e.g., `node_modules/`) - Example: `**/*.test.ts` matches test files at any depth ## Status diff --git a/docs/backlog.md b/docs/backlog.md index aeff9f8..3486408 100644 --- a/docs/backlog.md +++ b/docs/backlog.md @@ -44,7 +44,7 @@ For official feature requests, create a GitHub issue. If an issue needs higher p -**BL51 `monodex init` command, with `examples/` rename.** Generate `/monodex-config.json`, `/monodex-crawl-config.json`, and `/monodex-state.json` from the templates currently under `examples/`, with `$schema` URLs set to the published locations. Removes a setup step for new users. Implementation: `include_bytes!` to embed templates at compile time, plus a small command handler with the standard "file already exists" handling. Depends on the templates being embedded (trivial) and ideally on schema publication (otherwise `$schema` URLs are placeholders). The directory should be renamed from `examples/` to `config-templates/` as part of this work, since the current name is a misnomer. +**BL51 `monodex init` command, with `examples/` rename.** Generate `/monodex-config.json`, `/monodex-crawl-config.json`, and `/monodex-state.json` from the templates currently under `examples/`, with `$schema` URLs set to the published locations. Removes a setup step for new users. Implementation: `include_bytes!` to embed templates at compile time, plus a small command handler with the standard "file already exists" handling. Depends on the templates being embedded (trivial) and ideally on schema publication (otherwise `$schema` URLs are placeholders). The folder should be renamed from `examples/` to `config-templates/` as part of this work, since the current name is a misnomer. (severity=feature, work=small) @@ -90,7 +90,7 @@ Items with at least one non-obvious insight worth recording, but no commitment t -**BL52 Orphan reclamation garbage collection.** Three orphan kinds, swept by one `monodex gc` command: chunk-row orphans (rows in `chunks` with `active_label_ids = []`, typically from interrupted crawls; reclaimed by deleting the row), vector-payload orphans (non-NULL `vector` on a row no in-selection vector method points at; reclaimed by setting `vector = NULL`, row stays), Tantivy-directory orphans (a directory under `/fts//