Skip to content

Path filter#92

Merged
qin-ctx merged 2 commits intomainfrom
path_filter
Feb 7, 2026
Merged

Path filter#92
qin-ctx merged 2 commits intomainfrom
path_filter

Conversation

@kkkwjx07
Copy link
Collaborator

@kkkwjx07 kkkwjx07 commented Feb 7, 2026

Description

Related Issue

Type of Change

  • Bug fix (non-breaking change that fixes an issue)
  • New feature (non-breaking change that adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update
  • Refactoring (no functional changes)
  • Performance improvement
  • Test update

Changes Made

  • 1. Commit 4c5e187: refactor: bitmap filter (最新)

此提交主要涉及代码重构、健壮性增强以及文档更新。

• 测试增强: 新增了
tests/vectordb/test_openviking_vectordb.py,包含一套完整的测试用例,覆盖了元数据创建、数据插入/更新/删除、以及各种复杂
过滤条件(包括前缀匹配、正则、时间范围等)的召回率测试。
• 数据处理健壮性: 在 DataProcessor 中引入了 TYPE_DEFAULTS,为 int64, float32, string
等类型提供了默认值,防止因上游数据缺失字段导致校验失败。同时移除了对 string 类型的硬性长度限制。
• 索引后端优化: 修改了 VikingVectorIndexBackend,在创建索引时会根据是否使用稀疏向量自动选择 flat_hybrid 或 flat
索引类型,并从标量索引中排除了 abstract 字段。
• C++ 核心优化: 优化了 IndexManagerImpl::search 中的 filter bitmap 处理逻辑。
• 文档更新: 更新了
README.md,移除了原本的“实际应用场景示例”(文档检索、推荐系统等),替换为“高级特性”章节,详细介绍了自动 ID
生成和向量归一化功能。

2. Commit 5150018: feat: use path field

此提交重点在于引入新的字段类型和改进数据处理流程。

• Schema 升级: 在 CollectionSchemas 中将 uri, parent_uri 字段类型从 string 更改为更具体的 path 类型,将 created_at,
updated_at 从 string 更改为 date_time 类型。
• 数据处理器集成: 在 LocalCollection, LocalIndex, PersistentIndex 等核心存储类中全面集成了
DataProcessor,用于统一处理数据的校验、类型转换和默认值填充。
• 文档扩展: 在 README.md 中新增了关于 time_range(时间范围查询)和 geo_range(地理位置范围查询)的过滤语法说明。
• 元数据处理: 更新了 IndexMeta 类,使用 DataProcessor 来辅助构建和解析标量索引的元数据。

Testing

  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I have tested this on the following platforms:
    • Linux
    • macOS
    • Windows

Checklist

  • My code follows the project's coding style
  • I have performed a self-review of my code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

Screenshots (if applicable)

Additional Notes

@qin-ctx qin-ctx merged commit 393a4c5 into main Feb 7, 2026
3 checks passed
@qin-ctx qin-ctx deleted the path_filter branch February 7, 2026 09:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants