Assess performance bottlenecks of the gather instruction in JVector #632

@r-devulap

Description

Due to security vulnerabilities in Intel processors up to the Ice Lake generation, the gather instruction was patched in microcode and is now extremely slow. Intel advisory: https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00828.html. JVector uses gather instructions in several places that are worth looking into:

float assemble_and_sum_f32_512(const float* data, int dataBase, const unsigned char* baseOffsets, int baseOffsetsOffset, int baseOffsetsLength) {

float pq_decoded_cosine_similarity_f32_512(const unsigned char* baseOffsets, int baseOffsetsOffset, int baseOffsetsLength, int clusterCount, const float* partialSums, const float* aMagnitude, float bMagnitude) {

‣ Ref: other libraries (e.g., NumPy’s x86 simd sort) improved performance by replacing gather with scalar loads: numpy/x86-simd-sort#65
