Releases: SthPhoenix/InsightFace-REST
Releases · SthPhoenix/InsightFace-REST
InsightFace-REST v0.7.0.0
2021-11-06 v0.7.0.0
Since a lot of updates happened since last release version is updated straight to v0.7.0.0
Comparing to previous release (v0.6.2.0) this release brings improved performance for SCRFD based detectors.
Here is performance comparison on GPU Nvidia RTX 2080 Super for scrfd_10g_gnkps detector paired with
glintr100 recognition model (all tests are using src/api_trt/test_images/Stallone.jpg, 1 face per image):
| Num workers | Client threads | FPS v0.6.2.0 | FPS v0.7.0.0 | Speed-up |
|---|---|---|---|---|
| 1 | 1 | 56 | 103 | 83.9% |
| 1 | 30 | 72 | 128 | 77.7% |
| 6 | 30 | 145 | 179 | 23.4% |
Additions:
- Added experimental support for msgpack serializer: helps reduce network traffic for embeddings for ~2x.
- Output names no longer required for detection models when building TRT engine - correct output order is now extracted
from onnx models. - Detection models now can be exported to TRT engine with batch size > 1 - inference code doesn't support it yet, though
now they could be used in Triton Inference Server without issues.
Model Zoo:
- Added support for WebFace600k based recognition models from InsightFace repo:
w600k_r50andw600k_mbf - Added md5 check for models to allow automatic re-download if models have changed.
- All
scrfdbased models now supports batch dimension.
Improvements:
- 1.5x-2x faster SCRFD re-implementation with Numba: 4.5 ms. vs 10 ms. for
lumia.jpgexample with
scrfd_10g_gnkpsand threshold = 0.3 (432 faces detected)). - Move image normalization step to GPU with help of CuPy (4x lower data transfer from CPU to GPU, about 6%
inference speedup, and some computations offloaded from CPU). - 4.5x Faster
face_align.norm_cropimplementation with help of Numba and removal of unused computations.
(Cropping 432 faces fromlumia.jpgexample tooks 45 ms. vs 205 ms.). - Face crops are now extracted only when needed - when face data or embeddings are requested, improving
detection only performance. - Added Numba njit cache to reduce subsequent starts time.
- Logging timings rounded to ms for better readability.
- Minor refactoring
Fixes:
- Since gender/age estimation model is currently not supported exclude it from models preparing step.
InsightFace-REST v0.6.2.0
2021-09-09 v0.6.2.0
REST-API
- Use async
httpxlib for retrieving images by urls instead of urllib3 (which caused
performance drop in multi-GPU environment under load due to excessive usage of opened sockets) - Update draft Triton Infernce Server support to use CUDA shared memory.
- Minor refactoring for future change of project structure.
This release also includes previously missed release version 0.6.1.0:
2021-08-07 v0.6.1.0
REST-API
- Dropped support of MXNet inference backend and automatic MXNet->ONNX models conversion,
since all models are now distributed as ONNX by default.
InsightFace-REST v0.6.0.0
v0.6.0.0
REST-API
- Added support for newer InsightFace face detection SCRFD models:
scrfd_500m_bnkps,scrfd_2.5g_bnkps,scrfd_10g_bnkps - Released custom trained SCRFD models:
scrfd_500m_gnkps,scrfd_2.5g_gnkps,scrfd_10g_gnkps - Added support for newer InsightFace face recognition model
glintr100 - Models auto download switched to Google Drive.
- Default models switched to
glintr100andscrfd_10g_gnkps
InsightFace-REST v0.5.9.9
v0.5.9.9
REST-API
- Added JPEG decoding using PyTurboJPEG - increased decoding speed for large
JPEGs for about 2x. - Support for batch inference of
genderagemodel. - Support for limiting number of faces for recognition using
limit_facesparameter
inextractendpoint. - New
/multipart/draw_detectionsendpoint, supporting image upload using multipart
form data. - Support for printing face sizes and scores on image by
draw_detectionsendpoints. - More verbose timings for
extractendpoint for debug and logging purposes.
InsightFace-REST v0.5.9.6
Update version tag
InsightFace-REST v0.1.2
Old version based on TF MTCNN