anyvec

AnyVec is an open-source Python package that makes it easy to vectorize any type of file — text, images, audio, video, or code — through a single, unified interface. Traditionally, embedding different data types (like text vs. images) requires different models and disparate code paths. AnyVec abstracts away these complexities, allowing you to work with a unified API for all your vectorization needs, regardless of file type.

Supported File Types

Category	Extensions / MIME Types
Text/Docs	.txt, .rtf, .md, .doc, .docx, .odt
PDF	.pdf
Presentation	.ppt, .pptx, .ppsx, .pptm, .odp
Spreadsheet	.xls, .xlsx, .ods
EPUB	.epub
Templates	.dotm, .dotx, .docm
Image	.png, .jpg, .jpeg, .jpe, .bmp, .gif, .tiff, .ico, .icns, .heic, .avif, .webp, .psd
Audio	.mp3, .wav, .ogg, .m4a
Video	.mp4, .avi, .mov, .mkv, .webm, .mpeg, .mpg
Code	.py, .js, .ts, .tsx, .jsx, .java, .cpp, .c, .h, .hpp, .cs, .go, .rb, .php, .pl, .sh, .swift, .scala, .lua, .f90, .f95, .erl, .exs, .bat, .sql, .lisp, .vb, .ipynb, .xml, .yml, .yaml, .json, .kt, .rst, .html

For the most up-to-date list, see the mime_handlers dictionary in the codebase.

Processing Flow

File Type Detection: AnyVec uses MIME type and file extension to determine the file type.
Extraction: The relevant extractor parses text, images, or audio from the file.
Vectorization: The extracted content is sent to a CLIP-like model via API for embedding.
Unified Output: You get back text and image vectors, regardless of input type.

Detailed Processing Flow

Text Files:

Extracts raw text using format-appropriate parsers.
Returns extracted text for vectorization.

Image Files:

Returns the image data as base64-encoded JPEGs or PNGs.
Optionally, OCR (optical character recognition) can be performed for text extraction.

Audio Files:

Audio bytes are sent to a transcription server (e.g., OpenAI Whisper).
The server returns the transcribed text, which is then vectorized.

Video Files:

The video is processed in two ways:
1. Audio Extraction & Transcription:
  - Audio is extracted from the video using MoviePy.
  - The extracted audio is sent to the /transcribe endpoint in your inference container.
  - The returned transcript is used for vectorization.
2. Frame Extraction:
  - Frames are extracted at n-second intervals using OpenCV.
  - Frames are returned as base64-encoded JPEGs for downstream processing or vectorization.

Return Values:

For text, audio, and video: returns extracted text (or transcript) and/or images (frames).
For images: returns images and optionally OCR text.

Quick Start / Usage

Installation

pip install anyvec

For inference, you can skip building locally and pull the latest public image directly from Docker Hub:

docker pull mxy680/clip-inference:latest

Then run the container:

docker run --rm -it -p 8000:8080 mxy680/clip-inference:latest

The API will be available at http://localhost:8000.

To run the container in detached mode (in the background), use:

docker run -d -p 8000:8080 mxy680/clip-inference:latest

The API will still be available at http://localhost:8000 while the container runs in the background.

Basic Example

from anyvec.client import AnyVecClient
from anyvec.models import VectorizationPayload

client = AnyVecClient("http://localhost:8000")

# Process a PDF
with open("example.pdf", "rb") as f:
    file_content = f.read()
payload = VectorizationPayload(file_content=file_content, file_name="example.pdf")
result = client.vectorize(payload)
print("Vectorization result:", result)

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
anyvec		anyvec
clip @ f8426e2		clip @ f8426e2
.gitignore		.gitignore
.gitmodules		.gitmodules
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

anyvec

Supported File Types

Processing Flow

Detailed Processing Flow

Quick Start / Usage

Installation

Basic Example

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

anyvec

Supported File Types

Processing Flow

Detailed Processing Flow

Quick Start / Usage

Installation

Basic Example

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages