Merged
2 changes: 1 addition & 1 deletion hivemind_etl/mediawiki/etl.py
@@ -103,7 +103,7 @@ def load(self, documents: list[Document]) -> None:
)

# Process batches in parallel using ThreadPoolExecutor
- batch_size = 1000
+ batch_size = 1
amindadgar marked this conversation as resolved.
batches = [documents[i:i + batch_size] for i in range(0, len(documents), batch_size)]

with ThreadPoolExecutor(max_workers=10) as executor:
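For reviewers weighing the effect of this change, here is a minimal sketch of how `batch_size` determines the number of tasks handed to the thread pool. It reuses the slicing expression and `max_workers=10` from the diff; `make_batches`, `process_batch`, `run`, and the sample `documents` list are hypothetical stand-ins, not code from `etl.py`, and the real per-batch work in `load` is not shown in this hunk.

```python
from concurrent.futures import ThreadPoolExecutor


def make_batches(items: list, batch_size: int) -> list[list]:
    """Split items into consecutive slices of at most batch_size elements."""
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


def process_batch(batch: list) -> int:
    """Hypothetical stand-in for the per-batch work; returns the batch length."""
    return len(batch)


def run(items: list, batch_size: int, max_workers: int = 10) -> list[int]:
    """Process every batch on a thread pool and return the size of each batch."""
    batches = make_batches(items, batch_size)
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        # executor.map yields results in the same order as the input batches.
        return list(executor.map(process_batch, batches))


# Assumed sample input: 2500 placeholder documents.
documents = [f"doc-{n}" for n in range(2500)]

sizes_old = run(documents, batch_size=1000)  # three batches: 1000, 1000, 500
sizes_new = run(documents, batch_size=1)     # one batch per document
```

With `batch_size = 1`, every document becomes its own task, so the pool schedules as many tasks as there are documents instead of one task per thousand. Whether that trade-off is the right one depends on what each batch does downstream, which this hunk does not show.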