Fix/update cropus name embeddings model lang by lpi-tn · Pull Request #16 · CyberCRI/welearn-database

lpi-tn · 2026-03-31T14:43:42Z

This pull request updates the materialized view for corpus embedding models to include additional metadata and ensures the corresponding SQLAlchemy model and versioning are in sync. The core change is a migration that drops and recreates the corpus_name_embedding_model_lang materialized view with more fields and improved logic, and updates the data model accordingly.

Database migration and schema changes:

Added a new Alembic migration (b049924f7067_modify_corpus_name_embedding_model_lang_) that drops and recreates the corpus_related.corpus_name_embedding_model_lang materialized view. The new view now includes corpus_id, embedding_model_id, used_since, and category_id, and ensures only the latest embedding model per corpus and language is kept using a window function.
Updated the CorpusNameEmbeddingModelLang SQLAlchemy model in corpus_related.py to add new fields: corpus_id, embedding_model_id, used_since, and category_id, matching the new view schema.
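
As a rough illustration of the drop-and-recreate pattern described above, a migration of this shape might look like the sketch below. Only the revision ID, the view name, and the four new column names come from this PR. The source tables (`corpus_related.corpus`, `corpus_related.corpus_embedding_model`, `corpus_related.embedding_model`), the `corpus_name`, `embedding_model_name` and `lang` columns, the join conditions, and the `down_revision` placeholder are assumptions made for illustration, not the contents of the actual migration file. Alembic is imported inside the functions only so the module can be inspected without Alembic installed; a real migration would normally import `op` at module level.

```python
"""Sketch of a drop-and-recreate migration for a materialized view.

Everything except the revision ID, the view name and the four new columns
(corpus_id, embedding_model_id, used_since, category_id) is hypothetical.
"""

revision = "b049924f7067"
down_revision = None  # placeholder: the real parent revision is not shown in the PR

VIEW = "corpus_related.corpus_name_embedding_model_lang"

DROP_VIEW_SQL = f"DROP MATERIALIZED VIEW IF EXISTS {VIEW}"

# Hypothetical source tables and joins; rn = 1 keeps the most recent
# embedding model for each (corpus, language) pair.
CREATE_VIEW_SQL = f"""
CREATE MATERIALIZED VIEW {VIEW} AS
SELECT corpus_id, corpus_name, embedding_model_id, embedding_model_name,
       lang, used_since, category_id
FROM (
    SELECT c.id           AS corpus_id,
           c.source_name  AS corpus_name,
           em.id          AS embedding_model_id,
           em.title       AS embedding_model_name,
           em.lang        AS lang,
           cem.used_since AS used_since,
           c.category_id  AS category_id,
           ROW_NUMBER() OVER (
               PARTITION BY c.id, em.lang
               ORDER BY cem.used_since DESC
           ) AS rn
    FROM corpus_related.corpus AS c
    JOIN corpus_related.corpus_embedding_model AS cem ON cem.corpus_id = c.id
    JOIN corpus_related.embedding_model AS em ON em.id = cem.embedding_model_id
) AS ranked
WHERE rn = 1
"""


def upgrade() -> None:
    from alembic import op  # lazy import, see note above

    # PostgreSQL cannot alter a materialized view's query in place,
    # so the view is dropped and created again with the new projection.
    op.execute(DROP_VIEW_SQL)
    op.execute(CREATE_VIEW_SQL)


def downgrade() -> None:
    from alembic import op  # lazy import, see note above

    # Only the drop is sketched here: the previous view definition is not
    # shown in the PR, so it is not reconstructed.
    op.execute(DROP_VIEW_SQL)
```

Because the view is dropped and recreated, any code that reads it through the SQLAlchemy model needs the matching column list, which is why the model change ships in the same PR.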

Versioning:

Bumped the package version in pyproject.toml from 1.4.0 to 1.4.2 to reflect the schema and model changes.

Copilot

Pull request overview

This PR updates the corpus_related.corpus_name_embedding_model_lang materialized view to expose additional metadata (corpus/model IDs, used_since, category_id) and keep only the latest embedding model per (corpus, language), then aligns the SQLAlchemy read-only model and bumps the package version.

Changes:

Recreates the corpus_name_embedding_model_lang materialized view with extra columns and a ROW_NUMBER()-based “latest per corpus/lang” selection.
Extends the CorpusNameEmbeddingModelLang SQLAlchemy model to match the new view schema.
Bumps project version from 1.4.0 to 1.4.2.
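
The ROW_NUMBER()-based "latest per corpus/lang" selection mentioned above can be tried out in isolation. The sketch below uses an in-memory SQLite database rather than PostgreSQL, and a single hypothetical `corpus_embedding` table with made-up sample rows; it demonstrates the technique only and does not reproduce the project's schema. It assumes a Python build whose bundled SQLite supports window functions (SQLite 3.25 or later).

```python
import sqlite3


def latest_model_per_corpus_lang(rows):
    """Return one (corpus_id, lang, embedding_model_id, used_since) row per
    (corpus_id, lang) pair, keeping the row with the most recent used_since."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE corpus_embedding ("
            "corpus_id TEXT, lang TEXT, embedding_model_id TEXT, used_since TEXT)"
        )
        conn.executemany("INSERT INTO corpus_embedding VALUES (?, ?, ?, ?)", rows)
        query = """
            SELECT corpus_id, lang, embedding_model_id, used_since
            FROM (
                SELECT *,
                       ROW_NUMBER() OVER (
                           PARTITION BY corpus_id, lang
                           ORDER BY used_since DESC
                       ) AS rn
                FROM corpus_embedding
            )
            WHERE rn = 1
            ORDER BY corpus_id, lang
        """
        return conn.execute(query).fetchall()
    finally:
        conn.close()


# Hypothetical sample data: corpus-a switched English models over time.
sample_rows = [
    ("corpus-a", "en", "model-1", "2024-01-01"),
    ("corpus-a", "en", "model-2", "2025-06-01"),
    ("corpus-a", "fr", "model-3", "2024-03-01"),
    ("corpus-b", "en", "model-1", "2023-09-01"),
]

latest = latest_model_per_corpus_lang(sample_rows)
for row in latest:
    print(row)
```

With this sample data, the older `model-1` row for `corpus-a` in English is dropped, while the other pairs each keep their single row.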

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
welearn_database/data/models/corpus_related.py	Adds new fields to the ORM model representing the updated materialized view.
welearn_database/alembic/versions/b049924f7067_modify_corpus_name_embedding_model_lang_.py	Drops/recreates the materialized view with updated projection and “latest per corpus/lang” logic.
pyproject.toml	Version bump to reflect the schema/model change.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

lpi-tn added 3 commits March 31, 2026 15:32

Bump version to 1.4.2 in pyproject.toml

ad2dd5f

Add new fields to Corpus model for embeddings and categorization

a2ded4b

Modify corpus_name_embedding_model_lang view to enhance data retrieval logic

d116bec

lpi-tn requested review from Copilot and sandragjacinto March 31, 2026 14:43

Copilot started reviewing on behalf of lpi-tn March 31, 2026 14:44

sandragjacinto approved these changes Mar 31, 2026

Copilot AI reviewed Mar 31, 2026

Comment thread welearn_database/data/models/corpus_related.py Outdated

Update welearn_database/data/models/corpus_related.py

c30a8b5

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

lpi-tn merged commit 5f7410a into main Apr 29, 2026
1 check passed

lpi-tn deleted the Fix/update-cropus-name-embeddings-model-lang branch April 29, 2026 14:08

lpi-tn temporarily deployed to testpypi April 29, 2026 14:09 — with GitHub Actions Inactive

lpi-tn merged 4 commits into main from Fix/update-cropus-name-embeddings-model-lang
