
do not merge: added load scripts for definitions#3438

Open
Jacobjeevan wants to merge 3 commits into ohcnetwork:develop from Jacobjeevan:definition-load-scripts

Conversation

Contributor

@Jacobjeevan Jacobjeevan commented Dec 29, 2025

Proposed Changes

Converted the frontend (FE) scripts to backend (BE) management commands for loading activity definitions, observation definitions, specimen definitions, and charge item definitions.

Merge Checklist

  • Tests added/fixed
  • Update docs in /docs
  • Linting Complete
  • Any other necessary step

Only PRs with test cases included and passing lint and test pipelines will be reviewed.

@ohcnetwork/care-backend-maintainers @ohcnetwork/care-backend-admins

Summary by CodeRabbit

  • New Features
    • Added bulk import tools for Activity, Charge Item, Observation, and Specimen definitions.
    • Supports multiple data sources (CSV, URL, Google Sheets) with facility scoping.
    • Batch processing with configurable sizes, progress logging, and output CSV reporting per row.
    • Resolves related references, performs optional code validation, and produces a summary of totals/successes/failures.


@Jacobjeevan Jacobjeevan requested a review from a team as a code owner December 29, 2025 11:42
Contributor

coderabbitai Bot commented Dec 29, 2025

📝 Walkthrough

Adds four new Django management commands to import EMR definitions (Activity, Charge Item, Observation, Specimen) from CSV/Google Sheets and a shared utility module for CSV reading, normalization, code validation, slug/category management, and output generation. (Facility scoping, batch processing, and per-row reporting included.)

Changes

Cohort / File(s) Summary
Shared utilities
care/emr/management/commands/load_emr_utils.py
New helper module: CSV/Google Sheets readers, title normalization, slug creation, code parsing/validation and batch validation, category ensure function, and CSV output writer with logging and error handling.
Activity Definition command
care/emr/management/commands/load_activity_definition.py
New management command: loads activity definitions with dependency loading (specimens, observations, charge items), reference resolution, code/category handling, batch processing, per-row error reporting, and output CSV.
Charge Item Definition command
care/emr/management/commands/load_charge_item_definition.py
New management command: parses prices (supports ₹ and commas), derives tax components (5/12/18 → CGST/SGST), resolves categories, creates/reuses ChargeItemDefinition, supports batching and per-row CSV output.
Observation Definition command
care/emr/management/commands/load_observation_definition.py
New management command: validates/normalizes codes (with optional substitution), parses body_site/method/units, handles JSON fields (component, qualified_ranges), creates/updates ObservationDefinition, batch processing, and output CSV including substitutions.
Specimen Definition command
care/emr/management/commands/load_specimen_definition.py
New management command: parses nested specimen fields (container, minimum_volume, retention_time), validates type/container/unit codes, computes facility-scoped slugs, creates/reuses SpecimenDefinition, batch processing, and per-row output CSV.
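
The charge item command's price parsing described above (stripping the ₹ symbol and thousands separators before converting to a float) could look roughly like this sketch; `parse_price` is a hypothetical name for illustration, not the actual helper in the PR:

```python
def parse_price(raw: str) -> float:
    """Parse a price string such as '₹1,250.50' into a float.

    Strips the rupee symbol (U+20B9), thousands separators, and
    surrounding whitespace; raises ValueError if nothing parseable remains.
    """
    cleaned = raw.replace("\u20b9", "").replace(",", "").strip()
    if not cleaned:
        raise ValueError(f"Unparseable price: {raw!r}")
    return float(cleaned)


print(parse_price("₹1,250.50"))  # 1250.5
```

Note this sketch raises on empty input, whereas the review below points out the PR silently falls back to 0.0 in that case.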

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

🚥 Pre-merge checks | ❌ 3
❌ Failed checks (2 warnings, 1 inconclusive)

  • Description check (⚠️ Warning): the description is missing an Associated Issue section, lacks sufficient detail about the changes, and all merge checklist items remain unchecked despite the note stating tests and linting must pass. Resolution: add the Associated Issue section, explain each load script's purpose in more detail, check off completed items, and ensure tests and linting have actually been run and passed.
  • Docstring Coverage (⚠️ Warning): docstring coverage is 72.22%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them to satisfy the coverage threshold.
  • Title check (❓ Inconclusive): the title is vague and generic, using non-descriptive phrasing like "load scripts for definitions" that doesn't convey the primary changes. Resolution: use a more specific title that clearly identifies the main deliverable, such as "Add Django management commands for loading EMR definition data".



@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 6

🧹 Nitpick comments (11)
care/emr/management/commands/load_specimen_definition.py (2)

111-111: Unused facility parameter in process_row.

The facility parameter is declared but never used in this method. Either remove it to fix the linter warning, or document why it's kept for interface consistency with other commands.

🔎 Proposed fix
-    def process_row(self, row: dict, facility: Facility) -> dict:
+    def process_row(self, row: dict) -> dict:

Then update the call site at line 304:

-                        data = self.process_row(row, facility)
+                        data = self.process_row(row)

319-320: Use logger.exception instead of logger.error for proper stack trace logging.

When catching exceptions, logger.exception automatically includes the full stack trace, which is valuable for debugging production issues. The current logger.error only logs the message.

🔎 Proposed fix
                     except Exception as e:
-                        logger.error("Error processing row '%s': %s", row_title, e)
+                        logger.exception("Error processing row '%s'", row_title)
                         failed.append(row_title)
care/emr/management/commands/load_activity_definition.py (2)

106-117: Argument definition for --validate-codes is somewhat unconventional.

Using action="store_true" with default=True is redundant—the flag will be True whether or not it's passed. The --no-validate-codes flag correctly sets it to False. While this works, it might confuse future maintainers. A clearer approach would be to just set the default and rely solely on --no-validate-codes.

🔎 Proposed simplification
         parser.add_argument(
             "--validate-codes",
-            action="store_true",
-            default=True,
-            help="Validate codes against valuesets (default: True)",
-        )
-        parser.add_argument(
-            "--no-validate-codes",
-            action="store_false",
-            dest="validate_codes",
-            help="Skip code validation",
+            action=argparse.BooleanOptionalAction,
+            default=True,
+            help="Validate codes against valuesets (default: True)",
         )

Or simply keep the boolean default and document the negation:

         parser.add_argument(
-            "--validate-codes",
-            action="store_true",
+            "--skip-code-validation",
+            action="store_true",
             default=True,
-            help="Validate codes against valuesets (default: True)",
-        )
-        parser.add_argument(
-            "--no-validate-codes",
-            action="store_false",
-            dest="validate_codes",
-            help="Skip code validation",
+            dest="validate_codes",
+            help="Skip code validation against valuesets",
         )

523-524: Use logger.exception for proper exception logging.

Same issue as in other commands—logger.exception captures the full traceback automatically.

🔎 Proposed fix
                     except Exception as e:
-                        logger.error("Error processing row '%s': %s", row_title, e)
+                        logger.exception("Error processing row '%s'", row_title)
                         failed.append(row_title)
care/emr/management/commands/load_observation_definition.py (3)

130-131: Move import json to the top of the file.

Importing inside a method works but is unconventional and slightly less efficient (the import check runs on every call). Standard practice is to place imports at the module level.

🔎 Proposed fix

Add at the top of the file with other imports:

 import logging
+import json
 from datetime import UTC, datetime

Then remove line 131.


123-125: Unused facility parameter in process_row.

The facility parameter is declared but never referenced within the method body. If it's intentionally kept for API consistency across commands, consider adding a comment or using _ prefix.

🔎 Proposed fix
     def process_row(
-        self, row: dict, facility: Facility | None, validate_codes: bool
+        self, row: dict, validate_codes: bool
     ) -> dict:

Update call site at line 359:

-                        data = self.process_row(row, facility, validate_codes)
+                        data = self.process_row(row, validate_codes)

375-376: Use logger.exception for proper stack trace logging.

Consistent with the other commands, this should use logger.exception to capture full tracebacks.

🔎 Proposed fix
                     except Exception as e:
-                        logger.error("Error processing row '%s': %s", row_title, e)
+                        logger.exception("Error processing row '%s'", row_title)
                         failed.append(row_title)
care/emr/management/commands/load_emr_utils.py (2)

113-128: Consider adding basic retry logic for HTTP requests.

Network requests to Google Sheets or external URLs may experience transient failures. A simple retry with backoff would improve reliability for batch operations.

🔎 Example with tenacity or manual retry
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

def read_csv_from_url(url: str) -> list[dict[str, str]]:
    """Read CSV from a URL with retry support."""
    logger.info("Reading CSV from URL: %s", url)
    
    session = requests.Session()
    retries = Retry(total=3, backoff_factor=0.5, status_forcelist=[500, 502, 503, 504])
    session.mount("https://", HTTPAdapter(max_retries=retries))
    
    response = session.get(url, timeout=30)
    response.raise_for_status()
    # ... rest of the function

204-206: Use logger.exception for proper stack trace capture.

When catching exceptions, logger.exception provides valuable debugging information by including the full traceback.

🔎 Proposed fix
     except Exception as e:
-        logger.error("Error validating code %s: %s", code, e)
+        logger.exception("Error validating code %s", code)
         return False
care/emr/management/commands/load_charge_item_definition.py (2)

189-196: Silent fallback to 0.0 for invalid base prices may mask data issues.

When Base Price cannot be parsed, the code silently defaults to 0.0. This could hide data quality problems in the source CSV. Consider logging a warning so operators are aware of unparseable values.

🔎 Proposed fix
             try:
                 base_price = float(base_price_str)
             except (ValueError, TypeError):
+                logger.warning(
+                    "Invalid base price '%s' for '%s', defaulting to 0.0",
+                    row.get("Base Price"),
+                    row.get("title"),
+                )
                 base_price = 0.0

341-342: Use logger.exception for proper stack trace logging.

Consistent with the pattern across all commands, this should capture full tracebacks.

🔎 Proposed fix
                     except Exception as e:
-                        logger.error("Error processing row '%s': %s", row_title, e)
+                        logger.exception("Error processing row '%s'", row_title)
                         failed.append(row_title)
📜 Review details

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 108fe9e and c18db88.

📒 Files selected for processing (5)
  • care/emr/management/commands/load_activity_definition.py
  • care/emr/management/commands/load_charge_item_definition.py
  • care/emr/management/commands/load_emr_utils.py
  • care/emr/management/commands/load_observation_definition.py
  • care/emr/management/commands/load_specimen_definition.py
🧰 Additional context used
📓 Path-based instructions (2)
**/*.py

📄 CodeRabbit inference engine (.cursorrules)

**/*.py: Prioritize readability and maintainability; follow Django's coding style guide (PEP 8 compliance).
Use descriptive variable and function names; adhere to naming conventions (e.g., lowercase with underscores for functions and variables).

Files:

  • care/emr/management/commands/load_activity_definition.py
  • care/emr/management/commands/load_charge_item_definition.py
  • care/emr/management/commands/load_observation_definition.py
  • care/emr/management/commands/load_specimen_definition.py
  • care/emr/management/commands/load_emr_utils.py
**/{models,views,management/commands}/*.py

📄 CodeRabbit inference engine (.cursorrules)

Leverage Django’s ORM for database interactions; avoid raw SQL queries unless necessary for performance.

Files:

  • care/emr/management/commands/load_activity_definition.py
  • care/emr/management/commands/load_charge_item_definition.py
  • care/emr/management/commands/load_observation_definition.py
  • care/emr/management/commands/load_specimen_definition.py
  • care/emr/management/commands/load_emr_utils.py
🧬 Code graph analysis (2)
care/emr/management/commands/load_activity_definition.py (9)
care/emr/management/commands/load_emr_utils.py (9)
  • create_slug (83-101)
  • ensure_category (241-281)
  • normalize_title (17-80)
  • parse_code (137-160)
  • read_csv_from_file (104-110)
  • read_csv_from_google_sheet (131-134)
  • read_csv_from_url (113-128)
  • validate_and_substitute_code (222-238)
  • write_output_csv (163-185)
care/emr/models/activity_definition.py (1)
  • ActivityDefinition (7-39)
care/emr/models/charge_item_definition.py (1)
  • ChargeItemDefinition (6-24)
care/emr/models/location.py (1)
  • FacilityLocation (12-127)
care/emr/models/observation_definition.py (1)
  • ObservationDefinition (6-27)
care/emr/models/specimen_definition.py (1)
  • SpecimenDefinition (6-23)
care/emr/management/commands/load_charge_item_definition.py (1)
  • handle (280-374)
care/emr/management/commands/load_observation_definition.py (1)
  • handle (308-409)
care/emr/management/commands/load_specimen_definition.py (1)
  • handle (258-352)
care/emr/management/commands/load_emr_utils.py (4)
care/emr/registries/care_valueset/care_valueset.py (1)
  • valueset (45-46)
care/emr/resources/common/coding.py (1)
  • Coding (4-13)
care/emr/models/resource_category.py (1)
  • ResourceCategory (10-86)
care/emr/models/base.py (1)
  • calculate_slug_from_facility (39-40)
🪛 GitHub Actions: Lint Code Base
care/emr/management/commands/load_activity_definition.py

[error] 436-436: PLR0915 Too many statements (59 > 50)

care/emr/management/commands/load_observation_definition.py

[error] 123-123: PLR0912 Too many branches (15 > 12)


[error] 123-123: PLR0915 Too many statements (53 > 50)

care/emr/management/commands/load_specimen_definition.py

[error] 111-111: PLR0912 Too many branches (12 > 12)


[error] 123-123: PLR0915 Too many statements (54 > 50)

care/emr/management/commands/load_emr_utils.py

[error] 97-97: PLR2004 Magic value used in comparison, consider replacing 25 with a constant variable

🪛 Ruff (0.14.10)
care/emr/management/commands/load_activity_definition.py

149-149: Avoid specifying long messages outside the exception class

(TRY003)


223-223: Abstract raise to an inner function

(TRY301)


223-223: Avoid specifying long messages outside the exception class

(TRY003)


428-428: Consider moving this statement to an else block

(TRY300)


436-436: Unused method argument: args

(ARG002)


523-523: Do not catch blind exception: Exception

(BLE001)


524-524: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

care/emr/management/commands/load_charge_item_definition.py

95-95: Avoid specifying long messages outside the exception class

(TRY003)


187-187: Abstract raise to an inner function

(TRY301)


187-187: Avoid specifying long messages outside the exception class

(TRY003)


272-272: Consider moving this statement to an else block

(TRY300)


280-280: Unused method argument: args

(ARG002)


341-341: Do not catch blind exception: Exception

(BLE001)


342-342: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

care/emr/management/commands/load_observation_definition.py

121-121: Avoid specifying long messages outside the exception class

(TRY003)


124-124: Unused method argument: facility

(ARG002)


134-134: Abstract raise to an inner function

(TRY301)


134-134: Avoid specifying long messages outside the exception class

(TRY003)


300-300: Consider moving this statement to an else block

(TRY300)


308-308: Unused method argument: args

(ARG002)


375-375: Do not catch blind exception: Exception

(BLE001)


376-376: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

care/emr/management/commands/load_specimen_definition.py

109-109: Avoid specifying long messages outside the exception class

(TRY003)


111-111: Unused method argument: facility

(ARG002)


118-118: Abstract raise to an inner function

(TRY301)


118-118: Avoid specifying long messages outside the exception class

(TRY003)


126-126: Abstract raise to an inner function

(TRY301)


126-126: Avoid specifying long messages outside the exception class

(TRY003)


250-250: Consider moving this statement to an else block

(TRY300)


258-258: Unused method argument: args

(ARG002)


319-319: Do not catch blind exception: Exception

(BLE001)


320-320: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

care/emr/management/commands/load_emr_utils.py

204-204: Do not catch blind exception: Exception

(BLE001)


205-205: Use logging.exception instead of logging.error

Replace with exception

(TRY400)


277-277: Consider moving this statement to an else block

(TRY300)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
  • GitHub Check: Test / Test
🔇 Additional comments (4)
care/emr/management/commands/load_activity_definition.py (1)

151-188: Dependency loading silently proceeds even if called commands fail.

If any of the call_command invocations raise an exception (e.g., due to invalid CSV data), the exception will propagate and halt the main command. However, if the dependency commands log errors but return normally (with failures), the main command proceeds unaware that some dependencies may be missing. This could lead to confusing cascading "missing dependency" errors later.

Consider capturing and surfacing dependency loading results, or at least documenting this behavior.
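
One way to surface such failures (a sketch only, not the PR's code: `run_loaders` and the per-loader result shape are assumptions) is to have each dependency loader report its failed rows and let the orchestrator aggregate them before proceeding:

```python
from typing import Callable


def run_loaders(loaders: dict[str, Callable[[], list[str]]]) -> dict[str, list[str]]:
    """Run each dependency loader and collect per-loader failures.

    Each loader returns a list of failed row identifiers; an exception
    is recorded as a single synthetic failure instead of halting the run.
    """
    failures: dict[str, list[str]] = {}
    for name, loader in loaders.items():
        try:
            failed_rows = loader()
        except Exception as exc:  # surface the crash, don't halt the batch
            failed_rows = [f"loader crashed: {exc}"]
        if failed_rows:
            failures[name] = failed_rows
    return failures


# The caller can then refuse to continue when dependencies are incomplete:
failures = run_loaders({"specimens": lambda: [], "observations": lambda: ["row-3"]})
print(failures)  # {'observations': ['row-3']}
```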

care/emr/management/commands/load_emr_utils.py (1)

17-80: LGTM on normalize_title implementation.

The title normalization logic is thorough—handling spacing, punctuation, and preserving medical abbreviations. The set of uppercase words covers common medical terms nicely.

care/emr/management/commands/load_charge_item_definition.py (2)

97-178: Tax components structure looks well-designed for Indian GST.

The CGST/SGST split at half the total rate (2.5%/2.5% for 5%, 6%/6% for 12%, 9%/9% for 18%) correctly reflects Indian GST regulations. The warning for unknown tax rates is helpful.


1-374: Overall: Well-structured command that follows established patterns.

The command follows the same consistent patterns as the other loaders, making the codebase predictable and maintainable. The tax handling logic is thoughtful and the batch processing implementation is solid.

Comment on lines +436 to +559
def handle(self, *args, **options):
    start_time = datetime.now(tz=UTC)

    # Set logging level
    if options["verbosity"] == 0:
        logger.setLevel(logging.ERROR)
    elif options["verbosity"] == 1:
        logger.setLevel(logging.INFO)
    else:
        logger.setLevel(logging.DEBUG)

    try:
        # Get facility
        facility = Facility.objects.get(external_id=options["facility"])
        logger.info("Loading activities for facility: %s", facility.name)

        # Load dependencies first
        self.load_dependencies(facility, options)

        # Load activity data
        logger.info("\n=== Loading Activity Definitions ===")
        rows = self.load_data(options)
        logger.info("Loaded %d rows from source", len(rows))

        if not rows:
            self.stdout.write(self.style.WARNING("No rows found in source"))
            return

        # Process rows in batches
        batch_size = options["batch_size"]
        validate_codes = options["validate_codes"]
        total_rows = len(rows)
        successful = []
        failed = []
        output_rows = []

        for i in range(0, total_rows, batch_size):
            batch = rows[i : i + batch_size]
            batch_num = (i // batch_size) + 1
            total_batches = (total_rows + batch_size - 1) // batch_size

            logger.info(
                "Processing batch %d/%d (rows %d-%d)",
                batch_num,
                total_batches,
                i + 1,
                min(i + batch_size, total_rows),
            )

            for row in batch:
                row_title = row.get("title", "Unknown")
                slug_value = ""

                try:
                    data = self.process_row(row, facility, validate_codes, None)
                    slug_value = data["slug_value"]

                    data, missing = self.resolve_dependencies(data, facility)

                    if missing:
                        error_msg = f"Missing dependencies: {', '.join(missing)}"
                        logger.warning("Skipping %s: %s", data["title"], error_msg)
                        failed.append(slug_value)
                        output_rows.append(
                            {
                                "title": data["title"],
                                "slug_value": slug_value,
                                "status": "Failed",
                                "error": error_msg,
                                "code_substitutions": data.get("substitutions", ""),
                            }
                        )
                        continue

                    self.create_activity_definition(data, facility, None)

                    successful.append(slug_value)
                    output_rows.append(
                        {
                            "title": data["title"],
                            "slug_value": slug_value,
                            "status": "Success",
                            "error": "",
                            "code_substitutions": data.get("substitutions", ""),
                        }
                    )

                except Exception as e:
                    logger.error("Error processing row '%s': %s", row_title, e)
                    failed.append(row_title)
                    output_rows.append(
                        {
                            "title": row_title,
                            "slug_value": slug_value,
                            "status": "Failed",
                            "error": str(e),
                            "code_substitutions": "",
                        }
                    )

        # Write output CSV
        output_path = options.get("output") or default_output_path
        if output_path:
            write_output_csv(
                output_path,
                output_rows,
                ["title", "slug_value", "status", "error", "code_substitutions"],
            )

        # Print summary
        self.stdout.write("\n=== Summary ===")
        self.stdout.write(f"Total rows: {total_rows}")
        self.stdout.write(self.style.SUCCESS(f"Successful: {len(successful)}"))
        self.stdout.write(self.style.ERROR(f"Failed: {len(failed)}"))
        self.stdout.write(f"Time taken: {datetime.now(tz=UTC) - start_time}")
        self.stdout.write(
            self.style.SUCCESS("Activity definitions loaded successfully")
        )

    except Exception as e:
        logger.exception("Error in main process")
        error_message = f"Error in main process: {e}"
        self.stdout.write(self.style.ERROR(error_message))
        raise

⚠️ Potential issue | 🟡 Minor

Pipeline failure: handle method exceeds statement limit (59 > 50).

This mirrors the pattern in the other loader commands. Consider extracting batch processing, dependency loading orchestration, and summary printing into helper methods. This will make the code more maintainable and resolve the linter error.
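
The batch-splitting portion of handle could be pulled into a small helper along these lines (a sketch only, assuming per-row work is injected as a callable; `process_batches` is a hypothetical name, not code from the PR):

```python
def process_batches(rows, batch_size, process_row):
    """Split rows into batches and apply process_row to each row.

    Returns (successful, failed) lists of row titles, so the calling
    handle() only has to write the output CSV and print the summary.
    """
    successful, failed = [], []
    total_batches = (len(rows) + batch_size - 1) // batch_size
    for i in range(0, len(rows), batch_size):
        batch = rows[i : i + batch_size]
        print(f"Processing batch {i // batch_size + 1}/{total_batches}")
        for row in batch:
            title = row.get("title", "Unknown")
            try:
                process_row(row)
                successful.append(title)
            except Exception:  # keep going; report failures at the end
                failed.append(title)
    return successful, failed


ok, bad = process_batches(
    [{"title": "A"}, {"title": "B"}],
    batch_size=1,
    process_row=lambda row: None,
)
print(ok, bad)  # ['A', 'B'] []
```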

🧰 Tools
🪛 GitHub Actions: Lint Code Base

[error] 436-436: PLR0915 Too many statements (59 > 50)

🪛 Ruff (0.14.10)

436-436: Unused method argument: args

(ARG002)


523-523: Do not catch blind exception: Exception

(BLE001)


524-524: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

🤖 Prompt for AI Agents
In care/emr/management/commands/load_activity_definition.py around lines
436-559, the handle method is too large (59 statements) and should be split to
satisfy the linter: extract (1) dependency/load orchestration into a helper like
load_all_dependencies(self, facility, options), (2) batch processing of rows
into process_batches(self, rows, facility, validate_codes, batch_size) which
returns (successful, failed, output_rows), and (3) summary/output writing into
write_summary_and_output(self, start_time, total_rows, successful, failed,
output_rows, options). Replace the inlined blocks with calls to these helpers,
ensure params passed (options, facility, rows, validate_codes, batch_size,
start_time), preserve logging and exception handling in handle (catch top-level
exceptions and re-raise) and keep helper methods unit-testable and small to
reduce statements in handle.

Comment on lines +83 to +101
def create_slug(name: str) -> str:
    """
    Create a slug from a name.
    Matches the TypeScript createSlug function.
    """
    if not name:
        return ""

    slug = name.lower()
    slug = re.sub(r"[^a-z0-9\s_-]", "", slug)
    slug = re.sub(r"\s+", "-", slug)
    slug = re.sub(r"-+", "-", slug)
    slug = slug.strip()
    slug = slug[:25]
    if len(slug) < 25:
        hash_suffix = hashlib.sha256(slug.encode()).hexdigest()
        needed_hash = 25 - len(slug) - 1
        slug = slug + "-" + hash_suffix[:needed_hash]
    return slug

⚠️ Potential issue | 🟡 Minor

Extract magic number 25 to a named constant.

The pipeline reports PLR2004 for the magic value. While the comment explains this matches a TypeScript function, a named constant would improve clarity and satisfy the linter.

🔎 Proposed fix
+SLUG_MAX_LENGTH = 25
+
+
 def create_slug(name: str) -> str:
     """
     Create a slug from a name.
     Matches the TypeScript createSlug function.
     """
     if not name:
         return ""

     slug = name.lower()
     slug = re.sub(r"[^a-z0-9\s_-]", "", slug)
     slug = re.sub(r"\s+", "-", slug)
     slug = re.sub(r"-+", "-", slug)
     slug = slug.strip()
-    slug = slug[:25]
-    if len(slug) < 25:
+    slug = slug[:SLUG_MAX_LENGTH]
+    if len(slug) < SLUG_MAX_LENGTH:
         hash_suffix = hashlib.sha256(slug.encode()).hexdigest()
-        needed_hash = 25 - len(slug) - 1
+        needed_hash = SLUG_MAX_LENGTH - len(slug) - 1
         slug = slug + "-" + hash_suffix[:needed_hash]
     return slug
🧰 Tools
🪛 GitHub Actions: Lint Code Base

[error] 97-97: PLR2004 Magic value used in comparison, consider replacing 25 with a constant variable

🤖 Prompt for AI Agents
In care/emr/management/commands/load_emr_utils.py around lines 83 to 101, the
repeated literal 25 used to limit slug length should be extracted to a
module-level named constant (e.g., MAX_SLUG_LENGTH = 25) to remove the magic
number; define the constant near the top of the file, replace all occurrences of
25 in this function with that constant (including the slice, the length
comparison, and the hash suffix calculation), and ensure the needed_hash
calculation uses MAX_SLUG_LENGTH so behavior remains identical.

Comment on lines +241 to +281
def ensure_category(category_name: str, facility, resource_type: str, created_by=None):
    """
    Ensure a ResourceCategory exists for the given name, if not create it.
    Returns the ResourceCategory object.
    Raises exceptions on database or validation errors.
    """
    from care.emr.models.resource_category import ResourceCategory

    try:
        category_title = normalize_title(category_name)
        category_slug_value = create_slug(category_name)
        category_slug = ResourceCategory.calculate_slug_from_facility(
            str(facility.external_id), category_slug_value
        )

        # Check if exists
        category = ResourceCategory.objects.filter(
            slug=category_slug, facility=facility
        ).first()

        if category:
            return category

        # Create new category
        category = ResourceCategory(
            facility=facility,
            resource_type=resource_type,
            resource_sub_type="other",
            title=category_title,
            slug=category_slug,
            description=f"Auto-generated category for {category_title}",
            created_by=created_by,
            updated_by=created_by,
        )
        category.save()
        logger.info("Created category: %s", category_title)
        return category

    except Exception as e:
        error_message = f"Failed to ensure category '{category_name}': {e}"
        raise RuntimeError(error_message) from e

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

rg -n "resource_type" care/emr/models/resource_category.py -A 2 -B 2

Repository: ohcnetwork/care

Length of output: 268


🏁 Script executed:

cat care/emr/models/resource_category.py

Repository: ohcnetwork/care

Length of output: 3025


🏁 Script executed:

# Search for resource_type constant definitions or usage patterns
rg -n "resource_type\s*=" care/emr --type py | head -20

Repository: ohcnetwork/care

Length of output: 2198


🏁 Script executed:

# Also check for any choices or constants defined for resource types
rg -n "RESOURCE_TYPE|resource_type.*choice" care/emr --type py -i

Repository: ohcnetwork/care

Length of output: 28120


🏁 Script executed:

rg -n "ResourceCategoryResourceTypeOptions" care/emr --type py

Repository: ohcnetwork/care

Length of output: 189


🏁 Script executed:

cat care/emr/resources/resource_category/spec.py

Repository: ohcnetwork/care

Length of output: 1829


Validate resource_type against ResourceCategoryResourceTypeOptions enum.

The function accepts any resource_type string, but ResourceCategoryResourceTypeOptions defines only three valid values: product_knowledge, activity_definition, and charge_item_definition. Validating the input against this enum would catch typos or misconfiguration early.
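
A minimal shape for that early check (illustrative only: the real enum lives in the care.emr codebase, and the stand-in class here copies only the member names quoted in the review above):

```python
from enum import Enum


class ResourceCategoryResourceTypeOptions(Enum):
    # Stand-in for the project's enum; values taken from the review above.
    product_knowledge = "product_knowledge"
    activity_definition = "activity_definition"
    charge_item_definition = "charge_item_definition"


def validate_resource_type(resource_type: str) -> str:
    """Raise early if resource_type is not a known enum value."""
    valid = {option.value for option in ResourceCategoryResourceTypeOptions}
    if resource_type not in valid:
        msg = f"Invalid resource_type {resource_type!r}; expected one of {sorted(valid)}"
        raise ValueError(msg)
    return resource_type


print(validate_resource_type("activity_definition"))  # activity_definition
```

Running this check at the top of ensure_category would turn a typo into an immediate, descriptive error instead of a silently miscategorized record.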

🧰 Tools
🪛 Ruff (0.14.10)

277-277: Consider moving this statement to an else block

(TRY300)

🤖 Prompt for AI Agents
In care/emr/management/commands/load_emr_utils.py around lines 241 to 281, the
function ensure_category accepts any resource_type string but must validate it
against ResourceCategoryResourceTypeOptions; import
ResourceCategoryResourceTypeOptions from care.emr.models.resource_category (or
its module), perform a validation check early (before creating or querying
ResourceCategory) to ensure resource_type is one of the enum's allowed values,
and if not raise a clear ValueError (or RuntimeError consistent with surrounding
code) with a message listing valid options; keep the rest of the flow unchanged
so validated resource_type is used for the query/creation.

Comment on lines +123 to +252
def process_row(
    self, row: dict, facility: Facility | None, validate_codes: bool
) -> dict:
    """
    Process a single CSV row into an ObservationDefinition data dict.
    Raises exceptions with descriptive messages on errors.
    """
    try:
        import json

        if not row.get("title"):
            raise ValueError("Missing required field: title")

        code_value = row.get("code_value")
        code_system = row.get("code_system", "http://loinc.org")
        code_display = row.get("code_display")

        # Default code for observation
        default_code = {
            "code": "104922-0",
            "system": "http://loinc.org",
            "display": "Laboratory test details panel",
        }

        substitution_messages = []

        # Validate code if requested
        if validate_codes and code_value:
            code, sub_msg = validate_and_substitute_code(
                code_value,
                code_system,
                "system-observation",
                default_code,
            )
            if sub_msg:
                substitution_messages.append(f"code: {sub_msg}")
        else:
            code = parse_code(code_value, code_system, code_display) or default_code

        body_site = parse_code(
            row.get("body_site_code"),
            row.get("body_site_system", "http://snomed.info/sct"),
            row.get("body_site_display"),
        )

        # Parse optional method
        method_code = row.get("method_code")
        method_system = row.get(
            "method_system",
            "http://terminology.hl7.org/CodeSystem/observation-methods",
        )
        method_display = row.get("method_display")

        default_method = {
            "code": "386053000",
            "system": "http://snomed.info/sct",
            "display": "Technique",
        }

        if validate_codes and method_code:
            method, sub_msg = validate_and_substitute_code(
                method_code,
                method_system,
                "system-collection-method",
                default_method,
            )
            if sub_msg:
                substitution_messages.append(f"method: {sub_msg}")
        else:
            method = (
                parse_code(method_code, method_system, method_display)
                if method_code
                else None
            )

        permitted_unit = parse_code(
            row.get("permitted_unit_code"),
            row.get("permitted_unit_system", "http://unitsofmeasure.org"),
            row.get("permitted_unit_display"),
        )

        component = []
        if row.get("component"):
            try:
                component = json.loads(row["component"])
                if not isinstance(component, list):
                    component = []
            except (json.JSONDecodeError, TypeError):
                logger.warning("Invalid component JSON for %s", row.get("title"))

        qualified_ranges = []
        if row.get("qualified_ranges"):
            try:
                qualified_ranges = json.loads(row["qualified_ranges"])
                if not isinstance(qualified_ranges, list):
                    qualified_ranges = []
            except (json.JSONDecodeError, TypeError):
                logger.warning(
                    "Invalid qualified_ranges JSON for %s", row.get("title")
                )

        title = normalize_title(row["title"])
        slug_value = create_slug(title)

        return {
            "title": title,
            "slug_value": slug_value,
            "status": row.get("status", "active"),
            "description": row.get("description", ""),
            "category": row.get("category", "laboratory"),
            "code": code,
            "permitted_data_type": row.get("permitted_data_type", "string"),
            "body_site": body_site,
            "method": method,
            "permitted_unit": permitted_unit,
            "derived_from_uri": row.get("derived_from_uri", ""),
            "component": component,
            "qualified_ranges": qualified_ranges,
            "substitutions": "; ".join(substitution_messages)
if substitution_messages
else "",
}

except (KeyError, ValueError) as e:
error_message = f"Failed to process row: {e}"
raise ValueError(error_message) from e
except Exception as e:
error_message = f"Unexpected error processing row: {e}"
raise RuntimeError(error_message) from e


⚠️ Potential issue | 🟡 Minor

Pipeline failure: process_row exceeds complexity limits.

The linter reports PLR0912 (too many branches: 15 > 12) and PLR0915 (too many statements: 53 > 50). Consider extracting JSON parsing, code validation, and data assembly into separate helper methods.

🔎 Suggested extraction points
def _parse_json_field(self, row: dict, field_name: str, title: str) -> list:
    """Parse optional JSON field, returning empty list on failure."""
    if not row.get(field_name):
        return []
    try:
        parsed = json.loads(row[field_name])
        return parsed if isinstance(parsed, list) else []
    except (json.JSONDecodeError, TypeError):
        logger.warning("Invalid %s JSON for %s", field_name, title)
        return []

def _validate_observation_code(
    self, code_value: str | None, code_system: str, code_display: str | None, 
    validate_codes: bool, default_code: dict
) -> tuple[dict, list[str]]:
    """Validate and return code with any substitution messages."""
    # extraction of lines 149-160
    ...
🧰 Tools
🪛 GitHub Actions: Lint Code Base

[error] 123-123: PLR0912 Too many branches (15 > 12)


[error] 123-123: PLR0915 Too many statements (53 > 50)

🪛 Ruff (0.14.10)

124-124: Unused method argument: facility

(ARG002)


134-134: Abstract raise to an inner function

(TRY301)


134-134: Avoid specifying long messages outside the exception class

(TRY003)

🤖 Prompt for AI Agents
In care/emr/management/commands/load_observation_definition.py around lines 123
to 252, the process_row method is over the PLR complexity/statement limits;
extract the JSON parsing blocks, the code/method validation logic, and the final
data-assembly into small helper methods to reduce branches and statements.
Implement a _parse_json_field(self, row, field_name, title) that returns a list
(empty on missing/invalid JSON) and logs the same warnings, and a
_validate_observation_code(self, code_value, code_system, code_display,
validate_codes, default_code) that encapsulates the validate_and_substitute_code
vs parse_code logic and returns (code_dict, substitution_message_or_empty). Then
replace the inline blocks for component, qualified_ranges, code, and method with
calls to those helpers and build the returned dict using a short, single
assembly block; keep exception handling but remove duplicated logic so
process_row stays under the branch/statement thresholds.

Comment on lines +185 to +190
type_tested = {
"is_derived": bool(row.get("is_derived", False)),
"preference": row.get("preference", "preferred"),
"retention_time": retention_time,
"single_use": bool(row.get("single_use", True)),
}

⚠️ Potential issue | 🟠 Major

Boolean coercion from CSV strings will not behave as expected.

CSV values are strings, so bool(row.get("is_derived", False)) will return True for any non-empty string including "False", "false", or "0". This could lead to incorrect data being stored.

🔎 Proposed fix
+def parse_bool(value, default: bool = False) -> bool:
+    """Parse boolean from CSV string."""
+    if isinstance(value, bool):
+        return value
+    if not value:
+        return default
+    return str(value).lower() in ("true", "1", "yes")
+
             type_tested = {
-                "is_derived": bool(row.get("is_derived", False)),
+                "is_derived": parse_bool(row.get("is_derived"), False),
                 "preference": row.get("preference", "preferred"),
                 "retention_time": retention_time,
-                "single_use": bool(row.get("single_use", True)),
+                "single_use": parse_bool(row.get("single_use"), True),
             }
🤖 Prompt for AI Agents
In care/emr/management/commands/load_specimen_definition.py around lines 185 to
190, the code uses bool(row.get(...)) which treats any non-empty CSV string
(e.g. "False") as True; change the boolean coercion to parse the CSV string
explicitly by reading the raw value, handling None, trimming and lowercasing it,
and then comparing against an explicit truth set (e.g.
{"true","1","yes","y","t"}) for both is_derived and single_use so
"false"/"0"/empty string become False while valid truthy strings become True.

Comment on lines +258 to +351
def handle(self, *args, **options):
start_time = datetime.now(tz=UTC)

# Set logging level
if options["verbosity"] == 0:
logger.setLevel(logging.ERROR)
elif options["verbosity"] == 1:
logger.setLevel(logging.INFO)
else:
logger.setLevel(logging.DEBUG)

try:
facility = Facility.objects.get(external_id=options["facility"])
logger.info("Loading specimens for facility: %s", facility.name)

rows = self.load_data(options)
logger.info("Loaded %d rows from source", len(rows))

if not rows:
self.stdout.write(self.style.WARNING("No rows found in source"))
return

batch_size = options["batch_size"]
total_rows = len(rows)
successful = []
failed = []
output_rows = []

for i in range(0, total_rows, batch_size):
batch = rows[i : i + batch_size]
batch_num = (i // batch_size) + 1
total_batches = (total_rows + batch_size - 1) // batch_size

logger.info(
"Processing batch %d/%d (rows %d-%d)",
batch_num,
total_batches,
i + 1,
min(i + batch_size, total_rows),
)

for row in batch:
row_title = row.get("title", "Unknown")
slug_value = ""

try:
data = self.process_row(row, facility)
slug_value = data["slug_value"]

self.create_specimen_definition(data, facility, None)

successful.append(slug_value)
output_rows.append(
{
"title": data["title"],
"slug_value": slug_value,
"status": "Success",
"error": "",
}
)

except Exception as e:
logger.error("Error processing row '%s': %s", row_title, e)
failed.append(row_title)
output_rows.append(
{
"title": row_title,
"slug_value": slug_value,
"status": "Failed",
"error": str(e),
}
)

output_path = options.get("output") or default_output_path
if output_path:
write_output_csv(
output_path,
output_rows,
["title", "slug_value", "status", "error"],
)

self.stdout.write("\n=== Summary ===")
self.stdout.write(f"Total rows: {total_rows}")
self.stdout.write(self.style.SUCCESS(f"Successful: {len(successful)}"))
self.stdout.write(self.style.ERROR(f"Failed: {len(failed)}"))
self.stdout.write(f"Time taken: {datetime.now(tz=UTC) - start_time}")
self.stdout.write(
self.style.SUCCESS("Specimen definitions loaded successfully")
)

except Exception as e:
logger.exception("Error in main process")
error_message = f"Error in main process: {e}"
self.stdout.write(self.style.ERROR(error_message))

⚠️ Potential issue | 🟡 Minor

Pipeline failure: handle method exceeds complexity limits.

The linter reports PLR0912 (too many branches) and PLR0915 (too many statements). Consider extracting the batch processing loop and summary output into separate helper methods to reduce complexity and improve readability.

🔎 Suggested refactor approach
def _process_batch(self, batch: list[dict], facility: Facility) -> tuple[list, list, list]:
    """Process a batch of rows. Returns (successful, failed, output_rows)."""
    successful = []
    failed = []
    output_rows = []
    
    for row in batch:
        row_title = row.get("title", "Unknown")
        slug_value = ""
        try:
            data = self.process_row(row)
            slug_value = data["slug_value"]
            self.create_specimen_definition(data, facility, None)
            successful.append(slug_value)
            output_rows.append({
                "title": data["title"],
                "slug_value": slug_value,
                "status": "Success",
                "error": "",
            })
        except Exception as e:
            logger.exception("Error processing row '%s'", row_title)
            failed.append(row_title)
            output_rows.append({
                "title": row_title,
                "slug_value": slug_value,
                "status": "Failed",
                "error": str(e),
            })
    
    return successful, failed, output_rows

def _print_summary(self, total_rows: int, successful: list, failed: list, start_time):
    """Print final summary."""
    self.stdout.write("\n=== Summary ===")
    self.stdout.write(f"Total rows: {total_rows}")
    self.stdout.write(self.style.SUCCESS(f"Successful: {len(successful)}"))
    self.stdout.write(self.style.ERROR(f"Failed: {len(failed)}"))
    self.stdout.write(f"Time taken: {datetime.now(tz=UTC) - start_time}")
    self.stdout.write(self.style.SUCCESS("Specimen definitions loaded successfully"))
🧰 Tools
🪛 Ruff (0.14.10)

258-258: Unused method argument: args

(ARG002)


319-319: Do not catch blind exception: Exception

(BLE001)


320-320: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

🤖 Prompt for AI Agents
In care/emr/management/commands/load_specimen_definition.py around lines
258-351, the handle method is too large/complex (PLR0912/PLR0915); extract the
inner batch loop into a helper like _process_batch(self, batch: list[dict],
facility: Facility) -> tuple[list, list, list] that processes a single batch,
returns (successful, failed, output_rows) and uses logger.exception for row
errors, and extract the final printing into _print_summary(self, total_rows:
int, successful: list, failed: list, start_time); update handle to iterate
batches, call _process_batch for each batch and aggregate returned lists, then
call _print_summary at the end and keep CSV output logic unchanged.

@codecov

codecov Bot commented Jan 15, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.82%. Comparing base (d656583) to head (245c892).
⚠️ Report is 6 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #3438      +/-   ##
===========================================
- Coverage    74.94%   74.82%   -0.13%     
===========================================
  Files          460      465       +5     
  Lines        20914    21112     +198     
  Branches      2159     2182      +23     
===========================================
+ Hits         15674    15797     +123     
- Misses        4766     4835      +69     
- Partials       474      480       +6     

☔ View full report in Codecov by Sentry.


@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 4

🤖 Fix all issues with AI agents
In `@care/emr/management/commands/load_activity_definition.py`:
- Around line 321-376: The resolve_dependencies function currently resolves
observation, specimen, and charge_item slugs but omits resolving locations;
update resolve_dependencies to iterate over data["location_names"] (or the field
used for incoming location references), call
lookup_locations(str(facility.external_id), name) or the existing
lookup_locations helper to translate each location name into its model id,
collect found ids into data["location_ids"], and append missing names as
"location:<name>" to the missing list (use the same Facility instance and
consistent slug/id lookup pattern as for
ObservationDefinition/SpecimenDefinition/ChargeItemDefinition).
- Around line 189-212: The lookup_locations method is never used so location
names from CSV are being stored instead of IDs; update the row processing to
call lookup_locations and pass the resulting IDs into
create_activity_definition: call self.lookup_locations(location_names, facility)
(e.g., in resolve_dependencies after dependency resolution or in process_row
before invoking create_activity_definition), replace the raw location_names with
the returned location_ids when building the create_activity_definition payload,
and handle the missing names tuple (log or surface errors) so FacilityLocation
IDs (not strings) are stored in the locations ArrayField.
- Around line 275-279: The variable location_ids is misnamed because it holds
CSV location names; rename it to location_names and update all usages
accordingly (the assignment from row["locations"], any checks, and places that
expect names). Ensure that when location_names is non-empty you call
lookup_locations(location_names) to resolve names to actual IDs and replace any
downstream logic that assumed IDs to use the resolved IDs. Update variable names
in this file consistently (e.g., from location_ids to location_names and then to
resolved_location_ids after lookup) so intent is clear.
- Around line 482-502: The loader always passes None for created_by (see
process_row and create_activity_definition), which prevents audit trails; update
the command to accept a --user argument (or resolve a system/import user via a
helper like get_system_user()) and set a local created_by variable that is
passed into process_row(...) and create_activity_definition(...); apply the same
change pattern to the other loaders (load_specimen_definition.py,
load_observation_definition.py, load_charge_item_definition.py) or, if None is
intentional, add a clear comment explaining why audit fields are intentionally
omitted during bulk imports.
♻️ Duplicate comments (1)
care/emr/management/commands/load_activity_definition.py (1)

428-551: Pipeline failure: handle method exceeds statement limit (59 > 50).

This has been flagged previously. Extract batch processing, dependency orchestration, and summary output into helper methods to satisfy the linter and improve maintainability.

🧹 Nitpick comments (2)
care/emr/management/commands/load_activity_definition.py (2)

106-111: Minor: action="store_true" combined with default=True makes the flag a no-op.

With action="store_true" and default=True, the option evaluates to True whether or not --validate-codes is passed, so validation can never be switched off from the CLI. argparse.BooleanOptionalAction (Python 3.9+, requires import argparse) keeps the default while adding a matching --no-validate-codes switch. Avoid type=bool here: argparse would then treat any non-empty value, including "False", as True.

🔧 Suggested simplification
         parser.add_argument(
             "--validate-codes",
-            action="store_true",
+            action=argparse.BooleanOptionalAction,
             default=True,
             help="Validate codes against valuesets (default: True)",
         )

515-516: Use logging.exception instead of logging.error for exception logging.

When logging within an exception handler, logging.exception automatically includes the stack trace, which is more helpful for debugging; the exception text then no longer needs to be interpolated into the message.

                     except Exception as e:
-                        logger.error("Error processing row '%s': %s", row_title, e)
+                        logger.exception("Error processing row '%s'", row_title)
📜 Review details

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c18db88 and 245c892.

📒 Files selected for processing (1)
  • care/emr/management/commands/load_activity_definition.py
🧰 Additional context used
📓 Path-based instructions (2)
**/*.py

📄 CodeRabbit inference engine (.cursorrules)

**/*.py: Prioritize readability and maintainability; follow Django's coding style guide (PEP 8 compliance).
Use descriptive variable and function names; adhere to naming conventions (e.g., lowercase with underscores for functions and variables).

Files:

  • care/emr/management/commands/load_activity_definition.py
**/{models,views,management/commands}/*.py

📄 CodeRabbit inference engine (.cursorrules)

Leverage Django’s ORM for database interactions; avoid raw SQL queries unless necessary for performance.

Files:

  • care/emr/management/commands/load_activity_definition.py
🧬 Code graph analysis (1)
care/emr/management/commands/load_activity_definition.py (11)
care/emr/management/commands/load_emr_utils.py (9)
  • create_slug (83-101)
  • ensure_category (241-281)
  • normalize_title (17-80)
  • parse_code (137-160)
  • read_csv_from_file (104-110)
  • read_csv_from_google_sheet (131-134)
  • read_csv_from_url (113-128)
  • validate_and_substitute_code (222-238)
  • write_output_csv (163-185)
care/emr/models/activity_definition.py (1)
  • ActivityDefinition (7-39)
care/emr/models/charge_item_definition.py (1)
  • ChargeItemDefinition (6-24)
care/emr/models/location.py (1)
  • FacilityLocation (12-127)
care/emr/models/observation_definition.py (1)
  • ObservationDefinition (6-27)
care/emr/models/specimen_definition.py (1)
  • SpecimenDefinition (6-23)
care/facility/models/facility.py (1)
  • Facility (152-274)
care/emr/management/commands/load_charge_item_definition.py (1)
  • Command (34-374)
care/emr/management/commands/load_observation_definition.py (1)
  • Command (35-409)
care/emr/management/commands/load_specimen_definition.py (1)
  • Command (34-352)
care/emr/models/base.py (1)
  • calculate_slug_from_facility (39-40)
🪛 GitHub Actions: Lint Code Base
care/emr/management/commands/load_activity_definition.py

[error] 428-428: PLR0915 Too many statements (59 > 50)

🪛 Ruff (0.14.11)
care/emr/management/commands/load_activity_definition.py

149-149: Avoid specifying long messages outside the exception class

(TRY003)


223-223: Abstract raise to an inner function

(TRY301)


223-223: Avoid specifying long messages outside the exception class

(TRY003)


420-420: Consider moving this statement to an else block

(TRY300)


428-428: Unused method argument: args

(ARG002)


515-515: Do not catch blind exception: Exception

(BLE001)


516-516: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
  • GitHub Check: Test / Test
🔇 Additional comments (4)
care/emr/management/commands/load_activity_definition.py (4)

1-41: LGTM on imports and module setup.

The docstring provides clear usage examples, and the imports are properly organized. The path calculation for default output is reasonable.


139-149: LGTM on data loading logic.

The method correctly handles all three source types (Google Sheet, URL, file) with appropriate fallback error handling.


151-187: LGTM on dependency loading.

Clean delegation to the specialized loader commands with proper facility scoping.


378-426: LGTM on activity creation logic.

The duplicate detection by title and facility is reasonable. The exception handling provides good error context for debugging import failures.


Comment on lines +189 to +212
def lookup_locations(
self, location_names: list[str], facility: Facility
) -> tuple[list[int], list[str]]:
"""
Lookup location IDs by names.
Returns (location_ids, missing_names).
"""
if not location_names:
return [], []

location_ids = []
missing = []

for name in location_names:
location = FacilityLocation.objects.filter(
name__iexact=name.strip(), facility=facility
).first()
if location:
location_ids.append(location.id)
else:
missing.append(name)
logger.warning("Location not found: %s", name)

return location_ids, missing

⚠️ Potential issue | 🔴 Critical

Critical: lookup_locations method is defined but never called.

This method exists to resolve location names to their database IDs, but it's never invoked. In process_row (lines 275-279), location_ids is populated with raw string names from the CSV, and these strings are passed directly to create_activity_definition (line 413) where they're stored as locations. This will result in storing location names instead of actual location IDs in the locations ArrayField.

The fix should call lookup_locations during row processing or dependency resolution.

🐛 Proposed fix in resolve_dependencies method

Add location resolution in resolve_dependencies (after line 376):

+        # Resolve location names to IDs
+        location_names = data.get("location_ids", [])
+        if location_names:
+            location_ids, missing_locations = self.lookup_locations(
+                location_names, facility
+            )
+            data["location_ids"] = location_ids
+            for loc_name in missing_locations:
+                missing.append(f"location:{loc_name}")
+        else:
+            data["location_ids"] = []
+
         return data, missing
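Stripped of the ORM, the name-to-id resolution pattern looks like this (the dict stands in for the FacilityLocation queryset; a sketch, not the PR's code):

```python
def resolve_names(names, lookup):
    """Resolve each name to an id via `lookup`; unknowns are reported."""
    ids, missing = [], []
    for name in names:
        key = name.strip().lower()  # mirrors the name__iexact filter
        if key in lookup:
            ids.append(lookup[key])
        else:
            missing.append(f"location:{name}")
    return ids, missing
```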
🤖 Prompt for AI Agents
In `@care/emr/management/commands/load_activity_definition.py` around lines 189 -
212, The lookup_locations method is never used so location names from CSV are
being stored instead of IDs; update the row processing to call lookup_locations
and pass the resulting IDs into create_activity_definition: call
self.lookup_locations(location_names, facility) (e.g., in resolve_dependencies
after dependency resolution or in process_row before invoking
create_activity_definition), replace the raw location_names with the returned
location_ids when building the create_activity_definition payload, and handle
the missing names tuple (log or surface errors) so FacilityLocation IDs (not
strings) are stored in the locations ArrayField.

Comment on lines +275 to +279
location_ids = []
if row.get("locations"):
location_ids = [
s.strip() for s in row["locations"].split(",") if s.strip()
]

⚠️ Potential issue | 🟡 Minor

Misleading variable name: location_ids contains location names.

The variable location_ids is populated with location names from the CSV, not IDs. This is confusing and contributes to the bug where lookup_locations isn't called. Consider renaming to location_names for clarity.

-            location_ids = []
+            location_names = []
             if row.get("locations"):
-                location_ids = [
+                location_names = [
                     s.strip() for s in row["locations"].split(",") if s.strip()
                 ]
🤖 Prompt for AI Agents
In `@care/emr/management/commands/load_activity_definition.py` around lines 275 -
279, The variable location_ids is misnamed because it holds CSV location names;
rename it to location_names and update all usages accordingly (the assignment
from row["locations"], any checks, and places that expect names). Ensure that
when location_names is non-empty you call lookup_locations(location_names) to
resolve names to actual IDs and replace any downstream logic that assumed IDs to
use the resolved IDs. Update variable names in this file consistently (e.g.,
from location_ids to location_names and then to resolved_location_ids after
lookup) so intent is clear.

Comment on lines +321 to +376
def resolve_dependencies(
self, data: dict, facility: Facility
) -> tuple[dict, list[str]]:
"""
Resolve slug references to internal IDs.
Returns (updated_data, missing_dependencies).
"""
missing = []

# Resolve observation slugs
observation_ids = []
for slug in data["observation_slugs"]:
full_slug = ObservationDefinition.calculate_slug_from_facility(
str(facility.external_id), slug
)
obs = ObservationDefinition.objects.filter(
slug=full_slug, facility=facility
).first()
if obs:
observation_ids.append(obs.id)
else:
missing.append(f"observation:{slug}")

# Resolve specimen slugs
specimen_ids = []
for slug in data["specimen_slugs"]:
full_slug = SpecimenDefinition.calculate_slug_from_facility(
str(facility.external_id), slug
)
spec = SpecimenDefinition.objects.filter(
slug=full_slug, facility=facility
).first()
if spec:
specimen_ids.append(spec.id)
else:
missing.append(f"specimen:{slug}")

# Resolve charge item slugs
charge_item_ids = []
for slug in data["charge_item_slugs"]:
full_slug = ChargeItemDefinition.calculate_slug_from_facility(
str(facility.external_id), slug
)
charge = ChargeItemDefinition.objects.filter(
slug=full_slug, facility=facility
).first()
if charge:
charge_item_ids.append(charge.id)
else:
missing.append(f"charge_item:{slug}")

data["observation_ids"] = observation_ids
data["specimen_ids"] = specimen_ids
data["charge_item_ids"] = charge_item_ids

return data, missing

⚠️ Potential issue | 🔴 Critical

Location resolution missing from dependency resolution.

The method correctly resolves observations, specimens, and charge items but doesn't resolve locations. As noted above, this is where lookup_locations should be integrated to convert location names to IDs.

🤖 Prompt for AI Agents
In `@care/emr/management/commands/load_activity_definition.py` around lines 321 -
376, The resolve_dependencies function currently resolves observation, specimen,
and charge_item slugs but omits resolving locations; update resolve_dependencies
to iterate over data["location_names"] (or the field used for incoming location
references), call lookup_locations(str(facility.external_id), name) or the
existing lookup_locations helper to translate each location name into its model
id, collect found ids into data["location_ids"], and append missing names as
"location:<name>" to the missing list (use the same Facility instance and
consistent slug/id lookup pattern as for
ObservationDefinition/SpecimenDefinition/ChargeItemDefinition).

Comment on lines +482 to +502
data = self.process_row(row, facility, validate_codes, None)
slug_value = data["slug_value"]

data, missing = self.resolve_dependencies(data, facility)

if missing:
error_msg = f"Missing dependencies: {', '.join(missing)}"
logger.warning("Skipping %s: %s", data["title"], error_msg)
failed.append(slug_value)
output_rows.append(
{
"title": data["title"],
"slug_value": slug_value,
"status": "Failed",
"error": error_msg,
"code_substitutions": data.get("substitutions", ""),
}
)
continue

self.create_activity_definition(data, facility, None)

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Check how other loader commands handle created_by
rg -n "created_by" care/emr/management/commands/load_*_definition.py | head -50

Repository: ohcnetwork/care

Length of output: 1922


🏁 Script executed:

#!/bin/bash
# Find where process_row and create_activity_definition are called
rg -n "self\.process_row\|self\.create_activity_definition" care/emr/management/commands/load_activity_definition.py -B 2 -A 1

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Find where these methods are called
rg -n "process_row\|create_activity_definition" care/emr/management/commands/load_activity_definition.py | grep -v "def " | head -30

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Let's read the file around the lines mentioned in the review (482-502)
sed -n '475,510p' care/emr/management/commands/load_activity_definition.py

Repository: ohcnetwork/care

Length of output: 1648


🏁 Script executed:

#!/bin/bash
# Check the handle() method to see if there's user context available
sed -n '1,100p' care/emr/management/commands/load_activity_definition.py | grep -A 50 "def handle"

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Check the full structure of the command class
rg -n "def handle|add_arguments|class Command" care/emr/management/commands/load_activity_definition.py

Repository: ohcnetwork/care

Length of output: 173


🏁 Script executed:

#!/bin/bash
# Check the add_arguments method
sed -n '71,130p' care/emr/management/commands/load_activity_definition.py

Repository: ohcnetwork/care

Length of output: 1780


🏁 Script executed:

#!/bin/bash
# Check the handle method
sed -n '428,470p' care/emr/management/commands/load_activity_definition.py

Repository: ohcnetwork/care

Length of output: 1593


🏁 Script executed:

#!/bin/bash
# Check if other loader commands have user arguments
rg -n "add_argument.*user|--user" care/emr/management/commands/load_*_definition.py

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Check how load_specimen_definition.py calls process_row and create functions
sed -n '1,50p' care/emr/management/commands/load_specimen_definition.py | grep -A 20 "def handle"

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Check load_specimen_definition.py for how it calls process_row/create functions
rg -n "self\.process_row\|self\.create_" care/emr/management/commands/load_specimen_definition.py | head -20

Repository: ohcnetwork/care

Length of output: 41


🏁 Script executed:

#!/bin/bash
# Search for process_row calls in specimen loader
rg -n "process_row" care/emr/management/commands/load_specimen_definition.py -A 1 -B 1

Repository: ohcnetwork/care

Length of output: 308


🏁 Script executed:

```bash
#!/bin/bash
# Check load_observation_definition.py process_row calls
rg -n "process_row" care/emr/management/commands/load_observation_definition.py -A 1 -B 1
```

Length of output: 340


🏁 Script executed:

```bash
#!/bin/bash
# Check load_charge_item_definition for process_row calls
rg -n "process_row" care/emr/management/commands/load_charge_item_definition.py -A 1 -B 1
```

Length of output: 326


🏁 Script executed:

```bash
#!/bin/bash
# Let's check the method signatures for create functions to see if they use created_by
rg -n "def create_activity_definition|def create_specimen|def create_observation|def create_charge_item" care/emr/management/commands/load_*_definition.py -A 3
```

Length of output: 1621


🏁 Script executed:

```bash
#!/bin/bash
# Double-check the exact signature of load_observation's process_row to confirm
sed -n '123,130p' care/emr/management/commands/load_observation_definition.py
```

Length of output: 341


`created_by` is always `None` throughout the loader commands.

Both `process_row` (line 482) and `create_activity_definition` (line 502) receive `None` for `created_by`. This is not unique to this command; the same pattern runs through all the loader commands (`load_specimen_definition.py`, `load_observation_definition.py`, `load_charge_item_definition.py`). If this is intentional, a comment explaining why audit trails are not recorded during bulk imports would help. Otherwise, consider adding a `--user` argument or resolving a system user so that imported records keep a consistent audit trail.

🤖 Prompt for AI Agents

```
In `@care/emr/management/commands/load_activity_definition.py` around lines
482-502, the loader always passes None for created_by (see process_row and
create_activity_definition), which prevents audit trails; update the command to
accept a --user argument (or resolve a system/import user via a helper like
get_system_user()) and set a local created_by variable that is passed into
process_row(...) and create_activity_definition(...); apply the same change
pattern to the other loaders (load_specimen_definition.py,
load_observation_definition.py, load_charge_item_definition.py) or, if None is
intentional, add a clear comment explaining why audit fields are intentionally
omitted during bulk imports.
```
