docs/concepts/models/overview.md
+3 lines changed: 3 additions & 0 deletions
@@ -250,6 +250,9 @@ For models that are incremental, the following parameters can be specified in th
 ### batch_size
 - Batch size is used to optimize backfilling incremental data. It determines the maximum number of intervals to run in a single job. For example, if a model specifies a cron of `@hourly` and a batch_size of `12`, when backfilling 3 days of data, the scheduler will spawn 6 jobs. (3 days * 24 hours/day = 72 hour intervals to fill. 72 intervals / 12 intervals per job = 6 jobs.)
 
+### batch_concurrency
+- The maximum number of [batches](#batch_size) that can run concurrently for this model. If not specified, the concurrency is only constrained by the number of concurrent tasks set in the connection settings.
+
 ### forward_only
 - Set this to true to indicate that all changes to this model should be [forward-only](../plans.md#forward-only-plans).
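The `batch_size` arithmetic in the hunk above can be checked with a short sketch. This is not SQLMesh code: `plan_batches` is a hypothetical helper written for illustration, and it assumes that a trailing partial batch becomes its own job.

```python
from typing import List, Optional


def plan_batches(num_intervals: int, batch_size: Optional[int]) -> List[int]:
    """Return how many intervals each backfill job would cover.

    Hypothetical helper: it mirrors the arithmetic in the docs text,
    not the actual SQLMesh scheduler.
    """
    if num_intervals <= 0:
        return []
    if batch_size is None:
        # No batch size: every interval goes into a single job.
        return [num_intervals]
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    full_jobs, remainder = divmod(num_intervals, batch_size)
    # Assumption: leftover intervals form one final, smaller job.
    return [batch_size] * full_jobs + ([remainder] if remainder else [])


# The example from the docs: an @hourly model, 3 days of data, batch_size 12.
hourly_intervals = 3 * 24
jobs = plan_batches(hourly_intervals, 12)
print(len(jobs))  # prints 6
```

With `batch_size` left unset, the same call returns one job that covers all 72 intervals.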
docs/reference/model_configuration.md
+6 -7 lines changed: 6 additions & 7 deletions
@@ -21,7 +21,6 @@ Configuration options for SQLMesh model properties. Supported by all model kinds
 |`interval_unit`| The temporal granularity of the model's data intervals. Supported values: `year`, `month`, `day`, `hour`, `half_hour`, `quarter_hour`, `five_minute`. (Default: inferred from `cron`) | str | N |
 |`start`| The date/time that determines the earliest date interval that should be processed by a model. Can be a datetime string, epoch time in milliseconds, or a relative datetime such as `1 year ago`. | str \| int | N |
 |`end`| The date/time that determines the latest date interval that should be processed by a model. Can be a datetime string, epoch time in milliseconds, or a relative datetime such as `1 year ago`. | str \| int | N |
-|`batch_size`| The maximum number of intervals that can be evaluated in a single backfill task. If this is `None`, all intervals will be processed as part of a single task. If this is set, a model's backfill will be chunked such that each individual task only contains jobs with the maximum of `batch_size` intervals. (Default: `None`) | int | N |
 |`grains`| The column(s) whose combination uniquely identifies each row in the model | str \| array[str]| N |
 |`references`| The model column(s) used to join to other models' grains | str \| array[str]| N |
 |`depends_on`| Models on which this model depends. (Default: dependencies inferred from model code) | array[str]| N |
@@ -45,7 +44,6 @@ The SQLMesh project-level `model_defaults` key supports the following options, d
 - owner
 - start
 - end
-- batch_size
 - storage_format
 
 ## Model kind properties
@@ -74,10 +72,11 @@ Python model configuration object: [FullKind()](https://sqlmesh.readthedocs.io/e
 
 Configuration options for all incremental models (in addition to [general model properties](#general-model-properties)).
-|`batch_size`| The maximum number of intervals that can be evaluated in a single backfill task. If this is `None`, all intervals will be processed as part of a single task. If this is set, a model's backfill will be chunked such that each individual task only contains jobs with the maximum of `batch_size` intervals. (Default: `None`) | int | N |
-|`lookback`| The number of time unit intervals prior to the current interval that should be processed. (Default: `0`) | int | N |
+|`batch_size`| The maximum number of intervals that can be evaluated in a single backfill task. If this is `None`, all intervals will be processed as part of a single task. If this is set, a model's backfill will be chunked such that each individual task only contains jobs with the maximum of `batch_size` intervals. (Default: `None`) | int | N |
+|`batch_concurrency`| The maximum number of batches that can run concurrently for this model (Default: the number of concurrent tasks set in the connection settings). | int | N |
+|`lookback`| The number of time unit intervals prior to the current interval that should be processed. (Default: `0`) | int | N |
 
 #### Incremental by time range
 
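The `batch_concurrency` default described in this hunk can be read as a cap applied on top of the connection's concurrent-task limit. The sketch below illustrates that reading only. `effective_concurrency` and `backfill_waves` are hypothetical names, taking the minimum of the two limits is an assumption drawn from the docs wording, and real scheduling need not proceed in lock-step waves.

```python
import math
from typing import Optional


def effective_concurrency(batch_concurrency: Optional[int], concurrent_tasks: int) -> int:
    """Number of batches of one model that may run at the same time.

    Assumption: an unset batch_concurrency falls back to the connection's
    concurrent task limit, and a set value cannot exceed that limit.
    """
    if batch_concurrency is None:
        return concurrent_tasks
    return min(batch_concurrency, concurrent_tasks)


def backfill_waves(num_batches: int, batch_concurrency: Optional[int], concurrent_tasks: int) -> int:
    """Rough count of sequential rounds needed to run all batches."""
    limit = effective_concurrency(batch_concurrency, concurrent_tasks)
    return math.ceil(num_batches / limit)


# Six batches, at most 2 at a time for this model, connection allows 4 tasks.
print(backfill_waves(6, 2, 4))  # prints 3
```

Leaving `batch_concurrency` unset in the same scenario gives two rounds, because only the connection limit of 4 applies.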
@@ -172,4 +171,4 @@ Options specified within the `kind` property's `csv_settings` property (override
 |`lineterminator`| Character used to denote a line break. More information at the [Pandas documentation](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html). | str | N |
 |`encoding`| Encoding to use for UTF when reading/writing (ex. 'utf-8'). More information at the [Pandas documentation](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html). | str | N |
 
-Python model configuration object: [SeedKind()](https://sqlmesh.readthedocs.io/en/stable/_readthedocs/html/sqlmesh/core/model/kind.html#SeedKind)
+Python model configuration object: [SeedKind()](https://sqlmesh.readthedocs.io/en/stable/_readthedocs/html/sqlmesh/core/model/kind.html#SeedKind)