Add Tasklet Classification by ThrudPrimrose · Pull Request #2280 · spcl/dace

ThrudPrimrose · 2026-01-28T11:05:58Z

The analysis returns a dictionary containing the left- and right-hand-side operands of a given tasklet. It returns the operator (or function), the constants involved, and whether they are data (array or scalar) or constants (hardcoded constants or symbols), etc.

I have implemented the analysis for the auto-vectorization to determine whether to use vector-vector intrinsics or vector-scalar intrinsics, etc. It only accepts tasklets with at most 2 rhs operands. SplitTasklets that I have merged before can be used to ensure that all supported operators and a subset of functions (such as min, sin, cos, etc.) are used. Python tasklets have 2 rhs operands.

The analysis was also useful for a student trying to implement blocked-FP numbers in DaCe, so I think it will have further use beyond auto-vectorization.

phschaad

Some questions and minor corrections - after that will take closer look at remaining parts

dace/sdfg/tasklet_utils.py

Co-authored-by: Philipp Schaad <schaad.phil@gmail.com>

tbennun · 2026-01-28T16:08:09Z

dace/sdfg/tasklet_utils.py

+import typing
+
+
+class TaskletType(Enum):


tbennun · 2026-01-28T17:55:48Z

dace/sdfg/tasklet_utils.py

+
+
+def _token_split(string_to_check: str) -> Set[str]:
+    """


not the correct (Sphinx) docstring format we use in the rest of dace (apply everywhere)

tbennun · 2026-01-28T17:56:03Z

dace/sdfg/tasklet_utils.py

+def _token_split(string_to_check: str) -> Set[str]:
+    """
+    Splits a string into a set of tokens, keeping delimiters, and returns all tokens.
+    The input string is split on empty space and brackets (` `, `(`, `)`, `[`, `]`).


double backticks (``) for code comments

tbennun · 2026-01-28T17:57:53Z

dace/sdfg/tasklet_utils.py

+            if isinstance(node.op, ast.USub):
+                return f"-{node.operand.value}"
+            elif isinstance(node.op, ast.UAdd):
+                return str(node.operand.value)


why not +value?

tbennun · 2026-01-28T17:58:39Z

dace/sdfg/tasklet_utils.py

+    return lhs_str, rhs_str
+
+
+def _extract_non_connector_syms_from_tasklet(node: dace.nodes.Tasklet, state) -> typing.Set[str]:


missing type hints and direct inclusion of typing.
Since dace is Python 3.10+, use set[str] instead of typing

tbennun · 2026-01-28T18:07:34Z

dace/sdfg/tasklet_utils.py

+    n_in = len(in_conns)
+    n_out = len(out_conns)
+
+    assert n_out <= 1, "Only support tasklets with at most 1 output in this pass"


tbennun · 2026-01-28T18:08:03Z

dace/sdfg/tasklet_utils.py

+    assert isinstance(node, dace.nodes.Tasklet)
+    code: CodeBlock = node.code
+    assert code.language == dace.dtypes.Language.Python
+    code_str: str = code.as_string


it's inefficient that you already have an AST object and you convert it to a string and then reparse it. Explain your reasoning please.

tbennun · 2026-01-28T18:09:25Z

dace/sdfg/tasklet_utils.py

+        lhs_data = state.sdfg.arrays[lhs_data_name]
+
+        # Assignment operators it will return op <- `=` and always populate `rhs1`
+        if code_str == f"{lhs} = {rhs}" or code_str == f"{lhs} = {rhs};":


I would not trust an expression like this (e.g., what about a = (b)?). Why not use isinstance(ast.Assign/AnnAssign)?

tbennun · 2026-01-28T18:10:42Z

dace/sdfg/tasklet_utils.py

+                    info_dict.update({"type": ttype, "constant1": c1, "constant2": None, "op": op})
+                    return info_dict
+
+    raise NotImplementedError("Unhandled case in detect tasklet type")


Shouldn't this just return a cannot-classify-tasklet or "other" TaskletType result?

tbennun · 2026-01-28T18:11:50Z

dace/sdfg/tasklet_utils.py

+        return node
+
+
+def rewrite_boolean_functions_to_boolean_ops(src: str) -> str:


where is this used?

alexnick83

Very well written, but I feel that it is a bit overengineered considering the currently limited supported cases. Please find some comments and questions below, but I will probably need to have another look.

alexnick83 · 2026-01-28T17:32:38Z

dace/sdfg/tasklet_utils.py

+    Each pattern represents a specific combination of input types (arrays, scalars, symbols)
+    and operation types (assignment, binary operation, unary operation).
+
+    Note: inside a tasklet you always have scalars, it is about he connector types


Suggested change

Note: inside a tasklet you always have scalars, it is about he connector types

Note: inside a tasklet you always have scalars, it is about the connector types

alexnick83 · 2026-01-28T17:35:02Z

dace/sdfg/tasklet_utils.py

+        ARRAY_ARRAY_ASSIGNMENT: Direct array-to-array copy (e.g., a = b)
+        ARRAY_SYMBOL_ASSIGNMENT: Symbol/constant assignment to array (e.g., a = sym)
+        ARRAY_SCALAR_ASSIGNMENT: Scalar variable assignment to array (e.g., a = scl)
+        SCALAR_ARRAY_ASSIGNMENT: Array assignment to scalar variable (e.g., scl = a)


Is this a valid case? Even when, e.g., you assign A[i] to a scalar b, isn't the source connector a scalar?

I see now, way below (line 600+), that you are looking at the data type, not the connector type. Maybe this should be clarified in the note above (line 26).

alexnick83 · 2026-01-28T17:42:03Z

dace/sdfg/tasklet_utils.py

+    ----------
+    string_to_check : str
+        The string to split into tokens.
+    pattern_str : str


But the parameter is not actually kept in the signature

alexnick83 · 2026-01-28T17:43:56Z

dace/sdfg/tasklet_utils.py

+    # Split while keeping delimiters
+    tokens = re.split(r'(\s+|[()\[\]])', string_to_check)
+
+    # Replace tokens that exactly match src


Comment doesn't make sense (copied from token_match?)

alexnick83 · 2026-01-28T17:48:55Z

dace/sdfg/tasklet_utils.py

+        if isinstance(node, ast.Constant):
+            return str(node.value)
+        elif isinstance(node, ast.UnaryOp) and isinstance(node.operand, ast.Constant):
+            if isinstance(node.op, ast.USub):


Do we need the bitwise inversion? In that case, I would evaluate the result and convert it to a string.

alexnick83 · 2026-01-28T18:10:04Z

dace/sdfg/tasklet_utils.py

+
+        found = op
+
+    code_rhs = src.split(" = ")[-1].strip()


Wouldn't it be better to convert to AST, find the assignment, and then take the value?

alexnick83 · 2026-01-28T18:12:34Z

dace/sdfg/tasklet_utils.py

+    """
+    tdict = dict()
+    for ie in state.in_edges(tasklet):
+        if ie.data is not None:


Maybe worth testing what happens when the input comes from another tasklet ... we used to have some "hidden" scalar descriptors for that, but I am not up to date on that.

alexnick83 · 2026-01-28T18:14:53Z

dace/sdfg/tasklet_utils.py

+        For function calls, uses AST parsing to extract arguments in order.
+        For operators, splits the code by the operator symbol.
+    """
+    code_rhs = code_str.split(" = ")[-1].strip()


Same as above.

alexnick83 · 2026-01-28T18:19:26Z

dace/sdfg/tasklet_utils.py

+    n_out = len(out_conns)
+
+    assert n_out <= 1, "Only support tasklets with at most 1 output in this pass"
+    lhs = next(iter(node.out_connectors.keys())) if n_out == 1 else None


Very nitpicky, but why not just lhs = out_conns[0] if n_out == 1 else None?

alexnick83 · 2026-01-28T18:39:30Z

dace/sdfg/tasklet_utils.py

+
+    assert n_out == 1
+
+    if n_in == 1:


Given the many restrictions on supported tasklets, the following feels somewhat too long. If I understand correctly, you accept codes with a single expression (assignment). The RHS may be either an operand or a unary/binary operator. The operands can only be connector names, symbols, or other expressions that evaluate to a constant. You also have a special case where the binary operation is applied to the same connector and interpreted as a unary op. Are you going to support more generalized tasklets in the future?

ThrudPrimrose added 2 commits January 28, 2026 12:00

Add tasklet classification

c5ba1ff

Rm init file?

aa322e9

ThrudPrimrose requested review from phschaad and tbennun January 28, 2026 11:20

phschaad requested changes Jan 28, 2026

View reviewed changes

dace/sdfg/tasklet_utils.py Outdated Show resolved Hide resolved

dace/sdfg/tasklet_utils.py Outdated Show resolved Hide resolved

dace/sdfg/tasklet_utils.py Outdated Show resolved Hide resolved

ThrudPrimrose and others added 2 commits January 28, 2026 13:31

Update dace/sdfg/tasklet_utils.py

1924029

Co-authored-by: Philipp Schaad <schaad.phil@gmail.com>

Clean up unused functions and unnecessary imports

338b16b

ThrudPrimrose requested a review from phschaad January 28, 2026 12:35

tbennun requested changes Jan 28, 2026

View reviewed changes

alexnick83 reviewed Jan 28, 2026

View reviewed changes

		return lhs_str, rhs_str


		def _extract_non_connector_syms_from_tasklet(node: dace.nodes.Tasklet, state) -> typing.Set[str]:

		return node


		def rewrite_boolean_functions_to_boolean_ops(src: str) -> str:

	Note: inside a tasklet you always have scalars, it is about he connector types
	Note: inside a tasklet you always have scalars, it is about the connector types

		import typing


		class TaskletType(Enum):

Conversation

ThrudPrimrose commented Jan 28, 2026

Uh oh!

phschaad left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexnick83 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants