Skip to content

Make retags an implicit part of typed copies#154341

Open
RalfJung wants to merge 1 commit intorust-lang:mainfrom
RalfJung:retag-on-typed-copy
Open

Make retags an implicit part of typed copies#154341
RalfJung wants to merge 1 commit intorust-lang:mainfrom
RalfJung:retag-on-typed-copy

Conversation

@RalfJung
Copy link
Copy Markdown
Member

@RalfJung RalfJung commented Mar 24, 2026

View all comments

Ever since Stacked Borrows was first implemented in Miri, that was done with Retag statements: given a place (usually a local variable), those statements find all references stored inside the place and refresh their tags to ensure the aliasing requirements are upheld. However, this is a somewhat unsatisfying approach for multiple reasons:

  • It leaves open the question of where to even put Retag statements. Over time, the AddRetag pass settled on one possible answer to this, but it wasn't very canonical.
  • For assignments of the form *ptr = expr, if the assignment involves copying a reference, we probably want to do a retag -- but if we do a Retag(*ptr) as the next instruction, it can be non-trivial to argue that this even retags the right value, so we refrained from doing retags in that case. This has come up as a potential issue for Rust making better use of LLVM "captures" annotations.
  • Normal compilation avoids generating retags, but we still generate LLVM IR with noalias. What does that even mean? How do MIR optimization passes interact with retags? These are questions we have to figure out to make better use of aliasing information, but currently we can't even really ask such questions.

I think we should resolve all that by making retags part of what happens during a typed copy (a concept and interpreter infrastructure that did not exist yet when retags were initially introduced). Under this proposal, when executing a MIR assignment statement, what conceptually happens is as follows:

  • We evaluate the LHS to a place.
  • We evaluate the RHS to a value. This does a typed load from memory if needed, raising UB if memory does not contain a valid representation of the assignment's type.
  • We walk that value, identify all references inside of it, and retag them. If this happens as part of passing a function argument, this is a protecting retag.
  • We store (a representation of) the value into the place.

However, this semantics doesn't fully work: there's a mandatory MIR pass that turns expressions like &mut ***ptr into intermediate deref's. Those must not do any retags. So far this happened because the AddRetag pass did not add retags for assignments to deref temporaries, but that information is not recorded in cross-crate MIR. Therefore I instead added a field to Rvalue::Use to indicate whether this value should be retagged or not. A non-retagging copy seems like a sufficiently canonical primitive that we should be able to express it. Dealing with the fallout from that is a large chunk of the overall diff. (I also considered adding this field to StatementKind::Assign instead, but decided against that as we only actually need it for Rvalue::Use. I am not sure if this was the right call...)

This neatly answers the question of when retags should occur, and handles cases like *ptr = expr. It avoids traversing values twice in Miri. It makes codegen's use of noalias sound wrt the actual MIR that it is working on. It also gives us a target semantics to evaluate MIR opts against. However, I did not carefully check all MIR opts -- in particular, GVN needs a thorough look under the new semantics; it currently can turn alias-correct code into alias-incorrect code. (But this PR doesn't make things any worse for normal compilation where the retag indicator is anyway ignored.)

Another side-effect of this PR is that -Zmiri-disable-validation now also disables alias checking. It'd be nicer to keep them orthogonal but I find this an acceptable price to pay.

@rustbot rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Mar 24, 2026
@rust-log-analyzer

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from dbabc07 to c5a3e40 Compare March 24, 2026 22:18
@rust-log-analyzer

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from c5a3e40 to df515dd Compare March 24, 2026 22:44
@rustbot rustbot added the T-clippy Relevant to the Clippy team. label Mar 24, 2026
@RalfJung
Copy link
Copy Markdown
Member Author

@bors try
@rust-timer queue

@rust-timer

This comment has been minimized.

@rustbot rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 24, 2026
@rust-bors

This comment has been minimized.

rust-bors bot pushed a commit that referenced this pull request Mar 24, 2026
Make retags an implicit part of typed copies
@rust-log-analyzer

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from df515dd to 76c8c9d Compare March 24, 2026 22:52
@rust-log-analyzer

This comment has been minimized.

@rust-bors
Copy link
Copy Markdown
Contributor

rust-bors bot commented Mar 25, 2026

☀️ Try build successful (CI)
Build commit: 82d9903 (82d99031f4626ac962af0c7f6d78d1f7173d7145, parent: 362211dc29abc4e8f8cfc384740237f144929b03)

@rust-timer

This comment has been minimized.

@rust-timer

This comment was marked as outdated.

@rustbot rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Mar 25, 2026
@RalfJung
Copy link
Copy Markdown
Member Author

Looks like enabling validation of references just to keep retags working in const-eval was not a good idea...

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from 76c8c9d to d79e607 Compare March 25, 2026 07:23
@RalfJung
Copy link
Copy Markdown
Member Author

@bors try
@rust-timer queue

@rust-timer

This comment has been minimized.

@rust-bors

This comment has been minimized.

rust-bors bot pushed a commit that referenced this pull request Mar 25, 2026
Make retags an implicit part of typed copies
@rustbot rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 25, 2026
@rust-log-analyzer

This comment has been minimized.

@rust-bors
Copy link
Copy Markdown
Contributor

rust-bors bot commented Mar 25, 2026

☀️ Try build successful (CI)
Build commit: 5bbea76 (5bbea7620d94ef1e4dd2e6617ed840cde1cf87f3, parent: 8a703520e80d87d4423c01f9d4fbc9e5f6533a02)

@rust-log-analyzer

This comment has been minimized.

@rust-bors
Copy link
Copy Markdown
Contributor

rust-bors bot commented Mar 25, 2026

☀️ Try build successful (CI)
Build commit: 2b1714e (2b1714e869694712562be686f5f043b35954ce82, parent: 8a703520e80d87d4423c01f9d4fbc9e5f6533a02)

@RalfJung
Copy link
Copy Markdown
Member Author

Miri performance numbers are positive throughout. :)

Comparison with baseline (relative speed, lower is better for the new results):
  backtraces: 0.97 ± 0.01
  big-allocs: 0.99 ± 0.10
  mse: 0.97 ± 0.02
  range-iteration: 0.94 ± 0.01
  serde1: 0.94 ± 0.01
  serde2: 0.93 ± 0.00
  slice-chunked: 0.96 ± 0.03
  slice-get-unchecked: 0.99 ± 0.02
  string-replace: 0.95 ± 0.03
  unicode: 0.97 ± 0.01
  zip-equal: 0.86 ± 0.01

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch 2 times, most recently from 8f6343a to 443e1d2 Compare March 25, 2026 21:57
@rust-log-analyzer

This comment has been minimized.

@rust-bors

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from 443e1d2 to d298feb Compare March 26, 2026 07:07
@rustbot

This comment has been minimized.

@RalfJung
Copy link
Copy Markdown
Member Author

Yay, CI is green. :)

@fee1-dead as randomly selected reviewer for a big PR like this -- please let me know if you're comfortable reviewing this (in due time of course, no rush), or if I should go look for someone else.

) -> InterpResult<'tcx, Option<ImmTy<'tcx, CtfeProvenance>>> {
if matches!(ecx.machine.retag_mode, RetagMode::None | RetagMode::Raw) {
return interp_ok(None);
}
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

out of curiosity -- what's the difference between Raw and None for RetagMode?

Copy link
Copy Markdown
Member Author

@RalfJung RalfJung Mar 26, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A Raw retag does more than a default retag: in Stacked Borrows, when it finds a raw pointer, it will retag that. (In Tree Borrows, default retags and raw retags are equivalent.)

A None retag does less than a default retag. It doesn't retag anything.

@rust-bors

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from d298feb to de5358f Compare March 28, 2026 09:00
@rustbot

This comment has been minimized.

@rust-bors

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from de5358f to 764096e Compare March 28, 2026 11:15
@rustbot

This comment has been minimized.

@fee1-dead
Copy link
Copy Markdown
Member

please let me know if you're comfortable reviewing this (in due time of course, no rush), or if I should go look for someone else.

Sorry for the late reply, I took a shallow look today and I don't think I'd be able to review the changes (specifically MIR transforms semantics). So I'll reroll, feel free to r? someone, of course.

@rustbot reroll

@rustbot rustbot assigned jackh726 and unassigned fee1-dead Mar 29, 2026
@rust-bors

This comment has been minimized.

@RalfJung RalfJung force-pushed the retag-on-typed-copy branch from 764096e to ce09dd9 Compare April 10, 2026 07:00
@rustbot
Copy link
Copy Markdown
Collaborator

rustbot commented Apr 10, 2026

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

@oli-obk oli-obk assigned oli-obk and unassigned jackh726 Apr 12, 2026
@oli-obk
Copy link
Copy Markdown
Contributor

oli-obk commented Apr 12, 2026

there's a mandatory MIR pass that turns expressions like &mut ***ptr into intermediate deref's. Those must not do any retags. So far this happened because the AddRetag pass did not add retags for assignments to deref temporaries, but that information is not recorded in cross-crate MIR. Therefore I instead added a field to Rvalue::Use to indicate whether this value should be retagged or not.

Before I dive into this PR, does that mean that only Use rvalues in those split derefs ever use a different value for the new field at this stage? Everthing else has Use rvalues which do retagging?

Because I'm getting the feeling you've discovered a representation that may allow us to move the derefer pass before borrowck, because borrowck now has a trivial way to figure out whether it should... "retag"

@RalfJung
Copy link
Copy Markdown
Member Author

RalfJung commented Apr 12, 2026

Before I dive into this PR, does that mean that only Use rvalues in those split derefs ever use a different value for the new field at this stage? Everthing else has Use rvalues which do retagging?

Correct. MIR building always sets WithRetag::Yes. The erase-deref-temp and deref-separator passes and some MIR opts (InstSimplify, LargeEnums, MatchBranches) are the only places that introduce no-retag uses.

Because I'm getting the feeling you've discovered a representation that may allow us to move the derefer pass before borrowck, because borrowck now has a trivial way to figure out whether it should... "retag"

Maybe? I don't know enough avout borrowck to judge this. :)

Note that the derefer pass introduces CopyForDeref, so the representation of what that pass does is not touched by this PR. This PR only changes what happens with CopyForDeref later in the MIR pipeline. It seems possible that CopyForDeref can be entirely replaced by no-retag uses, but that touches too much code I am not familiar with.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-clippy Relevant to the Clippy team. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants