fix: redact pending primary email before retirement deletion by ktyagiapphelix2u · Pull Request #38426 · openedx/openedx-platform

ktyagiapphelix2u · 2026-04-23T10:23:01Z

Summary

This change addresses a privacy issue in the retirement flow for users who have a pending primary email change.

Problem

When a user retires with an active row in student_pendingemailchange, the LMS deletes that row. However, sensitive data such as the pending email and activation key could still persist indirectly (e.g., in logs, backups, or downstream systems), creating a privacy risk.

Root Cause

The retirement flow deleted PendingEmailChange records directly without redacting sensitive fields first.

What Changed

Added a model helper to redact PendingEmailChange fields for a user before deletion
Updated the retirement flow to call redaction before deleting records
Added tests to verify redaction behavior and correct ordering
Updated inline comments and PII annotations to explicitly document the “redact then delete” approach

Behavior Before

User retires with a pending primary email
LMS deletes the pending email row
Sensitive values (e.g., pending email) may still persist indirectly

Behavior After

User retires with a pending primary email
LMS first redacts sensitive fields in the pending email record
LMS deletes the record
Any persisted traces contain only redacted values
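The ordering described above can be sketched with a small in-memory stand-in. This is not the LMS implementation: `PendingEmailChangeRow`, `Table`, `retired_email_for`, and `retire_pending_email` are hypothetical names, the retired-email format is assumed, and clearing `activation_key` is an assumption taken from the review discussion rather than from the diff. The point is only that a snapshot captured at delete time sees whatever values the row holds at that moment.

```python
import hashlib
from dataclasses import dataclass, asdict


@dataclass
class PendingEmailChangeRow:
    """Hypothetical stand-in for a student_pendingemailchange row."""
    user_id: int
    new_email: str
    activation_key: str


def retired_email_for(email: str) -> str:
    """Hypothetical stand-in for get_retired_email_by_email (format is assumed)."""
    digest = hashlib.sha256(email.lower().encode("utf-8")).hexdigest()
    return f"retired__user_{digest}@retired.invalid"


class Table:
    """In-memory table whose delete() leaves a soft-delete snapshot behind."""

    def __init__(self):
        self.rows = {}
        self.snapshots = []  # what a downstream system might keep

    def insert(self, row):
        self.rows[row.user_id] = row

    def delete(self, user_id):
        row = self.rows.pop(user_id, None)
        if row is not None:
            self.snapshots.append(asdict(row))
        return row is not None


def retire_pending_email(table, user_id):
    """Redact first, then delete, so the snapshot holds only redacted values."""
    row = table.rows.get(user_id)
    if row is None:
        return False
    row.new_email = retired_email_for(row.new_email)
    row.activation_key = ""  # assumption: cleared, per the review discussion
    return table.delete(user_id)


table = Table()
table.insert(PendingEmailChangeRow(1, "learner@example.com", "abc123"))
retired = retire_pending_email(table, user_id=1)
snapshot = table.snapshots[0]
```

If the delete ran without the redaction step, `snapshot` would carry the original address and key, which is the "Behavior Before" case.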

Ticket & Reference

https://2u-internal.atlassian.net/browse/BOMS-498

robrap · 2026-04-24T16:51:27Z

+        original_new_email = self.email_change.new_email
+        original_activation_key = self.email_change.activation_key
+        record_was_redacted = PendingEmailChange.redact_pending_email_by_user_value(self.user2, field='user')
+        assert not record_was_redacted


I don't really understand this test. Should it just have lines 617 and 618, where you ask to redact on a user that isn't in the table, and it returns that it didn't redact?

All the other details about the user 1 email change seem irrelevant and confusing. If you think it is important, I'd need better comments.

The test now:

Has a clear docstring explaining it verifies redacting a user with no pending email change returns False

Only tests the relevant behavior - calling redact on user2 (who has no email change record) returns False

Removes the confusing assertions about user1's email change data remaining unchanged

robrap · 2026-04-24T16:53:50Z

+        Redact pending email change fields for records matching ``field=value``.
+        This method is intended for retirement flows where downstream systems
+        may keep soft-deleted snapshots of these rows.
+        """


Docstrings should have a one-line summary, then an optional blank line and a longer description, like a commit message.

Also, maybe add something like:

Returns True if redacted, and False if no matching records found.
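A minimal sketch of the requested docstring shape, using a plain function over a list of dicts rather than the model classmethod in the PR. The `records` parameter and the `"redacted"` placeholder are assumptions made so the example runs without Django; only the docstring layout and the True/False contract come from this thread.

```python
def redact_pending_email_by_user_value(records, value, field="user"):
    """
    Redact pending email change fields for records matching ``field=value``.

    Intended for retirement flows where downstream systems may keep
    soft-deleted snapshots of these rows.

    Returns True if redacted, and False if no matching records found.
    """
    matches = [record for record in records if record.get(field) == value]
    if not matches:
        return False
    for record in matches:
        record["new_email"] = "redacted"  # placeholder redaction for the sketch
    return True


records = [{"user": "user1", "new_email": "new@example.com"}]
no_match_result = redact_pending_email_by_user_value(records, "user2")
match_result = redact_pending_email_by_user_value(records, "user1")
```

The `user2` call mirrors the simplified test discussed earlier: redacting for a user with no pending record returns False and leaves other rows alone.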

robrap · 2026-04-24T16:54:34Z

        assert not record_was_deleted
        assert 1 == len(PendingEmailChange.objects.all())

+    def test_redact_by_user_redacts_pending_email_change_fields(self):


Should this be updated to test multiple pending records and ensure that they are all redacted?

The user field is a OneToOneField with unique=True, so there can only be one PendingEmailChange per user. The test already covers the maximum case (redacting 1 record). Multiple records per user aren't possible with this model constraint.
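The constraint behind this answer can be illustrated at the database level. The sketch below uses `sqlite3` instead of the Django ORM, and the table and column names are hypothetical; it only shows that a UNIQUE column, which is what a one-to-one relation enforces, rejects a second row for the same user.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE pending_email_change ("
    " id INTEGER PRIMARY KEY,"
    " user_id INTEGER NOT NULL UNIQUE,"  # uniqueness, as a one-to-one column enforces
    " new_email TEXT NOT NULL)"
)
conn.execute(
    "INSERT INTO pending_email_change (user_id, new_email) VALUES (?, ?)",
    (1, "first@example.com"),
)

try:
    # A second pending change for the same user violates the UNIQUE constraint.
    conn.execute(
        "INSERT INTO pending_email_change (user_id, new_email) VALUES (?, ?)",
        (1, "second@example.com"),
    )
    second_insert_rejected = False
except sqlite3.IntegrityError:
    second_insert_rejected = True

row_count = conn.execute(
    "SELECT COUNT(*) FROM pending_email_change WHERE user_id = 1"
).fetchone()[0]
```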

Akanshu-2u · 2026-04-27T13:54:07Z

+            record.new_email = get_retired_email_by_email(record.new_email)
+            record.save(update_fields=['new_email'])


The PR description explicitly identifies activation_key as sensitive data that can still persist indirectly in logs, backups, or downstream systems. However, the implementation only redacts new_email; activation_key is left as-is. If downstream systems snapshot these rows, the activation key still leaks. It should be cleared before deletion, e.g.:

Suggested change

-            record.new_email = get_retired_email_by_email(record.new_email)
-            record.save(update_fields=['new_email'])
+            record.new_email = get_retired_email_by_email(record.new_email)
+            record.activation_key = ''  # or a redacted value
+            record.save(update_fields=['new_email', 'activation_key'])

Akanshu-2u · 2026-04-27T13:56:42Z

+        assert record_was_redacted
+        self.email_change.refresh_from_db()
+        assert self.email_change.new_email == expected_retired_email
+        assert self.email_change.activation_key == original_activation_key


If activation_key redaction is added, this assertion must be updated to verify that the key is also cleared or replaced. As written, it asserts that the key is unchanged, so the test will have to change once the issue above is fixed.

Akanshu-2u · 2026-04-27T14:03:29Z

+        records_matching_user_value = cls.objects.filter(**filter_kwargs)
+        if not records_matching_user_value.exists():
+            return False
+        for record in records_matching_user_value:


Both queries fetch the same data. Since the queryset is lazy, .exists() and the for loop each trigger a separate SQL query. Change to:

Suggested change

-        records_matching_user_value = cls.objects.filter(**filter_kwargs)
-        if not records_matching_user_value.exists():
-            return False
-        for record in records_matching_user_value:
+        records = list(cls.objects.filter(**filter_kwargs))
+        if not records:
+            return False
+        for record in records:

This results in a single DB hit, which matters more if the filter field is ever something other than the OneToOneField on user.

Note: The change is optional.
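The difference between the two shapes can be shown with a toy lazy collection that counts simulated database hits. `CountingQuerySet` is a hypothetical stand-in, not Django's `QuerySet`, and it skips details such as result caching; it only models the point made above that `.exists()` and iteration each run their own query on an unevaluated queryset.

```python
class CountingQuerySet:
    """Hypothetical stand-in for a lazy queryset that counts simulated DB hits."""

    def __init__(self, rows):
        self._rows = rows
        self.query_count = 0

    def exists(self):
        self.query_count += 1  # simulates an existence-check query
        return bool(self._rows)

    def __iter__(self):
        self.query_count += 1  # simulates fetching the full result set
        return iter(list(self._rows))


def redact_with_exists(queryset):
    """Original shape: existence check, then a separate iteration."""
    if not queryset.exists():
        return False
    for record in queryset:
        record["new_email"] = "redacted"
    return True


def redact_with_list(queryset):
    """Suggested shape: evaluate once, then work on the in-memory list."""
    records = list(queryset)
    if not records:
        return False
    for record in records:
        record["new_email"] = "redacted"
    return True


first = CountingQuerySet([{"new_email": "a@example.com"}])
second = CountingQuerySet([{"new_email": "b@example.com"}])
first_result = redact_with_exists(first)
second_result = redact_with_list(second)
```

Both functions return the same result; only the number of simulated queries differs (two for the first, one for the second).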

fix: redact pending primary email before retirement deletion

5e6a070

ktyagiapphelix2u marked this pull request as ready for review April 23, 2026 11:29

ktyagiapphelix2u requested a review from a team as a code owner April 23, 2026 11:29

robrap reviewed Apr 24, 2026


fix: redact pending primary email before retirement deletion

4eb27b5

Akanshu-2u reviewed Apr 27, 2026


ktyagiapphelix2u wants to merge 2 commits into openedx:master from ktyagiapphelix2u:ktyagi/primaryemail

3 participants
