Aspects data fixes by saraburns1 · Pull Request #170 · openedx/aspects-dbt

saraburns1 · 2026-06-12T15:37:57Z

deprecate unneeded performance model (the dataset that used this is changing)
Remove 'emission_time' from model (was preventing replacingmerge)
use last response model to get more accurate data

the original query was taking the first successful response for each actor and left joining all attempts - which means that if an actor never had a successful attempt, their actor_id and number of attempts would be NULL. we then did a distinct at the end which would only keep 1 record for each problem that never had a correct attempt instead of actually counting how many attempts were made

the new query uses the last response for each actor regardless of if its successful or not. this way, we can get an accurate count of incorrect and correct responses and all data is populated for each attempt.

part of
openedx/openedx-aspects#369
openedx/openedx-aspects#370
openedx/openedx-aspects#372

bmtcril · 2026-06-12T19:15:17Z

 group by
    org,
    course_key,
-    emission_time,


I'm trying to understand how this is causing the replacing merge tree issues, were there events with duplicate timestamps being incorrectly aggregated here?

the timestamps were causing events to NOT be aggregated but we need them to. the mv is keyed on response and the response_count should have been updated to the aggregate each time a new event came in, but the emission_time made the count always 1 and then the mv would just replace the previous record with the same values and still a count of 1

bmtcril · 2026-06-12T19:27:15Z

-            first_success.attempts as attempts,
-            first_success.actor_id as actor_id,
-            splitByChar('@', events.problem_id)[3] as block_id_short,
+            last_response.org as org,


I've been out of this for a while, can you write up a quick explanation of the fix?

the original query was taking the first successful response for each actor and left joining all attempts - which means that if an actor never had a successful attempt, their actor_id and number of attempts would be NULL. we then did a distinct at the end which would only keep 1 record for each problem that never had a correct attempt instead of actually counting how many attempts were made

the new query uses the last response for each actor regardless of if its successful or not. this way, we can get an accurate count of incorrect and correct responses and all data is populated for each attempt.

saraburns1 added 6 commits June 4, 2026 11:41

fix: use last response instead of first success

7a62923

fix: remove emission time

30db978

Merge branch 'main' into aspects_changes

ae6b8e0

fix: cleanup

fe35f27

fix: cleanup

5a16081

fix: unit tests

9ed72bf

bmtcril reviewed Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aspects data fixes#170

Aspects data fixes#170
saraburns1 wants to merge 6 commits into
openedx:mainfrom
saraburns1:aspects_changes

saraburns1 commented Jun 12, 2026 •

edited

Loading

Uh oh!

bmtcril Jun 12, 2026

Uh oh!

saraburns1 Jun 17, 2026

Uh oh!

bmtcril Jun 12, 2026

Uh oh!

saraburns1 Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

saraburns1 commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bmtcril Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

saraburns1 Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

bmtcril Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

saraburns1 Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

saraburns1 commented Jun 12, 2026 •

edited

Loading