If no FailureLines exist for a failure, fallback to TestLogFailure #9079

camd · 2025-11-18T18:25:33Z

An experiment to see how this looks.

Here were the options Claude proposed. I went with Option 1 in this PR:

Option 1: Make Push Health use TextLogError as a fallback ✅ RECOMMENDED

Modify treeherder/push_health/tests.py:322-341 to query TextLogError when no FailureLine objects exist
Minimal changes, uses existing data
Pros: Quick fix, uses data that's already there
Cons: Mixing two data sources; TextLogError doesn't have structured test data like FailureLine

Option 2: Investigate and fix FailureLine generation

Determine if timeout failures create FailureLine objects
If yes, what action do they have? Add that action to Push Health's query
If no, fix the log parser to create FailureLine objects for timeouts
Pros: Fixes the root cause
Cons: Requires understanding the Mozilla test harness structured log format; may require changes upstream

Option 3: Create FailureLine objects from TextLogError

When TextLogError objects exist but no matching FailureLine, create synthetic FailureLine objects
Pros: Unifies the data sources
Cons: More complex; adds processing overhead

My Recommendation

Start with Option 1 (fallback to TextLogError) because:

It's the fastest solution
The data is already available and working in the Summary Panel
It's a minimal code change to treeherder/push_health/tests.py
You can later implement Option 2 as a proper fix

codecov-commenter · 2025-11-18T18:38:09Z

Codecov Report

❌ Patch coverage is 18.07229% with 68 lines in your changes missing coverage. Please review.
✅ Project coverage is 80.10%. Comparing base (75d47a7) to head (8cf3aa9).

Files with missing lines	Patch %	Lines
treeherder/push_health/tests.py	18.07%	68 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #9079      +/-   ##
==========================================
- Coverage   80.26%   80.10%   -0.16%     
==========================================
  Files         596      596              
  Lines       32182    32265      +83     
  Branches     3276     3269       -7     
==========================================
+ Hits        25830    25845      +15     
- Misses       6184     6252      +68     
  Partials      168      168

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

jmaher

it seems that this is solving for the case where we are pending log parsing- but that is a normal thing, yet probably not so misleading. If you think this will make push health more responsive and accurate (even the perception), then let me know

jmaher · 2025-11-18T19:01:36Z

treeherder/push_health/tests.py

+                            total_jobs_for_type,
+                            is_investigated,
+                            investigated_test_id,
+                        )


would this end up with a duplicate entry, assuming we have:
TEST-UNEXPECTED-FAIL | test_1.js | timed out
TEST-CRASH | | test_1.js

in the end we don't need both, I will be curious where this ends up.

Taking a look now...

Looks like we'd just get whichever one comes first. So likely the timeout. Would it be better to show the crash instead?

heh, whatever has the most information. I think a crash is a better signal than timeout the majority of the time. I guess we could replace (add/delete) if a crash comes in after the timeout?

Sounds good. I made the change to use the crash, rather than timeout if there is both.

camd · 2025-11-19T23:12:23Z

it seems that this is solving for the case where we are pending log parsing- but that is a normal thing, yet probably not so misleading. If you think this will make push health more responsive and accurate (even the perception), then let me know

This is a really good point. I added logic to check if the job's log parsing is still pending before it tries to fall back to the TextLogErrors.

If no FailureLines exist for a failure, fallback to TestLogFailure

672b1ab

camd self-assigned this Nov 18, 2025

camd requested a review from jmaher November 18, 2025 18:25

jmaher reviewed Nov 18, 2025

View reviewed changes

don't fallback if log parsing is pending.

8cf3aa9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

If no FailureLines exist for a failure, fallback to TestLogFailure #9079

If no FailureLines exist for a failure, fallback to TestLogFailure #9079

Uh oh!

camd commented Nov 18, 2025

Uh oh!

codecov-commenter commented Nov 18, 2025 •

edited

Loading

Uh oh!

jmaher left a comment

Uh oh!

jmaher Nov 18, 2025

Uh oh!

camd Nov 19, 2025

Uh oh!

camd Nov 19, 2025

Uh oh!

jmaher Nov 19, 2025

Uh oh!

camd Nov 20, 2025

Uh oh!

camd commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

If no FailureLines exist for a failure, fallback to TestLogFailure #9079

Are you sure you want to change the base?

If no FailureLines exist for a failure, fallback to TestLogFailure #9079

Uh oh!

Conversation

camd commented Nov 18, 2025

Uh oh!

codecov-commenter commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jmaher left a comment

Choose a reason for hiding this comment

Uh oh!

jmaher Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

camd Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

camd Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

jmaher Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

camd Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

camd commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov-commenter commented Nov 18, 2025 •

edited

Loading