Consider high risk evaluation result from other unrelated prs #2317

xueqzhan · 2025-02-05T20:47:30Z

With this there are still a few caveats:

Some jobs are missing risk analysis file. Those jobs failed to contact sippy during analysis time. I assume those are the ones without external network access. A few examples:

pull-ci-openshift-installer-main-e2e-vsphere-host-groups-ovn-custom-no-upgrade
pull-ci-openshift-console-master-okd-scos-e2e-aws-ovn

RA does not deal with junits created by sippy (e.g. install units) or the ones created by gather extra (e.g. operator related). But PR commenting still picks them up. Eventually we probably want to filter them out.
RA does not deal with some non-openshift-tests tests. Some examples:

Hypershift tests: pull-ci-openshift-cluster-version-operator-main-e2e-hypershift
pull-ci-openshift-cluster-ingress-operator-master-e2e-aws-operator

openshift-ci · 2025-02-05T20:48:13Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: xueqzhan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [xueqzhan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

xueqzhan · 2025-02-05T20:48:58Z

pkg/sippyserver/pr_commenting_processor.go

+		if summary.RiskLevel == api.FailureRiskLevelHigh {
+			riskAnalysisPRTestRiskMetric.WithLabelValues(org, repo, number, jobName, jobID, testSummary.Name).Set(float64(testSummary.Risk.Level.Level))
+		} else {
+			riskAnalysisPRTestRiskMetric.DeleteLabelValues(org, repo, number, jobName, jobID, testSummary.Name)


I struggled to unset the metric gauge. At first I thought this would work. But in reality, failed test might pass in the next run and will not be evaluated for risk analysis. That means they will not be in the summary.

I wonder if we need a separate daemon processor for metrics. Something that could look at the previous 24 hours and track the active count until it dropped off to 0 for 24 hours or so.

Could be better to leave the metrics out and focus that work on TRT-1704

At the moment, we only set this for FailureRiskLevelHigh. I would be interested in a metric that published the count for how many failures we have seen recently for a High Risk Failure, the greater the count the more likely we are detecting a regression. Again, this work would likely be better in the other card.

openshift-ci · 2025-02-05T23:56:50Z

@xueqzhan: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

neisw · 2025-02-11T17:44:08Z

pkg/api/job_runs.go

@@ -280,6 +287,7 @@ func JobRunRiskAnalysis(dbc *db.DB, jobRun *models.ProwJobRun, jobRunTestCount i
 			logger.WithError(err).Errorf("Failed to find matching jobIds for: %s", jobRun.ProwJob.Name)
 		}
 	}
+	logger.Infof("Found %d unfilered matching jobs %d runs for: %s\njobs %+v", len(jobNames), totalJobRuns, jobRun.ProwJob.Name, jobNames)


s/unfilered/unfiltered

neisw · 2025-02-11T18:24:57Z

pkg/api/job_runs.go

@@ -553,6 +572,55 @@ func runTestRunAnalysis(failedTest models.ProwJobRunTest, jobRun *models.ProwJob
 	return analysis, nil
 }

+func isHighRiskInOtherPRs(bqc *bigquery.Client, failedTest models.ProwJobRunTest, jobRun *models.ProwJobRun) bool {


Have you tried running locally? I would be curious if we could find a case or fake one out to run through this logic.

neisw · 2025-02-13T12:50:49Z

pkg/api/job_runs.go

@@ -161,7 +167,7 @@ func findReleaseMatchJobNames(dbc *db.DB, jobRun *models.ProwJobRun, compareRele
 			}

 			if len(jobs) > 0 {
-				logger.Infof("Found %d matches with: %s", len(jobs), name)
+				logger.Infof("Found %d matches with: %s\njobs %+v", len(jobs), name, jobs)


This is a lot of logging isn't it? An array of jobs each time within a loop...

neisw · 2025-02-13T12:54:56Z

pkg/api/job_runs.go

+		}
+		rowCount = values[0].(int64)
+		if rowCount > 0 {
+			log.Infof("High risk items found in other PRs for job %s test '%s'", jobRun.ProwJob.Name, failedTest.Test.Name)


How about adding the count to the logging? Maybe even returning the count so we could use it in a metric

neisw · 2025-02-13T13:27:30Z

pkg/api/job_runs.go

+			analysis.Risk = apitype.TestFailureRisk{
+				Level: apitype.FailureRiskLevelNone,
+				Reasons: []string{
+					"High risk was identified in other PRs first",


I'm wondering if this shouldn't go straight to none but instead be set to Medium with the reason indicating that the test may have regressed external to the current PR. Potential external regression detected for High Risk Test analysis

neisw · 2025-02-13T17:44:39Z

pkg/api/job_runs.go

@@ -106,6 +111,7 @@ func FetchJobRun(dbc *db.DB, jobRunID int64, logger *log.Entry) (*models.ProwJob
 	// Load the ProwJobRun, ProwJob, and failed tests:
 	// TODO: we may want to expand to analyzing flakes here in the future
 	res := dbc.DB.Joins("ProwJob").
+		Preload("PullRequests").


Was this something that is being used already and is performance boost or left over from previous work (I didn't see a new reference to pull requests but might be missing it.

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 5, 2025

openshift-ci bot requested review from DennisPeriquet and stbenjam February 5, 2025 20:48

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 5, 2025

xueqzhan commented Feb 5, 2025

View reviewed changes

Consider high risk evaluation result from other unrelated prs

c26d118

xueqzhan force-pushed the risk-other-repos branch from 5e6cb1b to c26d118 Compare February 5, 2025 22:14

openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 5, 2025

neisw reviewed Feb 11, 2025

View reviewed changes

neisw reviewed Feb 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider high risk evaluation result from other unrelated prs #2317

Consider high risk evaluation result from other unrelated prs #2317

xueqzhan commented Feb 5, 2025 •

edited

Loading

openshift-ci bot commented Feb 5, 2025

xueqzhan Feb 5, 2025

neisw Feb 11, 2025

neisw Feb 11, 2025

neisw Feb 13, 2025

openshift-ci bot commented Feb 5, 2025

neisw Feb 11, 2025

neisw Feb 11, 2025

neisw Feb 13, 2025

neisw Feb 13, 2025

neisw Feb 13, 2025

neisw Feb 13, 2025

Consider high risk evaluation result from other unrelated prs #2317

Are you sure you want to change the base?

Consider high risk evaluation result from other unrelated prs #2317

Conversation

xueqzhan commented Feb 5, 2025 • edited Loading

openshift-ci bot commented Feb 5, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

openshift-ci bot commented Feb 5, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

xueqzhan commented Feb 5, 2025 •

edited

Loading