[YSQL] Major version upgrade failure on YBA universe #24839

Closed
1 task done
foucher opened this issue Nov 7, 2024 · 2 comments
Assignees
Labels
area/ysql (Yugabyte SQL), kind/bug (This issue is a bug), priority/medium (Medium priority issue)

Comments

@foucher
Contributor

foucher commented Nov 7, 2024

Jira Link: DB-13949

Description

Still investigating, but the main error from the upgrade logs is:

pg_restore: while PROCESSING TOC:
pg_restore: from TOC entry 2808; 1259 16402 INDEX XXXXidx yugabyte
pg_restore: error: could not execute query: ERROR:  could not serialize access due to concurrent update
Command was: CREATE INDEX "XXXXidx" ON "public"."XXXX" USING "lsm" ("id" HASH, "YYYY" ASC);

This seems possibly timing-related: the error occurs on 2-core nodes but not on 8-core nodes.

Could it be related to #24727?

Issue Type

kind/bug

Warning: Please confirm that this issue does not contain any sensitive information

  • I confirm this issue does not contain any sensitive information.
@foucher added the area/ysql and status/awaiting-triage labels on Nov 7, 2024
@foucher self-assigned this on Nov 7, 2024
@yugabyte-ci added the kind/bug and priority/medium labels on Nov 7, 2024
@foucher
Contributor Author

foucher commented Nov 7, 2024

From the YB master leader log, I believe this is the actual issue [line breaks/whitespace added]:

I1106 16:27:57.913406 58595 tablet_rpc.cc:495] Operation failed. Try again (yb/docdb/conflict_resolution.cc:87):
Failed Write(tablet: 1fe231164f3d489ebed535aec7d1c2ec, num_ops: 142, num_attempts: 1, txn: a5ae262a-d162-4959-8305-275b74d45282, subtxn: [none])
to tablet 1fe231164f3d489ebed535aec7d1c2ec
on tablet server { uuid: 40b73ecb495a4e71b9e6ded5f31b17e0
 private: [host: "10.9.218.183" port: 9100]
 cloud_info: placement_cloud: "aws" placement_region: "us-west-2" placement_zone: "us-west-2d"
after 1 attempt(s):
a5ae262a-d162-4959-8305-275b74d45282 conflicts with higher priority transaction: c8af857a-fa3e-4cdf-848a-6b0bccb06b15 (transaction error 3)

Since the failed write reports 142 ops, it seems possible that this is the issue being fixed by Fizaa.
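When comparing runs across nodes, it can help to pull the relevant fields out of such a log entry mechanically. The following is a minimal sketch, assuming the message has the shape quoted above; the function name `parse_conflict` and both regular expressions are illustrative and are not part of YugabyteDB or its tooling.

```python
import re

# Both patterns assume the message layout quoted in this comment.
# They are illustrative only and may not match other log formats.
WRITE_RE = re.compile(
    r"Failed Write\(tablet: (?P<tablet>[0-9a-f]+), "
    r"num_ops: (?P<num_ops>\d+), "
    r"num_attempts: (?P<num_attempts>\d+), "
    r"txn: (?P<txn>[0-9a-f-]+)"
)
CONFLICT_RE = re.compile(
    r"conflicts with higher priority transaction: (?P<winner>[0-9a-f-]+)"
)


def parse_conflict(message):
    """Return the write/conflict fields of a log message, or None if absent."""
    write = WRITE_RE.search(message)
    conflict = CONFLICT_RE.search(message)
    if write is None or conflict is None:
        return None
    return {
        "tablet": write["tablet"],
        "num_ops": int(write["num_ops"]),
        "num_attempts": int(write["num_attempts"]),
        "txn": write["txn"],            # the transaction that lost the conflict
        "winner": conflict["winner"],   # the higher-priority transaction
    }


# Sample text copied from the log excerpt above (only the two relevant lines).
sample = (
    "Failed Write(tablet: 1fe231164f3d489ebed535aec7d1c2ec, num_ops: 142, "
    "num_attempts: 1, txn: a5ae262a-d162-4959-8305-275b74d45282, subtxn: [none])\n"
    "a5ae262a-d162-4959-8305-275b74d45282 conflicts with higher priority "
    "transaction: c8af857a-fa3e-4cdf-848a-6b0bccb06b15 (transaction error 3)"
)

info = parse_conflict(sample)
```

Because neither pattern spans a line break, the sketch should work both on the original single-line log entry and on the wrapped version shown here.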

@sushantrmishra removed the status/awaiting-triage label on Nov 8, 2024
@foucher
Contributor Author

foucher commented Nov 12, 2024

The issue didn't reproduce on b280, which contains 3cdb27c. Closing this.
