Hive 29574 merge join poc#6456
Conversation
|
abstractdog
left a comment
There was a problem hiding this comment.
nice patch @illiabarbashov-sketch so far, let some comments
d9bc666 to
f97785a
Compare
abstractdog
left a comment
There was a problem hiding this comment.
left minor comments, also the last run has a related failure:
https://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-6456/12/tests/
other than that, this is quite close! 🤞
| import org.apache.hadoop.hive.ql.exec.*; | ||
| import org.apache.hadoop.hive.ql.plan.*; |
There was a problem hiding this comment.
we tend to extract wildcard imports to one-by-one
| ReduceSinkDesc rsConf = mock(ReduceSinkDesc.class); | ||
| when(rsConf.getKeyCols()).thenReturn(Collections.singletonList(keyExpr)); | ||
|
|
||
| @SuppressWarnings("unchecked") ReduceSinkOperator rs = mock(ReduceSinkOperator.class); |
There was a problem hiding this comment.
most style guides and formatters (including Google Java Format) will place the annotation on its own line for readability, I recommend doing the same here
| String[] descTableAliases = conf.getSkewJoinTableAliases(); | ||
|
|
||
| for (int pos = 0; pos < maxAlias; pos++) { | ||
| joinSkewKeyColumns[pos] = (descKeyNames != null && pos < descKeyNames.length |
There was a problem hiding this comment.
I can still see the same pattern here, can you double-check, please?
joinSkewKeyColumns[pos] = (descKeyNames != null && pos < descKeyNames.length
&& descKeyNames[pos] != null) ? descKeyNames[pos] : "unknown";
|
there are some warnings in sonarqube report: some of them can be fixed easily, like |
f8348ed to
09d4440
Compare
abstractdog
left a comment
There was a problem hiding this comment.
minor comments @illiabarbashov-sketch , otherwise this looks good to me, almost ready :)
| private JoinOperationMetadataResolver resolver; | ||
|
|
||
| @BeforeEach | ||
| void setUp() { |
There was a problem hiding this comment.
nit: it's strange for me that these unit tests are not public, I got used to see it, can you please fix
There was a problem hiding this comment.
sorry, I just got to know: it's fine as is in case of Junit5 and Sonar complains
| Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Data skew detected in merge join: 2 rows accumulated for join column(s) [testcolkey_a] in table alias [a]. Consider reviewing data distribution. | ||
| #### A masked pattern was here #### | ||
| ]], Vertex did not succeed due to OWN_TASK_FAILURE, failedTasks:1 killedTasks:0, Vertex vertex_#ID# [Reducer 2] killed/failed due to:OWN_TASK_FAILURE] | ||
| DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 killedVertices:0 |
There was a problem hiding this comment.
I believe this is going to be masked after HIVE-29656, so needs to be changed here
|



What changes were proposed in this pull request?
Why are the changes needed?
Does this PR introduce any user-facing change?
No
How was this patch tested?