-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-51439][SQL] Support SQL UDF with DEFAULT argument #50408
base: master
Are you sure you want to change the base?
Conversation
@cloud-fan @allisonwang-db Please take a look at this follow up of [SPARK-50763][SQL] Add Analyzer rule for resolving SQL table functions! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great!
sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/DataTypeParserInterface.scala
Outdated
Show resolved
Hide resolved
sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4
Outdated
Show resolved
Hide resolved
sql/api/src/main/scala/org/apache/spark/sql/catalyst/parser/DataTypeAstBuilder.scala
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
explain refactor
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/ParserUtils.scala
Show resolved
Hide resolved
sql/api/src/main/scala/org/apache/spark/sql/catalyst/util/SparkParserUtils.scala
Outdated
Show resolved
Hide resolved
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala
Show resolved
Hide resolved
@@ -45,6 +45,70 @@ class DataTypeAstBuilder extends SqlBaseParserBaseVisitor[AnyRef] { | |||
withOrigin(ctx)(StructType(visitColTypeList(ctx.colTypeList))) | |||
} | |||
|
|||
override def visitSingleRoutineParamList(ctx: SingleRoutineParamListContext): StructType = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
where do we fail this method if unsupported feature such as generated column is specified?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's checked in SparkSqlParser.visitCreateUserDefinedFunction. I should add comment in visitStructField to explain
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/UserDefinedFunction.scala
Show resolved
Hide resolved
* Create a [[StructField]] from a column definition which allows options like COMMENT and | ||
* DEFAULT. | ||
* | ||
* Don't handle generation expression since this function is currently only used for creating |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why don't we reject it here within this function?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we reject it in SparkSqlParser to report the right ParserRuleContext
* SQL functions which don't support generation expressions. The rejection logic is in | ||
* [[SqlBaseParserVisitor#visitCreateUserDefinedFunction()]] implementation. | ||
*/ | ||
private def visitStructField( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This method looks very similar to visitColDefinition
. shall we invoke visitColDefinition
here, fail if ColumnDefinition#generationExpression/identityColumnSpec
is defined, and call ColumnDefinition#toV1Column
to get StructField
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
@@ -50,6 +50,10 @@ abstract class AbstractParser extends DataTypeParserInterface with Logging { | |||
astBuilder.visitSingleTableSchema(parser.singleTableSchema()) | |||
} | |||
|
|||
override def parseRoutineParam(sqlText: String): StructType = parse(sqlText) { parser => |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
so the main difference between this and parseTableSchema
is: it supports default expression, and it fails for unsupported fields.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah
Continuing @allisonwang-db's work on #50373 and #49471
What changes were proposed in this pull request?
This PR adds support for DEFAULT arguments in SQL UDF. Examples:
See sql-udf.sql for more valid and invalid examples.
Why are the changes needed?
To support default arguments in SQL UDFs.
Does this PR introduce any user-facing change?
Yes. Now SQL UDFs support DEFAULT arguments.
A side effect of the grammar change is that some invalid function parameter definitions are now no longer rejected by the grammar, but instead rejected by the parser logic.
Examples:
This doesn't change the behavior of existing valid SQL.
How was this patch tested?
End-to-end regression tests in
sql-udf.sql
and simple tests inSQLFunctionSuite
.Was this patch authored or co-authored using generative AI tooling?
No