[PROTOCOL] [BUG] Inconsistent behavior for binary type partition value serialization #4189
Open
Labels
bug
Bug
Which Delta project/connector is this regarding?
Describe the problem
PROTOCOL says:
Encoded as a string of escaped binary values. For example, "\u0001\u0002\u0003"
This escaping is not applied when the binary value's bytes can be represented as a string:
INSERT INTO TestBinary VALUES (10, X'123456'); -- OK
INSERT INTO TestBinary VALUES (10, CAST('Hello' AS BINARY)); -- NOT OK
Steps to reproduce
CREATE TABLE TestBinary (id INT, value BINARY) PARTITIONED BY (value);
INSERT INTO TestBinary VALUES (10, CAST('Hello' as BINARY));
INSERT INTO TestBinary VALUES (10, X'123456');
Observed results
Hello
Expected results
\u0048\u0065\u006c\u006c\u006f
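For reference, here is a minimal sketch of how the expected value above can be derived from the raw bytes. It assumes the PROTOCOL wording means "one `\uXXXX` escape per byte", which is an interpretation of the quoted example rather than something the spec states in those words. The helper name `escape_binary_partition_value` is hypothetical and is not part of Delta Lake.

```python
def escape_binary_partition_value(data: bytes) -> str:
    # Hypothetical helper: render every byte as a literal backslash-u escape
    # with four lowercase hex digits, regardless of whether the byte is printable.
    return "".join(f"\\u{b:04x}" for b in data)


# CAST('Hello' AS BINARY) from the repro steps
print(escape_binary_partition_value(b"Hello"))
# prints \u0048\u0065\u006c\u006c\u006f

# X'123456' from the repro steps
print(escape_binary_partition_value(bytes.fromhex("123456")))
# prints \u0012\u0034\u0056
```

Under this reading, printable and non-printable bytes go through the same path, so 'Hello' would be serialized the same way as X'123456' instead of being written as plain text.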
Environment information
Willingness to contribute
The Delta Lake Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the Delta Lake code base?