-
Notifications
You must be signed in to change notification settings - Fork 178
Adding repeated properties to schema results in corrupt parquet file. #67
Copy link
Copy link
Closed
Description
Version 0.8.0
Having some issues with repeated. The resulting parquet file seems to have errors in it.
org.apache.parquet.io.ParquetDecodingException: Can not read value at 0 in block -1 in file file:/PATHTOFILE/profile.parquet
Here is the code I'm testing with, it's the identities object that is causing the problem.
let schema = new parquet.ParquetSchema({
person: {
repeated: false,
fields: {
firstName: {
type: 'UTF8'
},
lastName: {
type: 'UTF8'
}
}
},
identities: {
repeated: true,
fields: {
id: {
type: 'UTF8'
},
xid: {
type: 'UTF8'
}
}
}
});
async function writeToParquet(schema) {
// create new ParquetWriter that writes to 'fruits.parquet`
var writer = await parquet.ParquetWriter.openFile(schema, 'profile.parquet');
writer.appendRow({
person: {
firstName: "Test",
lastName: "User"
},
identities: [{
id: "ID",
xid: "XID"
},{
id: "ID",
xid: "XID"
}]
});
await writer.close();
}
writeToParquet(schema);```
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels