Arrow returns undefined as values when reading from a large arrow file #54

Open
@denisgursky

Description

./file.arrow has about 235M rows and is 3.77 GB on disk.

const fs = require("fs");
const { tableFromIPC, RecordBatchReader } = require("apache-arrow");

const filePath = "./file.arrow";

const stream = fs.createReadStream(filePath);
const reader = RecordBatchReader.from(stream);

(async function () {
  const table = await tableFromIPC(reader);

  // /Int64/Float64
  console.log("rows num", table.numRows);
  console.log("first row", table.get(0).toArray());
})();

table.get(0).toArray() returns [undefined, undefined].
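One way to narrow this down might be to walk the file batch by batch instead of building the whole table, and note the first record batch whose first row comes back with undefined values. The sketch below is only a diagnostic aid, not a fix, and it does not establish the cause. findFirstUndefinedBatch is a hypothetical helper, not part of apache-arrow; it assumes each batch exposes numRows and get(i).toArray(), as Arrow's RecordBatch does, and it only inspects the first row of every batch.

```javascript
// Hypothetical diagnostic helper (not part of apache-arrow).
// Walks a sync or async iterable of record batches and returns details about
// the first batch whose first row contains an undefined value, or null if
// no such batch is found. Batches are not retained after they are checked.
async function findFirstUndefinedBatch(batches) {
  let batchIndex = 0; // position of the current batch in the stream
  let rowOffset = 0;  // number of rows seen before the current batch

  for await (const batch of batches) {
    if (batch.numRows > 0) {
      const row = batch.get(0);
      // A missing row is treated the same as a row holding undefined.
      const values = row ? row.toArray() : [undefined];
      if (values.some((v) => v === undefined)) {
        return { batchIndex, rowOffset, values };
      }
    }
    rowOffset += batch.numRows;
    batchIndex += 1;
  }
  return null;
}

// Possible usage with the reader from the report (assumes apache-arrow is installed):
//   const fs = require("fs");
//   const { RecordBatchReader } = require("apache-arrow");
//   const reader = await RecordBatchReader.from(fs.createReadStream("./file.arrow"));
//   console.log(await findFirstUndefinedBatch(reader));
```

If this returns batchIndex 0, the values are already undefined in the very first batch; a larger rowOffset would suggest the problem only shows up after a certain amount of data has been read.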
