readParquetSchema without loading entire file by AshyIsMe · Pull Request #7 · interregna/JArrow

AshyIsMe · 2023-04-24T05:10:01Z

This PR is intended to allow reading just the schema from a Parquet file without having to read all of the data into memory.

Unfortunately this approach still reads the entire file into memory (and then seems to leak that memory immediately...).

It looks like this is a limitation of the Arrow GLib C binding, but that seems so brain-dead that I surely must be wrong...
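For context on why a schema-only read should be possible in principle: the Parquet format stores the file metadata (which includes the schema) in a footer at the end of the file, followed by a 4-byte little-endian footer length and the magic bytes `PAR1`. The sketch below is plain Python rather than the Arrow GLib binding, and it is not what this PR does. `read_parquet_footer` is a hypothetical helper that seeks to the tail and returns only the raw footer bytes. Decoding the Thrift-encoded metadata into an actual schema is not shown, and encrypted-footer files are not handled.

```python
import struct

PARQUET_MAGIC = b"PAR1"

def read_parquet_footer(path):
    """Return the raw (Thrift-encoded) footer bytes of a Parquet file.

    Hypothetical helper for illustration: only the tail of the file is
    read, so memory use depends on the footer size, not the data size.
    """
    with open(path, "rb") as f:
        f.seek(0, 2)                      # jump to end of file
        file_size = f.tell()
        # Smallest possible layout: leading magic + footer length + trailing magic.
        if file_size < 12:
            raise ValueError("file too small to be a Parquet file")

        f.seek(file_size - 8)
        tail = f.read(8)                  # 4-byte footer length + 4-byte magic
        if tail[4:] != PARQUET_MAGIC:
            raise ValueError("missing trailing PAR1 magic")

        footer_len = struct.unpack("<I", tail[:4])[0]
        if footer_len > file_size - 12:
            raise ValueError("footer length exceeds file size")

        f.seek(file_size - 8 - footer_len)
        return f.read(footer_len)         # metadata bytes only, no row data
```

If a reader loads the whole file just to return the schema, that is a property of how that reader or binding was written, not of the file format.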

interregna · 2023-05-23T23:50:14Z

Sounds like this doesn't solve the problem. Will take a look and see if there's some way to release the memory and not read the whole file.

readParquetSchema without loading entire file

654c734
