perf: only check Parquet type once in NativeBatchReader

NativeBatchReader calls `checkParquetType` on every column on every invocation of `loadNextBatch`. I tried moving the check up to `init`, but some Spark SQL tests expect the exceptions it generates to come from a call stack going through `loadNextBatch` rather than `init`. Perhaps we can stash a boolean recording that the columns were checked on the first batch, and then elide the column checks on subsequent batches.
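A minimal sketch of the boolean-flag idea, to make the proposal concrete. This is not the actual NativeBatchReader code: the class `OneTimeCheckReader`, the `typeCheck` callback (a stand-in for `checkParquetType`), the `typesChecked` field, and the `checksRun` counter are all hypothetical names chosen for illustration.

```java
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical stand-in for NativeBatchReader; only the check-once logic is modeled. */
class OneTimeCheckReader {
    private final List<String> columns;
    // Stand-in for checkParquetType: throws if a column's type is unsupported.
    private final Consumer<String> typeCheck;
    // Hypothetical flag: true once every column has passed the type check.
    private boolean typesChecked = false;
    // Counts check invocations so the effect of the flag is observable.
    private int checksRun = 0;

    OneTimeCheckReader(List<String> columns, Consumer<String> typeCheck) {
        this.columns = columns;
        this.typeCheck = typeCheck;
    }

    /** The check still runs inside loadNextBatch, so exceptions keep this call stack. */
    boolean loadNextBatch() {
        if (!typesChecked) {
            for (String column : columns) {
                checksRun++;
                typeCheck.accept(column); // may throw on the first batch
            }
            // Set only after all columns pass, so a failed check is retried.
            typesChecked = true;
        }
        // ... actual batch loading would go here ...
        return true;
    }

    int checksRun() {
        return checksRun;
    }
}
```

Because the flag is set only after the loop completes, a check that throws on the first call will throw again on a retry, and the exception still originates from `loadNextBatch` rather than `init`, which is what the Spark SQL tests mentioned above expect.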


perf: only check Parquet type once in NativeBatchReader #1810
