Add support for scalar UDFs that operate on Arrow data #4177

@andygrove

What is the problem the feature request solves?

Spark supports UDFs that operate on row-based data. I would like Comet to support UDFs that operate on batches of columnar data in Arrow format so that users can implement UDFs and keep execution in Comet.

There is a prototype of this in #4170, where it is used to implement regexp expressions that are compatible with Spark. If the prototype is successful, we can expose the Comet UDF functionality to users with some additional work.
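To make the row-based versus columnar distinction concrete, here is a minimal sketch in plain Python. It is not Comet's API or the design in #4170: the names `row_udf_matches` and `columnar_udf_matches` are hypothetical, a Python list with `None` for nulls stands in for an Arrow string column, and Python's `re` module stands in for Spark's Java regex semantics, which differ in places.

```python
import re
from typing import List, Optional

# Hypothetical stand-ins for Arrow columns: plain lists, with None marking nulls.
StringColumn = List[Optional[str]]
BoolColumn = List[Optional[bool]]


def row_udf_matches(value: Optional[str], pattern: str) -> Optional[bool]:
    """Row-based style: invoked once per row, one value at a time."""
    if value is None:
        return None  # propagate nulls
    return re.search(pattern, value) is not None


def columnar_udf_matches(column: StringColumn, pattern: str) -> BoolColumn:
    """Batch style: invoked once per batch, receives and returns a whole column."""
    compiled = re.compile(pattern)  # per-batch setup happens once, not per row
    return [None if v is None else compiled.search(v) is not None for v in column]


batch: StringColumn = ["comet-1", None, "spark", "comet-22"]
pattern = r"comet-\d+"

# Row-at-a-time evaluation: one call per value.
row_result = [row_udf_matches(v, pattern) for v in batch]

# Columnar evaluation: a single call for the whole batch.
batch_result = columnar_udf_matches(batch, pattern)

print(batch_result)
```

Both styles produce the same values here. The difference is the calling convention: the batch version sees the whole column at once, so setup work such as compiling the pattern can be done once per batch.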

Describe the potential solution

No response

Additional context

No response

Metadata

Labels

area:expressions (Expression evaluation)
enhancement (New feature or request)
priority:medium (Functional bugs, performance regressions, broken features)

Type

No type

Projects

Status

Done

Relationships

None yet

Development

No branches or pull requests
