Add missing conditional functions by timsaucer · Pull Request #1464 · apache/datafusion-python

timsaucer · 2026-03-29T23:31:45Z

Which issue does this PR close?

Closes #1449

Rationale for this change

A few conditional functions available in upstream DataFusion are missing from the Python library, so this PR exposes them.

What changes are included in this PR?

Exposes the Python functions `greatest`, `least`, `nvl2`, and `ifnull`.
Adds unit tests.

Are there any user-facing changes?

Addition only.

Copilot

Pull request overview

Exposes missing conditional scalar functions in datafusion-python to close #1449, adding Python wrappers/bindings and unit tests.

Changes:

Add Python API functions: greatest, least, nvl2, and ifnull (alias for nvl).
Add Rust PyO3 bindings for greatest, least (variadic) and nvl2.
Add unit tests covering numeric/string behavior and null handling.
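For readers unfamiliar with these functions, the following is a minimal pure-Python sketch of the semantics they are commonly understood to have, with `None` standing in for SQL null. It is not the code added by this PR and does not call datafusion-python. The null-skipping behavior of `greatest` and `least` is an assumption based on the upstream DataFusion documentation, and the parameter names are hypothetical.

```python
from typing import Any, Optional


def greatest(*values: Optional[Any]) -> Optional[Any]:
    """Largest non-null argument; None only when every argument is None (assumed)."""
    present = [v for v in values if v is not None]
    return max(present) if present else None


def least(*values: Optional[Any]) -> Optional[Any]:
    """Smallest non-null argument; None only when every argument is None (assumed)."""
    present = [v for v in values if v is not None]
    return min(present) if present else None


def nvl2(tested: Optional[Any], if_not_null: Any, if_null: Any) -> Any:
    """Return if_not_null when tested is not None, otherwise if_null."""
    return if_not_null if tested is not None else if_null


def ifnull(value: Optional[Any], default: Any) -> Any:
    """Return value unless it is None, in which case return default (same as nvl)."""
    return value if value is not None else default


if __name__ == "__main__":
    # Row-wise evaluation over two small columns, mimicking a per-row scalar function.
    a = [1, None, None]
    b = [3, 2, None]
    print([greatest(x, y) for x, y in zip(a, b)])
    print([nvl2(x, "set", "missing") for x in a])
```

Note that the checks use `is not None` rather than truthiness, so values such as `0` or an empty string count as present.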

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| python/datafusion/functions.py | Adds new public Python wrappers and updates exports (`__all__`). |
| crates/core/src/functions.rs | Exposes the corresponding DataFusion functions to Python via PyO3 wrappers. |
| python/tests/test_functions.py | Adds unit tests validating the new functions' behavior. |

Copilot · 2026-03-30T16:26:02Z

python/tests/test_functions.py

+    assert result.column(0).to_pylist() == ["banana", "cherry"]
+
+
+def test_least(df):


df fixture is accepted but never used in this test. Because pytest will still construct the fixture, this adds unnecessary setup cost and can slow the suite. Either remove the df parameter or refactor the test to reuse the provided fixture/context instead of creating a new SessionContext.

Suggested change

- def test_least(df):
+ def test_least():

Copilot · 2026-03-30T16:26:03Z

python/tests/test_functions.py

+def test_nvl2(df):
+    ctx = SessionContext()
+    batch = pa.RecordBatch.from_arrays(
+        [


df fixture is accepted but never used in this test. Because pytest will still construct the fixture, this adds unnecessary setup cost and can slow the suite. Either remove the df parameter or refactor the test to reuse the provided fixture/context instead of creating a new SessionContext.

Copilot · 2026-03-30T16:26:04Z

python/tests/test_functions.py

+def test_ifnull(df):
+    ctx = SessionContext()
+    batch = pa.RecordBatch.from_arrays(


df fixture is accepted but never used in this test. Because pytest will still construct the fixture, this adds unnecessary setup cost and can slow the suite. Either remove the df parameter or refactor the test to reuse the provided fixture/context instead of creating a new SessionContext.

Copilot · 2026-03-30T16:26:06Z

python/tests/test_functions.py

    )
+
+
+def test_greatest(df):


df fixture is accepted but never used in this test. Because pytest will still construct the fixture, this adds unnecessary setup cost and can slow the suite. Either remove the df parameter or refactor the test to reuse the provided fixture/context instead of creating a new SessionContext.

Suggested change

- def test_greatest(df):
+ def test_greatest():

timsaucer and others added 3 commits March 29, 2026 19:18

Add missing conditional functions: greatest, least, nvl2, ifnull (apa…

c8dc245

…che#1449) Expose four conditional functions from upstream DataFusion that were not yet available in the Python bindings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add unit tests for greatest, least, nvl2, and ifnull functions

0d14831

Tests cover multiple data types (integers, strings), null handling (all-null, partial-null), multiple arguments, and ifnull/nvl equivalence. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Use standard alias docstring pattern for ifnull

c773349

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

timsaucer marked this pull request as ready for review March 30, 2026 11:14

timsaucer requested a review from Copilot March 30, 2026 16:22

Copilot started reviewing on behalf of timsaucer March 30, 2026 16:22

timsaucer wants to merge 3 commits into apache:main from timsaucer:add-missing-conditional-functions