feat: Offline Store historical features retrieval based on datetime range in Ray #5738
base: master
Conversation
…ange in Ray
Signed-off-by: Aniket Paluskar <[email protected]>
jyejare left a comment:
Looks good initially, but I have some doubts. Tests also need to be added.
return pa.Table.from_pandas(df).schema
...
def _compute_non_entity_dates_ray(
I think we should make this a common utility function so it can be reused across all stores without repeating the code.
wdyt?
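To illustrate the suggestion, here is a minimal sketch of what a shared helper might look like, assuming its job is to validate and normalize the date-only retrieval window (the name `compute_retrieval_range` and its signature are hypothetical, not part of this PR):

```python
from datetime import datetime
from typing import Optional, Tuple


def compute_retrieval_range(
    start_date: Optional[datetime],
    end_date: Optional[datetime],
) -> Tuple[datetime, datetime]:
    """Validate and normalize a date-only retrieval window.

    Placed in a shared module so each offline store (Ray, Spark,
    Postgres, ...) does not re-implement the same checks.
    """
    if start_date is None or end_date is None:
        raise ValueError(
            "start_date and end_date are required when entity_df is None"
        )
    if start_date > end_date:
        raise ValueError("start_date must not be after end_date")
    return start_date, end_date
```

Each store-specific function (like `_compute_non_entity_dates_ray`) could then delegate to the shared helper and keep only the backend-specific parts.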
return _filter_range
...
def _make_select_distinct_keys(join_keys: List[str]):
I think we should not drop rows with duplicate IDs, because there can be multiple transactions per ID, and we need to choose the row based on timestamp when joining the columns from another table/view. I think this is the same case as in your Spark PR.
Please check the Postgres implementation to understand the case.
Or am I misreading this?
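To make the concern concrete: instead of dropping duplicate IDs, a point-in-time join should pick, per join key, the latest feature row at or before the entity's timestamp. A minimal pandas sketch of that behavior (the column names `driver_id` and `amount` are illustrative, not from this PR):

```python
import pandas as pd

# Feature rows: multiple transactions per driver_id (duplicate keys).
features = pd.DataFrame({
    "driver_id": [1, 1, 2],
    "event_timestamp": pd.to_datetime(
        ["2024-01-01", "2024-01-03", "2024-01-02"]
    ),
    "amount": [10.0, 30.0, 20.0],
})

# Entity rows carrying the timestamp we must join "as of".
entities = pd.DataFrame({
    "driver_id": [1, 2],
    "event_timestamp": pd.to_datetime(["2024-01-02", "2024-01-05"]),
})

# Point-in-time join: for each entity row, take the latest feature
# row whose timestamp is <= the entity timestamp, per join key.
result = pd.merge_asof(
    entities.sort_values("event_timestamp"),
    features.sort_values("event_timestamp"),
    on="event_timestamp",
    by="driver_id",
    direction="backward",
)
# driver 1 @ 2024-01-02 -> amount 10.0 (the 2024-01-03 row is in the future)
# driver 2 @ 2024-01-05 -> amount 20.0
```

Dropping the duplicate `driver_id=1` rows up front would lose the information needed to make this timestamp-based choice.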
Testing the case after the discussion.
What this PR does / why we need it:
Add support for entity_df=None in RayOfflineStore.get_historical_features with start_date/end_date.
-- Derives the entity set by reading distinct join keys from each FeatureView source within the time window, applies field mappings and join_key_map, filters by timestamp, and unions the aligned schemas.
-- Adds a stable event_timestamp = end_date for point-in-time (PIT) joins.
Signature change: get_historical_features now accepts entity_df: Optional[Union[pd.DataFrame, str]] and **kwargs.
-- Why: to match the base interface and support date-only retrieval.
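The entity-set derivation described above can be sketched in pandas as follows. This is an illustrative analogue of the Ray-based logic, not the PR's actual code; the function name `derive_entity_df` and the `(DataFrame, join_keys, ts_col)` source tuples are assumptions, and it assumes all sources share aligned join keys:

```python
from datetime import datetime
from typing import List, Tuple

import pandas as pd


def derive_entity_df(
    sources: List[Tuple[pd.DataFrame, List[str], str]],
    start_date: datetime,
    end_date: datetime,
) -> pd.DataFrame:
    """Union the distinct join keys observed in each source within
    [start_date, end_date] into a synthetic entity_df."""
    parts = []
    for df, join_keys, ts_col in sources:
        # Filter each source to the requested time window.
        in_window = df[(df[ts_col] >= start_date) & (df[ts_col] <= end_date)]
        # Keep only the distinct join-key combinations.
        parts.append(in_window[join_keys].drop_duplicates())
    entity_df = pd.concat(parts, ignore_index=True).drop_duplicates()
    # Stable event_timestamp = end_date, so the PIT join selects the
    # latest feature row at or before the end of the window.
    entity_df["event_timestamp"] = end_date
    return entity_df
```

Setting every derived entity row's event_timestamp to end_date keeps the subsequent point-in-time join deterministic: each key resolves to its latest feature values within the window.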
Which issue(s) this PR fixes:
RHOAIENG-38643
Misc