fix: ref() and source() in Python models by akmalsoliev · Pull Request #666 · aws-samples/dbt-glue

akmalsoliev · 2026-02-23T21:04:49Z

fix: ref() and source() in Python models to use dbt-core's resolved functions

The previous implementation constructed table references manually. When calling source("my_source_aliase", "my_table") it would literally pass those string into spark.table resulting in a call spark.table("my_source_aliase"."my_table").

This fix will resolve this issue with reutilization of

def ref(*args, **kwargs):
    refs = {"my_ref_table": "my_schema.my_ref_table"}
    key = '.'.join(args)
    version = kwargs.get("v") or kwargs.get("version")
    if version:
        key += f".v{version}"
    dbt_load_df_function = kwargs.get("dbt_load_df_function")
    return dbt_load_df_function(refs[key])

def source(*args, dbt_load_df_function):
    sources = {"my_source_aliase.my_source_table": "my_source_schema.my_source_table"}
    key = '.'.join(args)
    return dbt_load_df_function(sources[key])

NOTE: that instead of having my_source_aliase it points to my_source_schema, which would solve the issue of an incorrect pointer with spark.table, outputting spark.table("my_source_schema"."my_source_table")

This took me ages to figure out, was so confused on why there was source declaration.

resolves #

Description

Checklist

I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt-glue next" section.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…unctions The previous implementation constructed table references manually. When calling `source("my_source_aliase", "my_table")` it would literally pass those string into `spark.table` resulting in a call `spark.table("my_source_aliase"."my_table")`. This fix will resolve this issue with reutilization of ```py def ref(*args, **kwargs): refs = {"my_ref_table": "my_schema.my_ref_table"} key = '.'.join(args) version = kwargs.get("v") or kwargs.get("version") if version: key += f".v{version}" dbt_load_df_function = kwargs.get("dbt_load_df_function") return dbt_load_df_function(refs[key]) def source(*args, dbt_load_df_function): sources = {"my_source_aliase.my_source_table": "my_source_schema.my_source_table"} key = '.'.join(args) return dbt_load_df_function(sources[key]) ``` **NOTE:** that instead of having `my_source_aliase` it instead points to `my_source_schema`, which would solve the issue of an incorrect pointer with `spark.table`, outputting `spark.table("my_source_schema"."my_source_table")` This took me ages to figure out, was so confused on why there was source declaration.

akmalsoliev · 2026-03-06T11:25:37Z

I believe this also fixes the issue: #635

yotahk · 2026-03-09T15:01:25Z

Thanks for the fix. I believe delegating to dbt-core's resolved ref/source functions (as defined in py_script_postfix) is the right approach.

One small ask: could you add an entry to CHANGELOG.md for this change & complete the PR checklist?

update: CHANGELOG.md

akmalsoliev · 2026-03-09T17:28:25Z

Thanks for the fix. I believe delegating to dbt-core's resolved ref/source functions (as defined in py_script_postfix) is the right approach.

One small ask: could you add an entry to CHANGELOG.md for this change & complete the PR checklist?

Hey @yotahk,

You'll find the updates, thanks to Claude was able to understand how in the god's name to create test.

yotahk · 2026-03-12T11:06:33Z

Merged, thank you for the contribution!

github-actions Bot added the beginning-contributor label Feb 23, 2026

akmalsoliev changed the title ~~fix: ref() and source() in Python models to use dbt-core's resolved f…~~ fix: ref() and source() in Python models Feb 23, 2026

yotahk added the enable-functional-tests This label enable functional tests label Mar 9, 2026

new: adding tests for ref and source

c4ebc00

update: CHANGELOG.md

Merge branch 'main' into fix_table_ref_source_issue

e2a7b9f

yotahk added enable-functional-tests This label enable functional tests and removed enable-functional-tests This label enable functional tests labels Mar 12, 2026

Update CHANGELOG.md

f39221a

yotahk added enable-functional-tests This label enable functional tests and removed enable-functional-tests This label enable functional tests labels Mar 12, 2026

yotahk approved these changes Mar 12, 2026

View reviewed changes

yotahk merged commit eb716ff into aws-samples:main Mar 12, 2026
26 checks passed

akmalsoliev deleted the fix_table_ref_source_issue branch May 2, 2026 18:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: ref() and source() in Python models #666

fix: ref() and source() in Python models #666
yotahk merged 4 commits intoaws-samples:mainfrom
akmalsoliev:fix_table_ref_source_issue

akmalsoliev commented Feb 23, 2026 •

edited

Loading

Uh oh!

akmalsoliev commented Mar 6, 2026

Uh oh!

yotahk commented Mar 9, 2026

Uh oh!

akmalsoliev commented Mar 9, 2026

Uh oh!

Uh oh!

yotahk commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

akmalsoliev commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

akmalsoliev commented Mar 6, 2026

Uh oh!

yotahk commented Mar 9, 2026

Uh oh!

akmalsoliev commented Mar 9, 2026

Uh oh!

Uh oh!

yotahk commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

akmalsoliev commented Feb 23, 2026 •

edited

Loading