-
-
Notifications
You must be signed in to change notification settings - Fork 0
FAQ
CSV, Parquet, and Excel (.xlsx) for file-based mapping. Plus PostgreSQL, MySQL, SQLite, and DuckDB via connection strings. Polars and Pandas DataFrames for in-memory use.
No. infermap only maps column names — it doesn't clean, reformat, or transform values. Use tools like GoldenFlow for data transformation.
pandas.rename() requires you to manually specify the mapping. infermap infers the mapping automatically by analyzing column names, value patterns, and statistical profiles.
Yes. Use the @infermap.scorer decorator to register a custom scorer:
@infermap.scorer(name="my_scorer", weight=0.7)
def my_scorer(source, target):
return infermap.ScorerResult(score=0.8, reasoning="custom match")Default is 0.3. Pairs below this are dropped. Configure with min_confidence parameter or --min-confidence CLI flag.
Yes. Save with result.to_config("mapping.yaml"), reload with infermap.from_config("mapping.yaml"). The saved config applies column renames directly.
The DBProvider introspects the live schema at runtime. If the target table gains or loses columns, the mapping adapts on the next run. Required fields that disappear generate warnings.
Not in v1. The Hungarian algorithm produces strictly 1:1 assignments. Composite column detection (e.g., full_name -> first_name + last_name) is planned for a future release.