types.DataFrameSource

DataFrameSource(df, table_name)

A DataSource implementation that wraps a pandas DataFrame using DuckDB.

Methods

Name Description
cleanup Close the DuckDB connection.
execute_query Execute query using DuckDB.
get_data Return the unfiltered data as a DataFrame.
get_db_type Get the database type.
get_schema Generate schema information from DataFrame.

cleanup

types.DataFrameSource.cleanup()

Close the DuckDB connection.

Returns

Name Type Description
None

execute_query

types.DataFrameSource.execute_query(query)

Execute query using DuckDB.

Parameters

Name Type Description Default
query str SQL query to execute required

Returns

Name Type Description
pd.DataFrame Query results as pandas DataFrame

get_data

types.DataFrameSource.get_data()

Return the unfiltered data as a DataFrame.

Returns

Name Type Description
pd.DataFrame The complete dataset as a pandas DataFrame

get_db_type

types.DataFrameSource.get_db_type()

Get the database type.

Returns

Name Type Description
str The string “DuckDB”

get_schema

types.DataFrameSource.get_schema(categorical_threshold)

Generate schema information from DataFrame.

Parameters

Name Type Description Default
categorical_threshold int Maximum number of unique values for a text column to be considered categorical required

Returns

Name Type Description
str String describing the schema