types.SQLAlchemySource

SQLAlchemySource(engine, table_name)

A DataSource implementation that supports multiple SQL databases via SQLAlchemy.

Supports various databases including PostgreSQL, MySQL, SQLite, Snowflake, and Databricks.

Methods

Name Description
cleanup Dispose of the SQLAlchemy engine.
execute_query Execute SQL query and return results as DataFrame.
get_data Return the unfiltered data as a DataFrame.
get_db_type Get the database type.
get_schema Generate schema information from database table.

cleanup

types.SQLAlchemySource.cleanup()

Dispose of the SQLAlchemy engine.

Returns

Name Type Description
None

execute_query

types.SQLAlchemySource.execute_query(query)

Execute SQL query and return results as DataFrame.

Parameters

Name Type Description Default
query str SQL query to execute required

Returns

Name Type Description
pd.DataFrame Query results as pandas DataFrame

get_data

types.SQLAlchemySource.get_data()

Return the unfiltered data as a DataFrame.

Returns

Name Type Description
pd.DataFrame The complete dataset as a pandas DataFrame

get_db_type

types.SQLAlchemySource.get_db_type()

Get the database type.

Returns the specific database type (e.g., POSTGRESQL, MYSQL, SQLITE) by inspecting the SQLAlchemy engine. Removes ” SQL” suffix if present.

get_schema

types.SQLAlchemySource.get_schema(categorical_threshold)

Generate schema information from database table.

Returns: String describing the schema