Reference
Classes
Main classes provided by the package
- Pipeline
-
A multi-step data processing pipeline.
Pipeline Methods
Methods for the Pipeline class
- Pipeline.load()
-
Load data from a source dictionary.
- Pipeline.clean()
-
Clean the loaded data.
- Pipeline.transform()
-
Apply a transformation function to the data.
- Pipeline.validate()
-
Validate the data against an optional schema.
- Pipeline.aggregate()
-
Aggregate data by a given key.
- Pipeline.export()
-
Export the pipeline results.
- Pipeline.status()
-
Get the current pipeline status.