Reference
Classes
Main classes provided by the package
- DataProcessor
-
A data processing pipeline with many operations.
DataProcessor Methods
Methods for the DataProcessor class
- DataProcessor.load()
-
Load data from a file path.
- DataProcessor.transform()
-
Apply a transformation function to the data.
- DataProcessor.filter()
-
Filter data based on a predicate.
- DataProcessor.sort()
-
Sort data by a key.
- DataProcessor.aggregate()
-
Aggregate data by applying a function to a column.
- DataProcessor.validate()
-
Validate the current data state.
- DataProcessor.export()
-
Export data to a file.
- DataProcessor.summary()
-
Return a summary of the data.
Functions
Utility functions
- load_data()
-
Convenience function to create a processor and load data.
- save_data()
-
Save processor data to a file.