---------------------------------------------------------------------- This is the API documentation for the gdtest_big_class library. ---------------------------------------------------------------------- ## Classes Main classes provided by the package DataProcessor(name: str) A data processing pipeline with many operations. Parameters ---------- name The name of this processor instance. Examples -------- >>> dp = DataProcessor("etl") >>> dp.name 'etl' ## DataProcessor Methods Methods for the DataProcessor class load(self, path: str) -> None Load data from a file path. Parameters ---------- path Path to the data file. transform(self, func) -> 'DataProcessor' Apply a transformation function to the data. Parameters ---------- func A callable that transforms the data. Returns ------- DataProcessor Self, for method chaining. filter(self, predicate) -> 'DataProcessor' Filter data based on a predicate. Parameters ---------- predicate A callable returning True for rows to keep. Returns ------- DataProcessor Self, for method chaining. sort(self, key: str, ascending: bool = True) -> 'DataProcessor' Sort data by a key. Parameters ---------- key The column/field to sort by. ascending Sort ascending if True, descending if False. Returns ------- DataProcessor Self, for method chaining. aggregate(self, func, column: str) -> 'DataProcessor' Aggregate data by applying a function to a column. Parameters ---------- func Aggregation function (e.g., sum, mean). column Column to aggregate. Returns ------- DataProcessor Self, for method chaining. validate(self) -> bool Validate the current data state. Returns ------- bool True if data is valid. export(self, path: str, fmt: str = 'csv') -> None Export data to a file. Parameters ---------- path Output file path. fmt Output format (csv, json, parquet). summary(self) -> dict Return a summary of the data. Returns ------- dict A dictionary with summary statistics. ## Functions Utility functions load_data(path: str) -> gdtest_big_class.DataProcessor Convenience function to create a processor and load data. Parameters ---------- path Path to the data file. Returns ------- DataProcessor A new processor with data loaded. save_data(processor: gdtest_big_class.DataProcessor, path: str) -> None Save processor data to a file. Parameters ---------- processor The data processor whose data to save. path Output file path.