---------------------------------------------------------------------- This is the API documentation for the gdtest_pdm_big_class library. ---------------------------------------------------------------------- ## Classes Main classes provided by the package Pipeline(name: str, verbose: bool = False) A multi-step data processing pipeline. Provides methods for loading, cleaning, transforming, validating, aggregating, and exporting data. Parameters ---------- name : str The pipeline name. verbose : bool Whether to print progress messages. Examples -------- >>> p = Pipeline("etl") >>> p.load({"items": [1, 2, 3]}) ## Pipeline Methods Methods for the Pipeline class load(self, source: dict) -> 'Pipeline' Load data from a source dictionary. Parameters ---------- source : dict The data source. Returns ------- Pipeline Self for method chaining. clean(self, drop_nulls: bool = True) -> 'Pipeline' Clean the loaded data. Parameters ---------- drop_nulls : bool Whether to drop null values. Returns ------- Pipeline Self for method chaining. transform(self, func: ) -> 'Pipeline' Apply a transformation function to the data. Parameters ---------- func : callable A function to apply to each data item. Returns ------- Pipeline Self for method chaining. validate(self, schema: dict | None = None) -> bool Validate the data against an optional schema. Parameters ---------- schema : dict or None Validation schema. If None, performs basic checks. Returns ------- bool True if the data passes validation. aggregate(self, group_by: str, agg_func: str = 'sum') -> dict Aggregate data by a given key. Parameters ---------- group_by : str The field to group by. agg_func : str Aggregation function: 'sum', 'mean', 'count'. Returns ------- dict Aggregated results. export(self, fmt: str = 'json') -> str Export the pipeline results. Parameters ---------- fmt : str Output format: 'json', 'csv', or 'parquet'. Returns ------- str The serialized output. status(self) -> dict Get the current pipeline status. Returns ------- dict Status info including name, data loaded, and step count.