-
paguro.Dataset[VFM: paguro.models.vfm.vfmodel.VFrameModel]].skim(config: list[tuple] | None =
None, *, by: str | list[str] | None =None, hist: bool =False, unnest_structs: bool | ColumnNameOrSelector =False) Collection Generate a summary of the dataset based on specified configurations.
This method provides a comprehensive summary for a dataset, allowing customization of grouped statistics, histogram generation, and deeper dataset analysis by unnesting nested structures.
- Parameters:¶
- config: list[tuple] | None =
None¶ A list of tuples defining the configuration for the summary statistics. Each tuple typically represents a specific computation or analysis to perform on the dataset.
- by: str | list[str] | None =
None¶ Defines the columns by which to group data before generating the summary. If multiple strings are provided, data is grouped hierarchically by the specified columns.
- hist: bool =
False¶ If True, includes histograms for numeric fields in the summary output. Useful for visualizing the distribution of numeric data.
- unnest_structs: bool | ColumnNameOrSelector =
False¶ If True, unnests any nested structures in the dataset to enable deeper analysis. A ColumnNameOrSelector can also be provided to target specific fields to unnest.
- config: list[tuple] | None =