DatasetFormatter

class myoverse.datasets.utils.DatasetFormatter(console=None, debug_level=0)

Handles Rich console output for dataset creation.

Extracts all formatting and display logic from EMGDataset to provide a clean separation of concerns.

Parameters:
  • console (Console | None) – Rich console instance. If None, a new Console is created.

  • debug_level (int) – Debug verbosity: 0 = no output, 1 = text, 2 = text and graphs.
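
The console=None default is a common pattern for an optional, injectable output sink. The sketch below illustrates it with a hypothetical stand-in class, StreamFormatter, that writes to a plain text stream instead of a Rich Console. It is not the library's implementation: the class name, the stream parameter, and the header layout are assumptions made for illustration.

```python
import io
import sys


class StreamFormatter:
    """Hypothetical stand-in for DatasetFormatter (illustration only)."""

    def __init__(self, stream=None, debug_level=0):
        # Fall back to a default sink when the caller does not supply one,
        # analogous to the documented "If None" behaviour of `console`.
        self.stream = stream if stream is not None else sys.stdout
        self.debug_level = debug_level

    def print_section(self, title):
        # Assumed plain-text layout; the real class renders with Rich styling.
        self.stream.write(f"== {title} ==\n")


# Injecting a sink makes the output easy to capture, for example in tests.
buffer = io.StringIO()
formatter = StreamFormatter(stream=buffer, debug_level=1)
formatter.print_section("Loading data")
captured = buffer.getvalue()
```

Passing an explicit sink, as done with buffer here, is what allows output to be redirected or captured. With the real class, the analogous step would presumably be passing a preconfigured Rich Console.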

Examples

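>>> from myoverse.datasets.utils import DatasetFormatter
>>> formatter = DatasetFormatter(debug_level=1)  # 1 = text output
>>> formatter.print_header()
>>> formatter.print_config(config)  # config: an existing DatasetConfig
>>> formatter.print_summary(dataset)  # dataset: an existing Group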

Methods

__init__([console, debug_level])

_calculate_sizes(dataset)

Calculate dataset sizes in MB.

_print_structure_tree(dataset)

Print dataset structure as a tree.

print_action(action)

Print an action being performed.

print_augmentation_config(num_pipelines, ...)

Print augmentation configuration.

print_config(config)

Print dataset configuration table.

print_data_panel(data, title)

Print a data object in a styled panel.

print_data_structure(emg_data, ground_truth_data)

Print data structure tree.

print_header([title])

Print a section header.

print_section(title)

Print a section label.

print_split_sizes(training_sizes, ...)

Print dataset split sizes table.

print_summary(dataset)

Print final dataset summary.

print_tasks_info(tasks)

Print information about tasks to process.

should_print([level])

Check if output should be printed at the given level.

print_action(action)

Print an action being performed.

Parameters:

action (str)

Return type:

None

print_augmentation_config(num_pipelines, pipeline_names, batch_size, training_size)

Print augmentation configuration.

Parameters:
  • num_pipelines (int)

  • pipeline_names (list[str])

  • batch_size (int)

  • training_size (int)

Return type:

None

print_config(config)

Print dataset configuration table.

Parameters:

config (DatasetConfig)

Return type:

None

print_data_panel(data, title)

Print a data object in a styled panel.

Parameters:
  • data

  • title

Return type:

None

print_data_structure(emg_data, ground_truth_data)

Print data structure tree.

Parameters:
  • emg_data

  • ground_truth_data

Return type:

None

print_header(title='STARTING DATASET CREATION')

Print a section header.

Parameters:

title (str)

Return type:

None

print_section(title)

Print a section label.

Parameters:

title (str)

Return type:

None

print_split_sizes(training_sizes, testing_sizes, validation_sizes)

Print dataset split sizes table.

Parameters:
  • training_sizes

  • testing_sizes

  • validation_sizes

Return type:

None

print_summary(dataset)

Print final dataset summary.

Parameters:

dataset (Group)

Return type:

None

print_tasks_info(tasks)

Print information about tasks to process.

Parameters:

tasks (list[str])

Return type:

None

should_print(level=1)

Check if output should be printed at the given level.

Parameters:

level (int)

Return type:

bool
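
The level check can be pictured as a simple threshold comparison. The sketch below assumes that output is printed when the formatter's debug_level is at least the requested level. The source does not state the exact rule, so the comparison and the stand-in class LevelGate are assumptions, not the library's code.

```python
class LevelGate:
    """Hypothetical stand-in showing one plausible should_print rule."""

    def __init__(self, debug_level=0):
        self.debug_level = debug_level

    def should_print(self, level=1):
        # Assumption: print when the configured level reaches the requested one.
        return self.debug_level >= level


silent = LevelGate(debug_level=0)
text_only = LevelGate(debug_level=1)
full = LevelGate(debug_level=2)

# Evaluate the gate for a few (debug_level, requested level) combinations.
results = {
    "silent@1": silent.should_print(),
    "text_only@1": text_only.should_print(1),
    "text_only@2": text_only.should_print(2),
    "full@2": full.should_print(2),
}
```

Under this assumption, debug_level=0 suppresses everything requested at the default level, which is consistent with the documented meaning "0=none", while graph output requested at level 2 appears only when debug_level is 2.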