Job metadata
execution.models.job_metadata
Job Metadata and Configuration Models
This module defines the data structures and enumerations used for job metadata, configuration, status tracking, and resource monitoring throughout the ETL pipeline.
ResourceUsage
dataclass
Container for tracking job resource consumption metrics.
Used for monitoring and optimizing job performance across different execution environments (local, Glue, etc.).
cpu_usage_percent: Optional[float] = None
class-attribute
instance-attribute
memory_usage_mb: Optional[float] = None
class-attribute
instance-attribute
disk_usage_mb: Optional[float] = None
class-attribute
instance-attribute
network_io_mb: Optional[float] = None
class-attribute
instance-attribute
execution_time_seconds: Optional[float] = None
class-attribute
instance-attribute
__init__(cpu_usage_percent: Optional[float] = None, memory_usage_mb: Optional[float] = None, disk_usage_mb: Optional[float] = None, network_io_mb: Optional[float] = None, execution_time_seconds: Optional[float] = None) -> None
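As a rough illustration of the signature above, the sketch below re-declares a local stand-in with the same five documented fields rather than importing the real class, so it runs on its own. The sample values and the `measured` filtering step are assumptions made for the example, not part of the module.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ResourceUsage:
    """Local stand-in mirroring the documented fields (not the real import)."""

    cpu_usage_percent: Optional[float] = None
    memory_usage_mb: Optional[float] = None
    disk_usage_mb: Optional[float] = None
    network_io_mb: Optional[float] = None
    execution_time_seconds: Optional[float] = None


# Record only the metrics that were actually measured; the rest stay None.
usage = ResourceUsage(cpu_usage_percent=42.5, execution_time_seconds=12.0)

# Keep just the populated metrics, for example before logging them.
measured = {name: value for name, value in asdict(usage).items() if value is not None}
print(measured)
```

Because every field defaults to `None`, a caller can distinguish "not measured" from a measured value of zero.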
JobStatus
Enumeration of possible job execution states.
Used for tracking job lifecycle and enabling monitoring, alerting, and recovery mechanisms.
PENDING = 'pending'
class-attribute
instance-attribute
RUNNING = 'running'
class-attribute
instance-attribute
COMPLETED = 'completed'
class-attribute
instance-attribute
FAILED = 'failed'
class-attribute
instance-attribute
CANCELLED = 'cancelled'
class-attribute
instance-attribute
RETRYING = 'retrying'
class-attribute
instance-attribute
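The members listed above can be exercised with a small sketch. It assumes `JobStatus` is a plain `enum.Enum` (the base class is not shown on this page) and re-declares it locally so the snippet is self-contained. `TERMINAL_STATUSES` and `is_terminal` are hypothetical helpers, and which states count as terminal is an assumption made for illustration only.

```python
from enum import Enum


class JobStatus(Enum):
    """Local stand-in using the documented member names and values."""

    PENDING = "pending"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"
    CANCELLED = "cancelled"
    RETRYING = "retrying"


# Hypothetical grouping: states after which no further work is expected.
TERMINAL_STATUSES = frozenset(
    {JobStatus.COMPLETED, JobStatus.FAILED, JobStatus.CANCELLED}
)


def is_terminal(status: JobStatus) -> bool:
    """Return True if the job is in one of the assumed terminal states."""
    return status in TERMINAL_STATUSES


# Stored string values (for example from a log record) map back to members.
status = JobStatus("running")
print(status.name, is_terminal(status))
```

A pipeline that retries failed jobs automatically might treat `FAILED` as non-terminal instead; the grouping would need to match the actual recovery logic.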
JobType
Enumeration of supported ETL job types.
Defines the different categories of processing jobs available in the Owl-Watch pipeline.
ETL = 'etl'
class-attribute
instance-attribute
CLEANING = 'cleaned'
class-attribute
instance-attribute
SENTIMENT = 'sentiment'
class-attribute
instance-attribute
NLP = 'nlp'
class-attribute
instance-attribute
ANALYTICS = 'analytics'
class-attribute
instance-attribute
COMMUNICATION_ETL = 'communication_etl'
class-attribute
instance-attribute
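One detail in the listing above is easy to miss: the `CLEANING` member carries the value `'cleaned'`, so lookup by name and lookup by value use different strings. The sketch below shows both, again assuming a plain `enum.Enum` and a locally re-declared stand-in. `parse_job_type` is a hypothetical helper, not something this module is known to provide.

```python
from enum import Enum
from typing import Optional


class JobType(Enum):
    """Local stand-in using the documented member names and values."""

    ETL = "etl"
    CLEANING = "cleaned"
    SENTIMENT = "sentiment"
    NLP = "nlp"
    ANALYTICS = "analytics"
    COMMUNICATION_ETL = "communication_etl"


def parse_job_type(raw: str) -> Optional[JobType]:
    """Hypothetical helper: map a raw config string to a JobType by value."""
    try:
        return JobType(raw.strip().lower())
    except ValueError:
        # Unknown values yield None instead of raising.
        return None


# Lookup by value uses 'cleaned'; lookup by name uses 'CLEANING'.
by_value = JobType("cleaned")
by_name = JobType["CLEANING"]
print(by_value is by_name)
```

With this helper, `parse_job_type("cleaning")` returns `None`, because `'cleaning'` is the lowercase member name rather than the stored value.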
DataSource
Enumeration of supported data sources.
Identifies the origin systems and datasets that can be processed by the ETL pipeline.