Optimize model inference and runtime
Description: Improve speed and efficiency of decision-making and model performance.
User Story: As a developer, I want the system to run faster, so it can operate in real-time conditions.
DoR: Functional baseline system
DoD: Model size reduced, runtime improved
Acceptance Criteria: Inference time < 0.5s per image