A First Evaluation of a Multi-Modal Learning System to Control Surgical Assistant Robots via Action Segmentation