Overseas access: www.kdjingpai.com
Bookmark Us
Current Position:fig. beginning " AI Answers

How to overcome the gap between semantic reasoning and action output in end-to-end autonomous driving?

2025-08-25 1.5 K

Three-phase harmonized optimization programme

Orion solves the semantic-action alignment challenge with the following architectural design:

  • cross-modal alignment layer: EVA-CLIP visual encoder (224 × 224 inputs) with QLoRA fine-tuned LLM (7B parameters) sharing attention mechanism
  • trainable interface design: Add lightweight Adapter (only 0.5M parameters) to VLM outputs to encode textual commands as kinematic parameters
  • Online correction mechanism: Correcting trajectory deviations with real-time feedback from CARLA's RGBD camera

Specific operational procedures:

  1. Preparation phase: download eva02_petr_proj.pth and pretrain_qformer.pth weights files
  2. Training configuration: set joint_optimization=True in configs/train.yaml
  3. Validation method: run python eval_gap.py -metric semantic_action_gap

The scheme achieves 82.3% instruction-action matching on the nuScenes validation set, which is a 2.1x improvement over the baseline method. It especially excels in complex scenarios such as 'yielding to pedestrians'.

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Top

en_USEnglish