Merged
7 changes: 2 additions & 5 deletions docs/en/notes/mm_guide/image_understanding/image_gcot.md
@@ -39,13 +39,10 @@ cd run_gcot

Save the code in the "Pipeline Example" section below as `image_gcot_pipeline.py`.

-### Step 3: Configure Parameters
-
-Ensure you have a VLM model capable of grounding (e.g., Qwen2.5-VL-7B-Instruct).
+### Step 3: Download Example Data

```bash
-# Install dependencies
-pip install open-dataflow vllm
+huggingface-cli download --repo-type dataset OpenDCAI/dataflow-demo-image --local-dir example_data

```
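
Note on the `huggingface-cli download` line added in this hunk: readers who prefer to fetch the example data from Python could use something like the sketch below. It assumes the `huggingface_hub` package is installed; the helper name `download_example_data` and the two constants are illustrative and not part of this PR.

```python
from pathlib import Path

# Values copied from the CLI command in the diff.
REPO_ID = "OpenDCAI/dataflow-demo-image"
LOCAL_DIR = "example_data"


def download_example_data(local_dir: str = LOCAL_DIR) -> Path:
    """Hypothetical helper mirroring the huggingface-cli command above."""
    # Imported lazily so this module still loads without huggingface_hub.
    from huggingface_hub import snapshot_download

    # repo_type="dataset" corresponds to `--repo-type dataset`,
    # local_dir corresponds to `--local-dir example_data`.
    path = snapshot_download(
        repo_id=REPO_ID,
        repo_type="dataset",
        local_dir=local_dir,
    )
    return Path(path)
```

Calling `download_example_data()` from the working directory should place the files under `example_data/`, the same location the CLI command targets.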

@@ -40,13 +40,10 @@ cd run_scalecap

Save the code in the "Pipeline Example" section below as `scalecap_pipeline.py`.

-### Step 3: Configure Parameters
-
-Ensure the VLM model path (e.g., Qwen2.5-VL) is correct.
+### Step 3: Download Example Data

```bash
-# Install dependencies
-pip install open-dataflow vllm
+huggingface-cli download --repo-type dataset OpenDCAI/dataflow-demo-image --local-dir example_data

```

@@ -36,13 +36,10 @@ cd run_vis_mcq

Save the code in the "Pipeline Example" section below as `visual_mcq_pipeline.py`.

-### Step 3: Configure Parameters
-
-Control filtering thresholds via CLI. For example, requiring 100% visual accuracy and less than 25% blind accuracy:
+### Step 3: Download Example Data

```bash
-# Install dependencies
-pip install open-dataflow vllm
+huggingface-cli download --repo-type dataset OpenDCAI/dataflow-demo-image --local-dir example_data

```

@@ -39,13 +39,10 @@ cd run_mcts_reasoning

Save the code in the "Pipeline Example" section below as `vision_mcts_pipeline.py`.

-### Step 3: Configure Parameters
-
-Ensure the input file (jsonl) contains the `tree` field (for extraction) or just `question/image` (for generation).
+### Step 3: Download Example Data

```bash
-# Install dependencies
-pip install open-dataflow vllm
+huggingface-cli download --repo-type dataset OpenDCAI/dataflow-demo-image --local-dir example_data

```
