qianhuiwu nielsr HF Staff committed on
Commit 6b0edc6 · verified · 1 Parent(s): dc6d3d4

Improve model card with pipeline tag and library name (#1)

- Improve model card with pipeline tag and library name (80395f928c6141cc6e18224a7d0051d87c70b024)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +6 -5
README.md CHANGED
@@ -1,15 +1,17 @@
 ---
-license: mit
 base_model:
 - Qwen/Qwen2-VL-7B-Instruct
+license: mit
+library_name: transformers
+pipeline_tag: image-text-to-text
 ---
 
 # GUI-Actor-7B with Qwen2-VL-7B as backbone VLM
 
-This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://aka.ms/GUI-Actor).
+This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://huggingface.co/papers/2506.03143).
 It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
 
-For more details on model design and evaluation, please check: [🏠 Project Page](https://aka.ms/GUI-Actor) | [💻 Github Repo](https://github.com/microsoft/GUI-Actor) | [📑 Paper](https://www.arxiv.org/pdf/2506.03143).
+For more details on model design and evaluation, please check: [🏠 Project Page](https://microsoft.github.io/GUI-Actor/) | [💻 Github Repo](https://github.com/microsoft/GUI-Actor) | [📑 Paper](https://www.arxiv.org/pdf/2506.03143).
 
 | Model Name | Hugging Face Link |
 |--------------------------------------------|--------------------------------------------|
@@ -131,5 +133,4 @@ print(f"Predicted click point: [{round(px, 4)}, {round(py, 4)}]")
 primaryClass={cs.CV},
 url={https://www.arxiv.org/pdf/2506.03143},
 }
-```
-
+```
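
The metadata change in this commit can be checked with a short script. The following is a minimal sketch, not the tooling the Hub itself uses: it assumes the README front matter after the commit looks exactly like the added lines above. The names `README_AFTER_COMMIT` and `parse_front_matter` are hypothetical and introduced only for illustration. It handles just flat `key: value` pairs and `- item` lists; a real YAML parser would be the safer choice for anything more complex.

```python
# Minimal sketch: read the front matter of a model card using only the standard
# library. Supports flat "key: value" pairs and "- item" lists, which is all the
# metadata block in this commit uses. Names below are hypothetical.

README_AFTER_COMMIT = """\
---
base_model:
- Qwen/Qwen2-VL-7B-Instruct
license: mit
library_name: transformers
pipeline_tag: image-text-to-text
---

# GUI-Actor-7B with Qwen2-VL-7B as backbone VLM
"""


def parse_front_matter(text):
    """Return the metadata between the leading '---' fences as a dict."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    metadata = {}
    current_key = None  # key whose value is a list still being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing fence ends the metadata block
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            # List item belonging to the most recent key with an empty value.
            metadata[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                metadata[key] = value
                current_key = None
            else:
                metadata[key] = []
                current_key = key
    return metadata


metadata = parse_front_matter(README_AFTER_COMMIT)
for field in ("license", "library_name", "pipeline_tag", "base_model"):
    print(f"{field}: {metadata.get(field)}")
```

Running the sketch against the pre-commit front matter would show `library_name` and `pipeline_tag` as `None`, which is the gap this commit closes.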