Improve model card with pipeline tag and library name (#1)
Browse files- Improve model card with pipeline tag and library name (80395f928c6141cc6e18224a7d0051d87c70b024)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
@@ -1,15 +1,17 @@
|
|
1 |
---
|
2 |
-
license: mit
|
3 |
base_model:
|
4 |
- Qwen/Qwen2-VL-7B-Instruct
|
|
|
|
|
|
|
5 |
---
|
6 |
|
7 |
# GUI-Actor-7B with Qwen2-VL-7B as backbone VLM
|
8 |
|
9 |
-
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://
|
10 |
It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
|
11 |
|
12 |
-
For more details on model design and evaluation, please check: [π Project Page](https://
|
13 |
|
14 |
| Model Name | Hugging Face Link |
|
15 |
|--------------------------------------------|--------------------------------------------|
|
@@ -131,5 +133,4 @@ print(f"Predicted click point: [{round(px, 4)}, {round(py, 4)}]")
|
|
131 |
primaryClass={cs.CV},
|
132 |
url={https://www.arxiv.org/pdf/2506.03143},
|
133 |
}
|
134 |
-
```
|
135 |
-
|
|
|
1 |
---
|
|
|
2 |
base_model:
|
3 |
- Qwen/Qwen2-VL-7B-Instruct
|
4 |
+
license: mit
|
5 |
+
library_name: transformers
|
6 |
+
pipeline_tag: image-text-to-text
|
7 |
---
|
8 |
|
9 |
# GUI-Actor-7B with Qwen2-VL-7B as backbone VLM
|
10 |
|
11 |
+
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://huggingface.co/papers/2506.03143).
|
12 |
It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
|
13 |
|
14 |
+
For more details on model design and evaluation, please check: [π Project Page](https://microsoft.github.io/GUI-Actor/) | [π» Github Repo](https://github.com/microsoft/GUI-Actor) | [π Paper](https://www.arxiv.org/pdf/2506.03143).
|
15 |
|
16 |
| Model Name | Hugging Face Link |
|
17 |
|--------------------------------------------|--------------------------------------------|
|
|
|
133 |
primaryClass={cs.CV},
|
134 |
url={https://www.arxiv.org/pdf/2506.03143},
|
135 |
}
|
136 |
+
```
|
|