RichardErkhov commited on
Commit
1692a05
·
verified ·
1 Parent(s): ed51bc2

uploaded readme

Browse files
Files changed (1) hide show
  1. README.md +70 -0
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ MathCoder2-CodeLlama-7B - bnb 4bits
11
+ - Model creator: https://huggingface.co/MathGenie/
12
+ - Original model: https://huggingface.co/MathGenie/MathCoder2-CodeLlama-7B/
13
+
14
+
15
+
16
+
17
+ Original model description:
18
+ ---
19
+ license: apache-2.0
20
+ datasets:
21
+ - MathGenie/MathCode-Pile
22
+ language:
23
+ - en
24
+ metrics:
25
+ - accuracy
26
+ base_model:
27
+ - codellama/CodeLlama-7b-hf
28
+ pipeline_tag: text-generation
29
+ tags:
30
+ - math
31
+ ---
32
+
33
+ # MathCoder2
34
+
35
+ ### Introduction
36
+
37
+ The MathCoder2 models are created by conducting continued pretraining on [MathCode-Pile](https://huggingface.co/datasets/MathGenie/MathCode-Pile). They are introduced in the paper [MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code](https://arxiv.org/abs/2410.08196).
38
+
39
+ The mathematical pretraining dataset includes mathematical code accompanied with natural language reasoning steps, making it a superior resource for models aimed at performing advanced mathematical reasoning tasks.
40
+
41
+ ### Evaluation
42
+
43
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65dd9e7b4a4fce1ec96dc6b7/BEZoDZLjp-fPFlt7oFXBa.png)
44
+
45
+ ### Citation
46
+
47
+ If you find this repository helpful, please consider citing our papers:
48
+
49
+ ```
50
+ @misc{lu2024mathcoder2bettermathreasoning,
51
+ title={MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code},
52
+ author={Zimu Lu and Aojun Zhou and Ke Wang and Houxing Ren and Weikang Shi and Junting Pan and Mingjie Zhan and Hongsheng Li},
53
+ year={2024},
54
+ eprint={2410.08196},
55
+ archivePrefix={arXiv},
56
+ primaryClass={cs.CL},
57
+ url={https://arxiv.org/abs/2410.08196},
58
+ }
59
+ ```
60
+ ```
61
+ @inproceedings{
62
+ wang2024mathcoder,
63
+ title={MathCoder: Seamless Code Integration in {LLM}s for Enhanced Mathematical Reasoning},
64
+ author={Zimu Lu and Aojun Zhou and Zimu Lu and Sichun Luo and Weikang Shi and Renrui Zhang and Linqi Song and Mingjie Zhan and Hongsheng Li},
65
+ booktitle={The Twelfth International Conference on Learning Representations},
66
+ year={2024},
67
+ url={https://openreview.net/forum?id=z8TW0ttBPp}
68
+ }
69
+ ```
70
+