Evaluation Results
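The “书生·济世” (CFGPT2) models were evaluated on the general-purpose benchmark C-Eval and on the finance-specific benchmarks FinEval, OpenFinData, and CFBenchmark.
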
| Dataset | CFGPT2-7B | CFGPT2-20B | InternLM2-7B | InternLM2-20B | Baichuan2-13B | ChatGLM2-6B |
|---|---|---|---|---|---|---|
| C-Eval | 63.5 | 69.2 | 60.8 | 63.0 | 58.2 | 51.7 |
| FinEval | 62.9 | 64.8 | 51.9 | 55.5 | - | 47.4 |
| OpenFinData | 70.5 | 73.8 | 57.8 | 62.9 | 57.2 | 54.4 |
| CFBenchmark-Basic | 71.4 | 76.4 | 57.6 | 59.6 | 57.2 | 47.9 |