test_parser/.eval_results/test-bench-public.yaml
boyang-runllama
Add Test Bench Public evaluation results
8677361 verified
- dataset:
    id: llamaindex/test_bench
    task_id: chart_test
  value: 100.0
- dataset:
    id: llamaindex/test_bench
    task_id: table_test
  value: 70.0