Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

BenchFlow

company
https://benchflow.ai
benchflow_ai
benchflow-ai
benchflow-ai
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

bingran-you  new activity about 1 hour ago
benchflow/skillsbench-leaderboard:Salvage 321 healthy replacement runs from PR2/PR3 (5-trial era)
bingran-you  new activity about 18 hours ago
benchflow/skillsbench-leaderboard:Wrap-up: final healthy SkillsBench trajectories
bingran-you  updated a dataset about 18 hours ago
benchflow/skillsbench-leaderboard
View all activity

Papers

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all Papers

Xiangyi Li's profile pictureBingran You's profile picture

benchflow 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs