MCP Servers



Local Tools


manage_context
handle_overlong_tool_outputs
Instruction
Analyze the wandb project https://wandb.ai/mluo/deepscaler-1.5b?nw=nwusermluo, identify the experiment with the best validation set performance, and find which step performed best in that experiment. Save the best_experiment_name, best_step, and best_val_score to a CSV file namedbest_experiment.csv in the workspace.
Initial State
Local Workspace
workspace/
└── best_experiment.csv
Wandb Projects
├── deepscaler-1.5b/