
Feature Request: Add Explainability / Visibility Mode to HumanCliSolver #1628

@AftabHussain

Description


Describe the feature or improvement you're requesting

Current State

HumanCliSolver is a CLI-based solver that replaces model inference with human input. The current implementation is clean and minimal and works correctly, but it offers no visibility into what the evaluation pipeline is actually doing.

Currently the solver injects the task description as a system message, then appends prior conversation state, as follows:

msgs += task_state.messages

It then flattens all messages into a single prompt string and reads the human's answer from stdin:

"\n".join([f"{msg.role}: {msg.content}" for msg in msgs])
answer = input(prompt)

Finally it logs the interaction via record_sampling and returns the result:

record_sampling(prompt=prompt, sampled=answer, model="human")
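Putting those pieces together, the current flow looks roughly like the sketch below. The `Message`/`TaskState` stand-ins and the injectable `read_input`/`record_sampling` parameters are illustrative, not the actual evals signatures; the real solver returns a `SolverResult` rather than a bare string.

```python
from dataclasses import dataclass, field

# Minimal stand-ins for the evals types, just so the sketch runs on its own.
@dataclass
class Message:
    role: str
    content: str

@dataclass
class TaskState:
    task_description: str
    messages: list = field(default_factory=list)

def solve(task_state, read_input=input, record_sampling=lambda **kw: None):
    """Sketch of the current HumanCliSolver flow (names approximate)."""
    # Inject the task description as a system message, then prior history.
    msgs = [Message(role="system", content=task_state.task_description)]
    msgs += task_state.messages

    # Flatten everything into one string and ask the human for input.
    prompt = "\n".join(f"{msg.role}: {msg.content}" for msg in msgs)
    answer = read_input(prompt)

    # Log the interaction and return the human's answer.
    record_sampling(prompt=prompt, sampled=answer, model="human")
    return answer
```

Notice that the flattened `prompt` is the only artifact a human ever sees, which is exactly the visibility gap described above.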

What is Missing

This design hides important details that matter in evaluation contexts:

  • There is no structured view of the task context vs. conversation history.
  • The exact final prompt string is not clearly separated or emphasized.
  • The sampling metadata (prompt length, answer length, model tag) is not visible.

For human baselines, debugging, or audit-heavy evaluation workflows, this lack of visibility reduces transparency and makes experiments harder to reason about.

Proposed Enhancement: Optional Explainability Mode

Introduce an optional flag:

HumanCliSolver(explain=True)

When enabled, the solver prints structured, clearly separated sections before and after input. Instead of showing only the flattened prompt, display:

================ TASK CONTEXT (system) ================
<task_state.task_description>

================ MESSAGE HISTORY ================
[0] user: ...
[1] assistant: ...
[2] user: ...

================ FINAL PROMPT STRING (exact) ================
<exact prompt passed to input()>

================ AWAITING HUMAN INPUT ================
assistant (you):

After the human responds, show what is being recorded and returned:

================ SAMPLING RECORD ================
model: human
prompt_chars: 1243
answer_chars: 87

================ OUTPUT ================
raw_answer: ...
final_answer: ...

If postprocessing alters the output, display both versions to eliminate hidden transformations.
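One way to render the pre-input sections above is a small helper; the function names, banner width, and labels here are illustrative, not part of any existing API:

```python
def banner(title, width=56):
    """Return a section header like '====== TITLE ======'."""
    pad = max(width - len(title) - 2, 2)
    left = pad // 2
    right = pad - left
    return f"{'=' * left} {title} {'=' * right}"

def explain_before_input(task_description, messages, prompt):
    """Print the structured pre-input view proposed above.

    `messages` is a sequence of (role, content) pairs; `prompt` is the
    exact string that will be passed to input().
    """
    print(banner("TASK CONTEXT (system)"))
    print(task_description)
    print(banner("MESSAGE HISTORY"))
    for i, (role, content) in enumerate(messages):
        print(f"[{i}] {role}: {content}")
    print(banner("FINAL PROMPT STRING (exact)"))
    print(prompt)
    print(banner("AWAITING HUMAN INPUT"))
```

Keeping the banner rendering in one helper also makes it easy to reuse for the post-input SAMPLING RECORD and OUTPUT sections.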

Suggested Implementation Sketch

raw_answer = input(prompt)
final_answer = raw_answer  # placeholder for any postprocessing step

if self.explain:
    print("=== SAMPLING RECORD ===")
    print("model: human")
    print(f"prompt_chars: {len(prompt)}")
    print(f"answer_chars: {len(raw_answer)}")
    print("=== OUTPUT ===")
    print(f"raw_answer: {raw_answer}")
    print(f"final_answer: {final_answer}")

record_sampling(
    prompt=prompt,
    sampled=final_answer,
    model="human",
)

return SolverResult(final_answer)

Final Comments

HumanCliSolver functions as a human baseline, a prompt debugging tool, and a sanity check within evaluation pipelines. Adding an optional explainability mode would improve transparency, reproducibility, and auditability—without changing default behavior—by making the solver self-documenting when deeper visibility is needed.

Additional context

No response
