Conversation
There was a problem hiding this comment.
Pull request overview
This PR adds three comprehensive Jupyter notebooks that document and demonstrate the AML (Anti-Money Laundering) investigation use case for evaluating AI agents. The notebooks provide a step-by-step walkthrough from data exploration to running evaluations.
Changes:
- Added notebook 01 to explore the IBM AML dataset, build a SQLite database, and demonstrate the ReadOnlySqlDatabase tool
- Added notebook 02 to explain case file structures, case generation, and running the agent on individual cases
- Added notebook 03 to showcase the full evaluation pipeline with item-level, trace-level, and run-level graders
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| implementations/aml_investigation/01_data_and_tools.ipynb | Introduces the AML dataset, demonstrates database setup and schema, and explains the ReadOnlySqlDatabase safety tool |
| implementations/aml_investigation/02_running_the_agent.ipynb | Documents the case file data structures, explains the four case types (TP/TN/FP/FN), and demonstrates running a single case through the agent |
| implementations/aml_investigation/03_evaluation.ipynb | Demonstrates the full evaluation pipeline including dataset upload to Langfuse, explains the three-tier grading system, and shows how to inspect results |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
It works in Coder. I just had to change the I can do this as a workaround: I'd recommend changing to |
Summary
Add notebooks for AML investigation use case.
Clickup Ticket(s): N/A
Type of Change
Changes Made
Testing
uv run pytest tests/)uv run mypy <src_dir>)uv run ruff check src_dir/)Manual testing details:
Ran the notebooks.
Screenshots/Recordings
N/A
Related Issues
N/A
Deployment Notes
N/A
Checklist