User Story
As a data scientist or analyst
I want to run an end-to-end interactive evaluation process for Presidio
So that I can quickly set up, configure, and assess Presidio's performance with synthetic data and structured evaluation
Acceptance Criteria
Provide an interactive pipeline (e.g., Gradio or Streamlit app) that includes:
Data Generation:
Users can define template sentences and Faker providers to generate synthetic PII data.
The generated dataset is structured and stored for evaluation.
Presidio Configuration:
Users can specify configuration settings via a YAML file (aligned with AnalyzerEngineProvider).
Entity Mapping:
Users can define a mapping between dataset entities and Presidio entities for evaluation.
Presidio Evaluation:
The Evaluator object runs Presidio on each sample and compares predictions to ground truth.
Metrics & Plots:
Compute and display key evaluation metrics (per-entity precision, recall, F2-score, and overall PII detection scores).
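To make the entity mapping, comparison, and metrics criteria concrete, here is a minimal sketch in plain Python. It is not the Evaluator's actual implementation: the mapping contents, the `(start, end, label)` span format, the helper names (`map_spans`, `f_beta`, `score`), and the exact-span matching rule are all assumptions chosen for illustration. The real evaluation may compare predictions at a different granularity.

```python
from collections import Counter

# Hypothetical mapping from dataset entity names to Presidio entity names.
ENTITY_MAPPING = {"name": "PERSON", "phone": "PHONE_NUMBER", "city": "LOCATION"}


def map_spans(spans, mapping):
    """Translate dataset labels to Presidio labels; drop unmapped labels."""
    return {(start, end, mapping[label]) for start, end, label in spans if label in mapping}


def f_beta(precision, recall, beta=2.0):
    """F-beta score; beta=2 weights recall more heavily than precision."""
    denom = beta ** 2 * precision + recall
    return (1 + beta ** 2) * precision * recall / denom if denom else 0.0


def score(samples, mapping, beta=2.0):
    """Per-entity precision, recall, and F-beta using exact span matching.

    `samples` is a list of (gold_spans, predicted_spans) pairs. Gold spans carry
    dataset labels; predicted spans already carry Presidio labels.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for gold_spans, predicted_spans in samples:
        gold = map_spans(gold_spans, mapping)
        pred = set(predicted_spans)
        for _, _, entity in gold & pred:
            tp[entity] += 1
        for _, _, entity in pred - gold:
            fp[entity] += 1
        for _, _, entity in gold - pred:
            fn[entity] += 1

    results = {}
    for entity in sorted(set(tp) | set(fp) | set(fn)):
        found = tp[entity] + fp[entity]
        expected = tp[entity] + fn[entity]
        # By convention here, an undefined ratio (0/0) is reported as 0.0.
        precision = tp[entity] / found if found else 0.0
        recall = tp[entity] / expected if expected else 0.0
        results[entity] = {
            "precision": precision,
            "recall": recall,
            "f_beta": f_beta(precision, recall, beta),
        }
    return results


# Two toy samples: (ground truth with dataset labels, predictions with Presidio labels).
samples = [
    ([(11, 15, "name"), (27, 32, "city")], [(11, 15, "PERSON")]),
    ([(0, 4, "name")], [(0, 4, "PERSON"), (10, 18, "PHONE_NUMBER")]),
]
metrics = score(samples, ENTITY_MAPPING)
```

In this toy run, `PERSON` is matched in both samples, `LOCATION` is missed once, and `PHONE_NUMBER` is predicted once without a matching ground-truth span. An interactive app could render the resulting `metrics` dictionary as a table or bar chart per entity.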