@Xiaomin-HUANG

Added Macro F1 and per-label metrics to complement the existing Micro F1 score, enabling better visibility into model performance across different entity types.

What's New

  • Macro F1: Unweighted average of the per-label F1 scores (0.51); see the computation sketch after the output schema below
  • Per-label breakdown: Precision, Recall, and F1 for each entity type
  • Formatted table: Sorted output for quick identification of best/worst performers
Enhanced output:

```
{
  "per_class": {
    "tag1": {"precision": float, "recall": float, "f_score": float},
    "tag2": {"precision": float, "recall": float, "f_score": float},
    ...
  },
  "micro": {"precision": float, "recall": float, "f_score": float},
  "macro": {"precision": float, "recall": float, "f_score": float}
}
```
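For reference, here is a minimal sketch of how the micro and macro averages differ, starting from per-label true-positive/false-positive/false-negative counts. The `prf` and `evaluate` names, the `counts` input shape, and the zero-division fallback are illustrative assumptions, not the exact code in this PR:

```python
def prf(tp: int, fp: int, fn: int) -> dict:
    """Precision/recall/F1 from raw counts; falls back to 0.0 when undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_score = (2 * precision * recall / (precision + recall)
               if precision + recall else 0.0)
    return {"precision": precision, "recall": recall, "f_score": f_score}


def evaluate(counts: dict) -> dict:
    """counts maps each label to its (tp, fp, fn) totals."""
    per_class = {tag: prf(*c) for tag, c in counts.items()}
    # Micro: pool the raw counts across all labels, then score once,
    # so frequent labels dominate the result.
    micro = prf(sum(c[0] for c in counts.values()),
                sum(c[1] for c in counts.values()),
                sum(c[2] for c in counts.values()))
    # Macro: average the per-label scores, so every label counts equally,
    # regardless of how many examples it has.
    n = len(per_class) or 1
    macro = {k: sum(m[k] for m in per_class.values()) / n
             for k in ("precision", "recall", "f_score")}
    return {"per_class": per_class, "micro": micro, "macro": macro}
```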

Formatted table example:
(screenshot: per-label precision/recall/F1 table, sorted by score)
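A hedged sketch of the table rendering, sorted by F1 so the weakest labels are easy to spot. The `format_table` helper, column widths, and sort direction are hypothetical, chosen only to match the "sorted output" behavior described above:

```python
def format_table(per_class: dict) -> str:
    """Render per-label metrics as a fixed-width table, sorted by F1 descending."""
    header = f"{'label':<12}{'precision':>10}{'recall':>10}{'f_score':>10}"
    rows = [header, "-" * len(header)]
    for tag, m in sorted(per_class.items(),
                         key=lambda kv: kv[1]["f_score"], reverse=True):
        rows.append(f"{tag:<12}{m['precision']:>10.3f}"
                    f"{m['recall']:>10.3f}{m['f_score']:>10.3f}")
    return "\n".join(rows)


# Usage with made-up numbers:
metrics = {
    "PER": {"precision": 0.90, "recall": 0.82, "f_score": 0.86},
    "LOC": {"precision": 0.62, "recall": 0.53, "f_score": 0.57},
}
print(format_table(metrics))
```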
