Skip to content

Adding LLM Judge evaluation metric (#124)#124

Closed
anilkram wants to merge 1 commit into
facebookresearch:mainfrom
anilkram:export-D99365509
Closed

Adding LLM Judge evaluation metric (#124)#124
anilkram wants to merge 1 commit into
facebookresearch:mainfrom
anilkram:export-D99365509

Conversation

@anilkram
Copy link
Copy Markdown
Contributor

@anilkram anilkram commented Apr 6, 2026

Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 6, 2026
@meta-codesync
Copy link
Copy Markdown

meta-codesync Bot commented Apr 6, 2026

@anilkram has exported this pull request. If you are a Meta employee, you can view the originating Diff in D99365509.

@meta-codesync meta-codesync Bot changed the title Adding LLM Judge evaluation metric Adding LLM Judge evaluation metric (#124) Apr 6, 2026
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 6, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
Summary:
Pull Request resolved: facebookresearch#124

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 6, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 6, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 7, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 7, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 7, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
anilkram added a commit to anilkram/PrivacyGuard that referenced this pull request Apr 7, 2026
Summary:

Adding LLM as Judge metric implementation. We include templates to call Claude, GPT and Gemini models.

Reviewed By: shree-gade, mgrange1998

Differential Revision: D99365509
@meta-codesync meta-codesync Bot closed this in f9a6ba2 Apr 7, 2026
@meta-codesync
Copy link
Copy Markdown

meta-codesync Bot commented Apr 7, 2026

This pull request has been merged in f9a6ba2.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot. fb-exported Merged meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant