Skip to content

Anktrinity/ai-awards-judge

Repository files navigation

AI Awards Judge

A GitHub-ready starter kit for turning awards judging into a reusable, auditable, deployable workflow.

This repo packages the judging logic into a configurable system that can be reused for event tech awards, AI awards, startup competitions, innovation showcases, and shortlist reviews.

Release model

This repository is prepared for a hybrid publishing model:

  • public repo: framework, code, rubric, templates, docs, sanitized examples
  • private archive: original submissions, extracted text from real entrants, internal judging outputs, confidential notes

See docs/hybrid-publishing.md for the operating model.

What it includes

  • Config-driven rubric and category settings
  • Reusable judging methodology docs
  • A starter Python CLI for local batch runs
  • Templates for reports and master summaries
  • Docker-ready structure and environment config
  • Sanitized example inputs and outputs
  • Reusable OpenClaw skill assets

Repo layout

ai_awards_judging/
  config/                  # Rubric, category, prompt, and output settings
  docs/                    # Methodology, deployment notes, rubric, GitHub publishing
  examples/                # Sanitized sample input and output artifacts
  private_archive_reference/
  scripts/                 # Helper scripts and bootstrap helpers
  src/ai_awards_judge/     # Starter package and CLI
  templates/               # Report templates
  tests/                   # Starter tests

Current status

This is v1 scaffolded and ready for GitHub push.

Ready now:

  • standalone git repo on main
  • docs and config separation
  • starter CLI interface
  • sanitized sample data
  • hybrid publishing guidance

Still to build for a fuller product:

  • web UI or hosted API layer
  • hosted persistence or API auth layer
  • more advanced calibration workflows, adjudication queues, and reviewer assignment controls

Quick start

1. Install locally

cd ai_awards_judging
python3 -m venv .venv
source .venv/bin/activate
pip install -e .

2. Run the starter CLI

Preview config:

ai-awards-judge inspect-config

Create a starter run manifest:

ai-awards-judge plan-run --input-dir ./examples/sample_input --output-dir ./outputs

Run the initial scoring engine on sanitized text entries:

ai-awards-judge run ./examples/sample_input --output-dir ./outputs

Push to GitHub

If you already have a GitHub repo created:

git remote add origin <YOUR_GITHUB_REPO_URL>
git push -u origin main

If you use GitHub CLI and are authenticated:

gh repo create ai-awards-judge --public --source=. --remote=origin --push

Documentation

  • docs/methodology.md
  • docs/rubric.md
  • docs/deployment.md
  • docs/hybrid-publishing.md
  • docs/github-publish-checklist.md

Design principles

  • Auditable: scoring policy, prompts, and outputs are separate
  • Reusable: categories and rubric are configurable, not hardcoded
  • Defensible: evidence quality and tie-breaks are explicit
  • Portable: local-first, easy to containerize, easy to extend

Recommended GitHub repo description

Configurable AI-assisted judging system for award entries, shortlist reviews, and category ranking workflows.

Current scoring engine

The run command now supports three modes:

  • heuristic, local keyword-and-evidence scoring
  • llm, model-backed scoring with structured JSON output via an OpenAI-compatible API
  • auto, try LLM scoring first and fall back to heuristics if credentials or the API are unavailable

Every run writes:

  • judged_entries.json
  • judged_entries.csv
  • judging_report.md
  • run_manifest.json

Each judged entry includes structured review metadata such as:

  • concise summary
  • cited evidence list
  • judge id
  • scoring mode

Examples:

ai-awards-judge run ./examples/sample_input --output-dir ./outputs
ai-awards-judge run ./examples/sample_input --output-dir ./outputs --scoring-mode llm

LLM scoring setup

LLM scoring is optional.

You do not need an OpenAI key to use this project in heuristic mode.

If you want model-backed scoring, you bring your own API key in your own local environment or deployment. The repo does not include, ship with, or expect the maintainer's personal key.

In practice, that means:

  • heuristic mode works with no API key
  • auto tries LLM scoring if you provide a key, then falls back to heuristics if you do not
  • llm mode is for users who explicitly want to connect their own provider key
  • your key stays in your shell, .env, secret manager, or hosting platform settings, not in the repository

Set these before --scoring-mode llm or auto:

export OPENAI_API_KEY=...
# optional overrides
export AI_AWARDS_JUDGE_MODEL=gpt-4o-mini
export AI_AWARDS_JUDGE_BASE_URL=https://api.openai.com/v1
export AI_AWARDS_JUDGE_ID=judge-anca

If you do not want to use any external model API at all, just run:

ai-awards-judge run ./examples/sample_input --output-dir ./outputs --scoring-mode heuristic

Multi-judge reconciliation

Combine multiple judge JSON outputs into one consensus file:

ai-awards-judge reconcile judge_a/judged_entries.json judge_b/judged_entries.json --output-dir ./outputs/reconciled

This produces:

  • reconciled_entries.json
  • reconciled_entries.csv
  • reconciliation_report.md

Each reconciled entry now includes:

  • averaged consensus scores
  • score range across judges
  • low, medium, or high disagreement level
  • verdict distribution and confidence distribution

CI

A GitHub Actions workflow now runs the test suite on push and pull request.

About

Configurable AI-assisted judging system for award entries, shortlist reviews, and category ranking workflows.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages