📊 Agentic Workflow Lock File Statistics - 2025-11-21 #4446
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 week ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This analysis examined all 89
.lock.ymlfiles in the.github/workflows/directory to identify structural patterns, usage trends, and characteristics of agentic workflows in this repository. The analysis reveals a mature ecosystem of AI-powered workflows with standardized patterns, diverse triggers, and consistent safety mechanisms.Key Findings:
Complete Statistical Analysis
Executive Summary
File Size Distribution
Size Statistics:
test-claude-oauth-workflow.lock.yml(80 KB)poem-bot.lock.yml(416 KB)Observation: The overwhelming majority (87.6%) of workflows exceed 100 KB, indicating rich, complex agentic systems with extensive configuration and safety mechanisms.
Trigger Analysis
Trigger Type Distribution
Key Insight: The dominance of
issuesandpull_requesttriggers (93% and 87% respectively) shows that agentic workflows are primarily event-driven and human-initiated, rather than fully autonomous.Common Trigger Combinations
Schedule Patterns
Most Common Cron Schedules:
0 9 * * *0 13 * * 1-50 0,6,12,18 * * *0 9 * * 1-50 10 * * *Insight: Scheduled workflows favor business hours (9 AM - 1 PM UTC) and weekday schedules, suggesting they're designed to support human workflows rather than 24/7 automation.
Safe Outputs Analysis
Safe outputs ensure that AI-generated content is reviewed before being published. This is a critical security feature.
Safe Output Types Distribution
Total workflows with safe outputs: 65 (73% of all workflows)
Discussion Categories Used
When creating discussions, workflows most commonly target:
Note: There's some inconsistency with category naming (e.g., "audits" vs "Audits" vs "audit") that could be standardized.
Workflows with Multiple Safe Outputs
Some workflows use multiple safe output mechanisms for comprehensive reporting. This pattern is less common but indicates sophisticated workflows that need to communicate through multiple channels.
Structural Characteristics
Job Complexity
Common Job Types:
activation(85 workflows, 96%) - Entry point and validationagent(85 workflows, 96%) - Main AI agent executiondetection(76 workflows, 85%) - Firewall and safety checksconclusion(77 workflows, 87%) - Cleanup and summarycreate_discussion,create_issue,add_commentInsight: The near-universal adoption of the activation → agent → detection → conclusion pattern shows strong standardization across the workflow ecosystem.
Step Complexity
Typical Workflow Structure:
Average Lock File Anatomy
Based on statistical analysis, a typical
.lock.ymlfile has:Permission Patterns
Permission Frequency Analysis
Permission Distribution Insights
contentspermission shows a 14:1 read/write ratio, indicating workflows primarily read code but rarely modify itPermission Combinations
Most workflows follow one of these patterns:
contents:read,issues:read,pull-requests:readissues:write,pull-requests:writediscussions:writecontents:writefor automated fixesEngine Distribution
The repository uses three primary AI engines to power workflows:
gh-aw-copilot-${{ github.workflow }}gh-aw-claude-${{ github.workflow }}gh-aw-codex-${{ github.workflow }}Note: Some workflows may use multiple engines, so total > 100%
Engine Selection Patterns:
Timeout Patterns
Workflow jobs use timeouts to prevent runaway executions:
Average Timeout: ~12 minutes across all jobs
Insight: The clustering around 5-10-20 minute timeouts shows deliberate tiering of job complexity, with most jobs completing quickly (10 min) but allowances for complex AI tasks (20+ min).
Concurrency Patterns
Concurrency groups prevent multiple workflow runs from interfering with each other:
gh-aw-${{ github.workflow }}gh-aw-copilot-${{ github.workflow }}gh-aw-claude-${{ github.workflow }}gh-aw-codex-${{ github.workflow }}Pattern: Nearly all workflows (96%) use workflow-specific concurrency groups, often combined with engine-specific groups. This prevents:
Tool & GitHub Actions Patterns
Most Used GitHub Actions
Key Observations:
github-scriptMCP Server Usage
While specific MCP servers aren't individually tracked in this analysis, the high frequency of mentions (961) indicates extensive use of Model Context Protocol for structured AI-agent interactions.
Feature Adoption Analysis
Security & Safety Features
Insights:
Workflow Naming Patterns
The repository follows consistent naming conventions:
test-claude-oauth-workflow,test-secret-maskingdaily-news,daily-code-metrics,daily-team-statussmoke-claude,smoke-copilot,smoke-detectorcopilot-pr-nlp-analysis,copilot-session-insightsgrumpy-reviewer,semantic-function-refactor, etc.Pattern Analysis:
Interesting Findings
1. High Standardization Despite Diversity
With 89 workflows serving different purposes, there's remarkable consistency:
This suggests strong governance and templates in workflow creation.
2. Security-First Architecture
The security posture is exceptionally strong for AI-powered automation.
3. Copilot Dominates, But Claude is Strong
The near-even split between Copilot and Claude suggests:
4. Workflows are Interactive, Not Autonomous
This shows agentic workflows are designed as AI assistants for humans, not autonomous agents.
5. Cache Memory is Growing (49% Adoption)
Nearly half of workflows use persistent cache memory, indicating:
6. Artifact-Heavy Architecture
Workflows extensively use GitHub Actions artifacts for:
7. Discussion Categories Need Standardization
The analysis found inconsistent naming:
A style guide for discussion categories would improve organization.
8. Minimal Use of Push Triggers (2%)
Only 2 workflows use push triggers, showing extreme caution about automatic code modifications. This is a strong safety signal.
Recommendations
Based on this analysis, here are recommendations for the workflow ecosystem:
1. Standardize Discussion Category Names
2. Increase Cache Memory Adoption
3. Document Engine Selection Guidelines
4. Sunset Codex (5% usage)
5. Create Workflow Complexity Tiers
Define standard tiers based on analysis:
6. Optimize Timeout Values
Current distribution: 49% use 10 min, 24% use 20 min
7. Expand Smoke Testing
Methodology
Data Collection
.github/workflows/*.lock.ymlfilesAnalysis Approach
ls -land byte countingon:section keywords- name:patterns)Cache Memory
Analysis scripts stored in
/tmp/gh-aw/cache-memory/scripts/for reuse:comprehensive_analysis.sh- Main analysis scriptdetailed_stats.sh- Extended statisticshistory/2025-11-21-analysis.jsonLimitations
Generated by Lockfile Statistics Analysis Agent on 2025-11-21
Beta Was this translation helpful? Give feedback.
All reactions