monitoring-error-rates
Monitor and analyze application error rates to improve reliability. Use when tracking errors in applications including HTTP errors, exceptions, and database issues. Trigger with phrases like "monitor error rates", "track application errors", or "analyze error patterns".
What this skill does
# Error Rate Monitor
Monitor and analyze application error rates across HTTP endpoints, database queries, external APIs, and background jobs with threshold-based alerting and error budget tracking.
## Overview
This skill automates the process of setting up comprehensive error monitoring and alerting for various components of an application. It helps identify, track, and analyze different types of errors, enabling proactive identification and resolution of issues before they impact users.
## How It Works
1. **Analyze Error Sources**: Identifies potential error sources within the application architecture, including HTTP endpoints, database queries, external APIs, background jobs, and client-side code.
2. **Define Monitoring Criteria**: Establishes specific error types and thresholds for each source, such as HTTP status codes (4xx, 5xx), exception types, query timeouts, and API response failures.
3. **Configure Alerting**: Sets up alerts to trigger when error rates exceed defined thresholds, notifying relevant teams or individuals for investigation and remediation.
## When to Use This Skill
This skill activates when you need to:
- Set up error monitoring for a new application.
- Analyze existing error rates and identify areas for improvement.
- Configure alerts to be notified of critical errors in real-time.
- Establish error budgets and track progress towards reliability goals.
## Examples
### Example 1: Setting up Error Monitoring for a Web Application
User request: "Monitor errors in my web application, especially 500 errors and database connection issues."
The skill will:
1. Analyze the web application's architecture to identify potential error sources (e.g., HTTP endpoints, database connections).
2. Configure monitoring for 500 errors and database connection failures, setting appropriate thresholds and alerts.
### Example 2: Analyzing Error Rates in a Background Job Processor
User request: "Analyze error rates for my background job processor. I'm seeing a lot of failed jobs."
The skill will:
1. Focus on the background job processor and identify the types of errors occurring (e.g., task failures, timeouts, resource exhaustion).
2. Analyze the frequency and patterns of these errors to identify potential root causes.
## Best Practices
- **Granularity**: Monitor errors at a granular level to identify specific problem areas.
- **Thresholding**: Set appropriate alert thresholds to avoid alert fatigue and focus on critical issues.
- **Context**: Include relevant context in error messages and alerts to facilitate troubleshooting.
## Integration
This skill can be integrated with other monitoring and alerting tools, such as Prometheus, Grafana, and PagerDuty, to provide a comprehensive view of application health and performance. It can also be used in conjunction with incident management tools to streamline incident response workflows.
## Prerequisites
- Access to application logs and metrics
- Monitoring infrastructure (Prometheus, Grafana, or similar)
- Read permissions for log files in ${CLAUDE_SKILL_DIR}/logs/
- Network access to monitoring endpoints
## Instructions
1. Identify error sources by analyzing application architecture
2. Define error types and monitoring thresholds
3. Configure alerting rules with appropriate severity levels
4. Set up dashboards for error rate visualization
5. Establish notification channels for critical errors
6. Document error baselines and SLO targets
## Output
- Error rate metrics and trends
- Alert configurations for critical thresholds
- Dashboard definitions for error monitoring
- Reports on error patterns and root causes
- Recommendations for error reduction strategies
## Error Handling
If monitoring setup fails:
- Verify log file permissions and paths
- Check monitoring service connectivity
- Validate metric export configurations
- Review alert rule syntax
- Ensure notification channels are configured
## Resources
- Monitoring platform documentation (Prometheus, Grafana)
- Application log format specifications
- Error taxonomy and classification guides
- SLO/SLI definition best practices
Related in Data & Analytics
clawarr-suite
IncludedComprehensive management for self-hosted media stacks (Sonarr, Radarr, Lidarr, Readarr, Prowlarr, Bazarr, Overseerr, Plex, Tautulli, SABnzbd, Recyclarr, Unpackerr, Notifiarr, Maintainerr, Kometa, FlareSolverr). Deep library exploration, analytics, dashboard generation, content management, request handling, subtitle management, indexer control, download monitoring, quality profile sync, library cleanup automation, notification routing, collection/overlay management, and media tracker integration (Trakt, Letterboxd, Simkl).
querying-soql
IncludedSOQL query generation, optimization, and analysis with 100-point scoring. Use this skill when the user needs SOQL/SOSL authoring or optimization: natural-language-to-query generation, relationship queries, aggregates, query-plan analysis, and performance or safety improvements for Salesforce queries. TRIGGER when: user writes, optimizes, or debugs SOQL/SOSL queries, touches .soql files, or asks about relationship queries, aggregates, or query performance. DO NOT TRIGGER when: bulk data operations (use handling-sf-data), Apex DML logic (use generating-apex), or report/dashboard queries.
app-store-optimization
IncludedApp Store Optimization (ASO) toolkit for researching keywords, analyzing competitor rankings, generating metadata suggestions, and improving app visibility on Apple App Store and Google Play Store. Use when the user asks about ASO, app store rankings, app metadata, app titles and descriptions, app store listings, app visibility, or mobile app marketing on iOS or Android. Supports keyword research and scoring, competitor keyword analysis, metadata optimization, A/B test planning, launch checklists, and tracking ranking changes.
habit-flow
IncludedAI-powered atomic habit tracker with natural language logging, streak tracking, smart reminders, and coaching. Use for creating habits, logging completions naturally ("I meditated today"), viewing progress, and getting personalized coaching.
app-store-optimization
IncludedApp Store Optimization (ASO) toolkit for researching keywords, analyzing competitor rankings, generating metadata suggestions, and improving app visibility on Apple App Store and Google Play Store. Use when the user asks about ASO, app store rankings, app metadata, app titles and descriptions, app store listings, app visibility, or mobile app marketing on iOS or Android. Supports keyword research and scoring, competitor keyword analysis, metadata optimization, A/B test planning, launch checklists, and tracking ranking changes.
visualizing-data
IncludedBuilds dashboards, reports, and data-driven interfaces requiring charts, graphs, or visual analytics. Provides systematic framework for selecting appropriate visualizations based on data characteristics and analytical purpose. Includes 24+ visualization types organized by purpose (trends, comparisons, distributions, relationships, flows, hierarchies, geospatial), accessibility patterns (WCAG 2.1 AA compliance), colorblind-safe palettes, and performance optimization strategies. Use when creating visualizations, choosing chart types, displaying data graphically, or designing data interfaces.