pp-apify
Every Apify platform feature, plus a local SQLite store, cross-Actor search, novelty diffing, cost-aware runs, and... Trigger phrases: `scrape twitter for AI mentions`, `run apify actor`, `what's new in my reddit scraper`, `build a newsletter from this week's scrapes`, `how much did my apify scrapes cost this month`, `use apify`, `run apify`.
What this skill does
<!-- GENERATED FILE — DO NOT EDIT.
This file is a verbatim mirror of library/developer-tools/apify/SKILL.md,
regenerated post-merge by tools/generate-skills/. Hand-edits here are
silently overwritten on the next regen. Edit the library/ source instead.
See the repository agent guide, section "Generated artifacts: registry.json, cli-skills/". -->
# Apify — Printing Press CLI
## Prerequisites: Install the CLI
This skill drives the `apify-pp-cli` binary. **You must verify the CLI is installed before invoking any command from this skill.** If it is missing, install it first:
1. Install via the Printing Press installer. It defaults binaries to `$HOME/.local/bin` on macOS/Linux and `%LOCALAPPDATA%\Programs\PrintingPress\bin` on Windows:
```bash
npx -y @mvanhorn/printing-press-library install apify --cli-only
```
2. Verify: `apify-pp-cli --version`
3. Ensure the reported install directory is on `$PATH` for the agent/runtime that will invoke this skill.
If the `npx` install fails (no Node, offline, etc.), fall back to a direct Go install (requires Go 1.26.3 or newer):
```bash
go install github.com/mvanhorn/printing-press-library/library/developer-tools/apify/cmd/apify-pp-cli@latest
```
If `--version` reports "command not found" after install, the runtime cannot see the binary directory on `$PATH`. Do not proceed with skill commands until verification succeeds.
## When to Use This CLI
Reach for apify-pp-cli when you need to orchestrate multiple Apify Actors as part of a recurring research, scraping, or newsletter pipeline. It's the right tool when you want per-run cost visibility, novel-only output, cross-Actor search, or schedule-as-code. For Actor development (writing your own scrapers), use the official `apify-cli`; this CLI is for platform operators.
## Unique Capabilities
These capabilities aren't available in any other tool for this API.
### Local state that compounds
- **`run`** — Run an Actor and emit only items not seen in prior runs of that Actor.
_Reach for this when you need 'what's new since last run' across recurring scrapes for a newsletter or alert workflow._
```bash
apify-pp run trudax/reddit-scraper --input @sub-list.json --only-new --format markdown
```
- **`search items`** — Full-text search across every cached dataset from every Actor, normalized into a common schema.
_Use this when you want 'what did anyone say about X this week' without juggling 8 dataset IDs._
```bash
apify-pp search "model context protocol" --since 7d --actors twitter,reddit,hn --json --select url,title,source_actor
```
- **`digest --offline`** — Run queries and template renders against the local SQLite copy of past datasets without hitting the API.
_Use this while tuning templates or dedupe heuristics so iteration is free._
```bash
apify-pp digest --offline --topic AI --since 7d --template draft.tmpl
```
### Cost discipline
- **`cost report`** — Per-run USD cost report joining cached run metadata with the Apify monthly usage endpoint.
_Reach for this whenever the user asks 'why is my Apify bill so high' or 'which Actor costs most per useful item.'_
```bash
apify-pp cost report --since 30d --group-by actor,schedule --json
```
- **`run --max-cost`** — Pre-flight cost projection from local p50/p90 of past runs for the same Actor; aborts run if projection exceeds budget.
_Use this on any long-tail or untrusted Actor to fail-closed on cost rather than learning about it on the invoice._
```bash
apify-pp run apidojo/twitter-scraper-lite --input @q.json --max-cost 0.50 --max-cu 100
```
- **`ab run`** — Run the same input through two competing Actors, normalize via unified schema, report cost-per-novel-item and overlap percentage.
_Reach for this whenever the user wonders 'is the cheaper Actor good enough' before committing to a recurring schedule._
```bash
apify-pp ab run apidojo/tweet-scraper kaitoeasyapi/twitter-x-data-tweet-scraper-pay-per-result-cheapest --input shared.json --judge novelty --json
```
### Newsletter workflows
- **`digest`** — Render a markdown digest from local store: dedupe by URL + title-similarity, rank by engagement+recency+novelty, fill a Go template.
_Use this as the last step of any newsletter / weekly report workflow; output is ready to paste into Beehiiv or pipe to a writer agent._
```bash
apify-pp digest --topic "AI dev tools" --since 24h --template weekly.tmpl
```
- **`workflow run`** — Run a YAML-declared workflow that chains run → normalize → novelty → digest → publish across multiple Actors.
_Use this when the user wants a recurring 'fire one command, get the newsletter' pipeline rather than orchestrating Actor calls by hand._
```bash
apify-pp workflow run ./weekly-newsletter.yaml --json
```
### GitOps
- **`schedules apply`** — Declarative cron + Actor input bundle with terraform-style plan/apply/diff against the live Apify schedule API.
_Reach for this when the user manages 3+ schedules and wants them version-controlled._
```bash
apify-pp schedule apply ./schedules.yaml --dry-run && apify-pp schedule diff
```
### Repeatability
- **`preset save`** — Capture known-good Actor input JSON from a prior run and replay with overrides.
_Use this to stop hand-rebuilding input JSON every time you re-run a recurring scrape._
```bash
apify-pp preset save twitter weekly-ai --from-run abc123 && apify-pp call twitter --preset weekly-ai --override [email protected]
```
## Command Reference
**actor-builds** — The API endpoints described in this section enable you to manage, and delete Apify Actor builds.
Note that if any returned build object contains usage in dollars, your effective
unit pricing at the time of query has been used for computation of this dollar equivalent, and hence it should be
used only for informative purposes.
You can learn more about platform usage in the [documentation](https://docs.apify.com/platform/actors/running/usage-and-resources#usage).
- `apify-pp-cli actor-builds delete` — Delete the build. The build that is the current default build for the Actor cannot be deleted. Only users with build...
- `apify-pp-cli actor-builds get` — Gets a list of all builds for a user. The response is a JSON array of objects, where each object contains basic...
- `apify-pp-cli actor-builds get-actorbuilds` — Gets an object that contains all the details about a specific build of an Actor. By passing the optional...
**actor-runs** — The API endpoints described in this section enable you to manage, and delete Apify Actor runs.
If any returned run object contains usage in dollars, your effective unit pricing at the time of query
has been used for computation of this dollar equivalent, and hence it should be used only for informative purposes.
For completed runs, aggregated fields such as `stats` or dollar usage totals are eventually consistent and update within a few seconds. For values that must match finalized totals, wait about 10 seconds after the run completed, then fetch the run again.
You can learn more about platform usage in the [documentation](https://docs.apify.com/platform/actors/running/usage-and-resources#usage).
- `apify-pp-cli actor-runs delete` — Delete the run. Only finished runs can be deleted. Only the person or organization that initiated the run can delete it.
- `apify-pp-cli actor-runs get` — Gets a list of all runs for a user. The response is a list of objects, where each object contains basic information...
- `apify-pp-cli actor-runs get-actorruns` — This is not a single endpoint, but an entire group of endpoints that lets you retrieve the run or any of its default...
- `apify-pp-cli actor-runs put` — This endpoint can be used to update both the run's status message and to configure its general resource access...
**actor-tasks** — The API endpoints described in this section enable you to create, manage, delete, and run Apify Actor tasks.
For more information, see the Related in Writing & Docs
jax-development
IncludedUse this skill when the user is writing, debugging, profiling, refactoring, reviewing, benchmarking, parallelising, exporting, or explaining JAX code, or when they mention JAX, jax.numpy, jit, grad, value_and_grad, vmap, scan, lax, random keys, pytrees, jax.Array, sharding, Mesh, PartitionSpec, NamedSharding, pmap, shard_map, Pallas, XLA, StableHLO, checkify, profiler, or the JAX repo. It helps turn NumPy or PyTorch-style code into pure functional JAX, fix tracer/control-flow/shape/PRNG bugs, remove recompiles and host-device syncs, choose transforms and sharding strategies, inspect jaxpr/lowering/IR, and benchmark compiled code correctly.
nature-article-writer
IncludedDrafts, rewrites, diagnostically critiques, and style-calibrates primary research manuscripts for Nature and Nature Portfolio journals. Use when the user wants a Nature-style title, summary paragraph or abstract, introduction, results, discussion, methods, figure legends, presubmission enquiry, cover letter, reviewer response, or when a scientific draft sounds generic, jargon-heavy, structurally weak, or AI-ish and needs precise, broad-reader-friendly prose without inventing data, analyses, or references. Best for primary research articles and letters rather than reviews or press releases unless explicitly adapting one.
deckrd
IncludedDocument-driven framework that derives requirements, specifications, implementation plans, and executable tasks from goals through structured AI dialogue. Use when user says "write requirements", "create spec", "plan implementation", "derive tasks", "structure this feature", "break down into tasks", or "document this module". Also use for reverse engineering existing code into docs (/deckrd rev). Do NOT use for direct code writing — use /deckrd-coder after tasks are generated. Do NOT use when the user only wants to run or fix existing code without planning.
clinical-decision-support
IncludedGenerate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug development, clinical research, and evidence synthesis.
handling-sf-data
IncludedSalesforce data operations with 130-point scoring. Use this skill to create, update, delete, bulk import/export, generate test data, and clean up org records using sf CLI and anonymous Apex. TRIGGER when: user creates test data, performs bulk import/export, uses sf data CLI commands, needs data factory patterns for Apex tests, or needs to seed/clean records in a Salesforce org. DO NOT TRIGGER when: SOQL query writing only (use querying-soql), Apex test execution (use running-apex-tests), or metadata deployment (use deploying-metadata).
accelint-ac-to-playwright
IncludedConvert and validate acceptance criteria for Playwright test automation. Use when user asks to (1) review/evaluate/check if AC are ready for automation, (2) assess if AC can be converted as-is, (3) validate AC quality for Playwright, (4) turn AC into tests, (5) generate tests from acceptance criteria, (6) convert .md bullets or .feature Gherkin files to Playwright specs, (7) create test automation from requirements. Handles both bullet-style markdown and Gherkin syntax with JSON test plan generation and validation.