DocuTray
Skills

Skills

Drop-in skills for Claude and other agents — give your assistant document-understanding superpowers

@docutray/skills betanpx @docutray/skills installnpm install -g @docutray/skills

docutray-skills

docutray-skills is a curated collection of agent skills that expose the DocuTray API to AI assistants — Claude, Cursor, Cline and any MCP-compatible client. Drop them into your agent and it can convert documents, look up document types, and validate fields without you writing a single tool wiring.

Status: Beta. We add and refine skills as we learn how agents actually use the API in production.

Why skills, not just an SDK?

LLMs work best with focused, well-described tools. A naive "here's the OpenAPI spec" approach overwhelms them. docutray-skills ships:

  • Hand-written prompts that explain when to call each tool.
  • Tight JSON schemas instead of the full OpenAPI surface.
  • Few-shot examples showing typical doc-understanding flows.
  • Error recovery patterns for low-confidence fields and retries.

The result: agents pick the right tool more often, and call it with better arguments.

Install

npx @docutray/skills install

You'll need a DocuTray API key in the environment:

export DOCUTRAY_API_KEY=sk_live_...

Available skills

SkillWhat the agent gets
docutray.convertExtract structured data from any supported doc
docutray.types.listBrowse supported document types
docutray.types.describeRead the schema for a type before calling convert
docutray.validateCheck a field's value against the schema
docutray.webhooks.recentList recent webhook events for debugging

See the skill overview for full install instructions, contents, and usage details.

Quick example

A Claude session with the skill installed:

User:  Pull the totals out of these three invoices and sum them.
       [attaches 3 PDFs]

Claude: I'll convert each one with docutray.convert, then sum the
        net_total fields...

[uses docutray.convert × 3]
[reads the structured response]

Claude: The three invoices total €4,287.50. Here's the breakdown:
        - INV-1042: €1,200.00
        - INV-1043:   €987.50
        - INV-1044: €2,100.00

No glue code. The agent reads the schema, calls the tool, parses the response.

Source

docutray-skills is open source: github.com/docutray/docutray-skills

On this page