Micro-Functions

PDF Highlighter

Advanced PDF text highlighter with intelligent comment positioning, multiple output formats, and extensive customization options. Supports digital PDFs directly and scanned PDFs through OCR.

10 Inputs AUTO Output

PDF file to highlight (supports both digital and scanned PDFs)

Example format with all features:

[
  {
    "keyword": "Account Number",
    "comment": "Important - Review this"
  },
  {
    "keyword": "Confidential",
    "comment": "Sensitive information",
    "highlight_color": "red",
    "highlight_style": "underline",
    "comment_fill_color": "yellow",
    "comment_border_color": "orange"
  },
  {
    "keyword": "Total",
    "comment": "Check calculation",
    "highlight_color": "blue",
    "highlight_style": "squiggly",
    "comment_fill_color": "cyan",
    "comment_text_color": "black"
  }
]

Available styles: "rectangle", "underline", "strikethrough", "squiggly"

Available colors: red, blue, green, yellow, orange, purple, pink, cyan, magenta, white, black, gray, brown, lime, navy, teal, maroon, olive, silver, gold

Comment box colors: Use "comment_fill_color" (background), "comment_border_color" (border), and "comment_text_color" (text) to customize each comment box

JSON array of keyword-comment pairs to highlight. Each object supports:
- keyword (required): Text to search for
- comment (required): Comment/note to add
- highlight_color (optional): Color name (red, blue, green, yellow, orange, purple, pink, cyan, magenta, etc.) or RGB array [0.0-1.0, 0.0-1.0, 0.0-1.0]
- highlight_style (optional): rectangle, underline, strikethrough, or squiggly
- comment_fill_color (optional): Background color of comment box
- comment_border_color (optional): Border color of comment box
- comment_text_color (optional): Text color in comment box
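Checking the array before submitting it can catch typos in field names, styles, or colors early. The following is a minimal Python sketch of such a check, not part of the function itself. The helper names (validate_pairs, is_valid_color) are hypothetical. It assumes the listed color names are the complete set and that the comment color fields accept the same values as highlight_color; neither is confirmed by the description above.

```python
import json

# Styles and color names as listed in this documentation.
ALLOWED_STYLES = {"rectangle", "underline", "strikethrough", "squiggly"}
NAMED_COLORS = {
    "red", "blue", "green", "yellow", "orange", "purple", "pink", "cyan",
    "magenta", "white", "black", "gray", "brown", "lime", "navy", "teal",
    "maroon", "olive", "silver", "gold",
}
COLOR_FIELDS = (
    "highlight_color",
    "comment_fill_color",
    "comment_border_color",
    "comment_text_color",
)


def is_valid_color(value):
    """Accept a known color name or a 3-element RGB array with values in 0.0-1.0."""
    if isinstance(value, str):
        return value.lower() in NAMED_COLORS
    if isinstance(value, (list, tuple)) and len(value) == 3:
        return all(
            isinstance(c, (int, float)) and not isinstance(c, bool) and 0.0 <= c <= 1.0
            for c in value
        )
    return False


def validate_pairs(raw_json):
    """Return a list of human-readable problems; an empty list means no problems found."""
    pairs = json.loads(raw_json)
    if not isinstance(pairs, list):
        return ["top-level value must be a JSON array"]
    errors = []
    for i, entry in enumerate(pairs):
        if not isinstance(entry, dict):
            errors.append(f"entry {i}: must be an object")
            continue
        # keyword and comment are the two required fields.
        for field in ("keyword", "comment"):
            if not isinstance(entry.get(field), str) or not entry[field].strip():
                errors.append(f"entry {i}: '{field}' is required and must be non-empty text")
        style = entry.get("highlight_style")
        if style is not None and style not in ALLOWED_STYLES:
            errors.append(f"entry {i}: unknown highlight_style {style!r}")
        for field in COLOR_FIELDS:
            if field in entry and not is_valid_color(entry[field]):
                errors.append(f"entry {i}: invalid {field} {entry[field]!r}")
    return errors
```

Calling validate_pairs on the example array shown earlier should return an empty list, while an entry with a missing comment or a misspelled style produces one message per problem.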

Output format: 'png' (default, high quality), 'jpg' (smaller size), or 'pdf' (annotated PDF with native highlights). PDF format creates a true PDF with selectable text and annotations, not images.

Default highlight style for all keywords (can be overridden per keyword in keyword_comment_pairs)

Enable case-sensitive search (default: OFF = case-insensitive). When OFF, 'account' will match 'Account', 'ACCOUNT', etc.

Highlight ALL occurrences of each keyword (default: OFF = only first occurrence). Enable to highlight every instance of the keyword throughout the document.
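The interaction between the case-sensitivity and all-occurrences options can be illustrated with a short Python sketch. This is an illustration of the described behavior only, not the function's actual implementation. The function name find_matches and its parameter names are hypothetical, and it assumes plain substring matching, since the description does not say whether whole-word matching is applied.

```python
import re


def find_matches(text, keyword, case_sensitive=False, highlight_all=False):
    """Return (start, end) character spans of the keyword in text.

    Defaults mirror the documented ones: case-insensitive, first occurrence only.
    """
    flags = 0 if case_sensitive else re.IGNORECASE
    # re.escape treats the keyword as literal text, not as a regex pattern.
    pattern = re.compile(re.escape(keyword), flags)
    spans = [m.span() for m in pattern.finditer(text)]
    return spans if highlight_all else spans[:1]


text = "Account summary: account 42, ACCOUNT closed."
first_only = find_matches(text, "account")
every_hit = find_matches(text, "account", highlight_all=True)
exact_case = find_matches(text, "account", case_sensitive=True, highlight_all=True)
```

With both options left at their defaults, only the leading "Account" is returned. Enabling all occurrences returns all three spellings, and enabling case sensitivity as well narrows the result to the single lowercase "account".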

Intelligently position comment boxes to avoid overlaps (default: ON). Uses smart algorithms to find optimal comment placement angles.
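The description does not specify how placement angles are chosen. As one plausible reading, the Python sketch below tries a fixed set of candidate angles around the highlight and keeps the first comment box that does not overlap any box already placed. All names (place_comment, overlaps), the distance, and the angle step are assumptions made for illustration; the function's real algorithm may differ.

```python
import math


def overlaps(a, b):
    """True if two (x0, y0, x1, y1) rectangles share any interior area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def place_comment(anchor, size, placed, distance=80, step_degrees=45):
    """Try candidate angles around the anchor point; return the first free box.

    anchor: (x, y) center of the highlighted text
    size:   (width, height) of the comment box
    placed: list of boxes already on the page
    Returns None if every candidate angle collides.
    """
    ax, ay = anchor
    w, h = size
    for angle in range(0, 360, step_degrees):
        # Center of the candidate box, at a fixed distance from the anchor.
        cx = ax + distance * math.cos(math.radians(angle))
        cy = ay + distance * math.sin(math.radians(angle))
        box = (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)
        if not any(overlaps(box, other) for other in placed):
            return box
    return None


placed = []
first = place_comment((100, 100), (60, 20), placed)
placed.append(first)
second = place_comment((100, 100), (60, 20), placed)
```

Here the second comment for the same anchor cannot reuse the first position, so it moves to the next candidate angle. A fuller version would also keep boxes inside the page bounds and prefer shorter connector lines.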

Crop output images to only show annotated areas with padding (default: OFF). Only applies to image outputs (png/jpg), not PDF. Useful for creating focused excerpts.
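Cropping to annotated areas with padding amounts to taking the bounding box of all annotations, growing it by the padding, and clamping it to the page. The Python sketch below shows that calculation under stated assumptions: the function name crop_box, the rectangle format (x0, y0, x1, y1), and the default padding of 20 units are hypothetical and not taken from the function's interface.

```python
def crop_box(annotation_rects, page_width, page_height, padding=20):
    """Return the (x0, y0, x1, y1) region covering all annotations plus padding.

    Falls back to the full page when there is nothing to crop around.
    """
    if not annotation_rects:
        return (0, 0, page_width, page_height)
    # Bounding box of every highlight and comment rectangle, grown by the padding.
    x0 = min(r[0] for r in annotation_rects) - padding
    y0 = min(r[1] for r in annotation_rects) - padding
    x1 = max(r[2] for r in annotation_rects) + padding
    y1 = max(r[3] for r in annotation_rects) + padding
    # Clamp to the page so the crop never extends past its edges.
    return (max(0, x0), max(0, y0), min(page_width, x1), min(page_height, y1))


region = crop_box([(100, 100, 200, 120), (150, 300, 400, 340)], 612, 792)
```

For the two rectangles above on a 612 by 792 page, the region is (80, 80, 420, 360). Annotations near the page edge produce a crop that stops at the edge instead of extending beyond it.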

Enable OCR for scanned/image-based PDFs (default: OFF). Automatically converts scanned PDFs to searchable text before highlighting. Required for PDFs that are images.

DEPRECATED: Use output_format='pdf' instead. When enabled with an image output_format, creates a PDF from the generated images.
