n8n node for PageCrawl.io - Website monitoring and change detection
`npm install @pagecrawl/n8n-nodes-pagecrawl`
This is an n8n community node that integrates with PageCrawl.io for website monitoring and change detection.
You can install this node directly in n8n:
1. Go to Settings > Community Nodes
2. Search for @pagecrawl/n8n-nodes-pagecrawl
3. Click Install
To use this node, you'll need a PageCrawl.io API token.
> Note: API access requires a paid PageCrawl.io plan. Free accounts do not have API access.
1. Sign up or log in to PageCrawl.io
2. Upgrade to a paid plan if you haven't already
3. Go to Settings > API
4. Copy your API token
5. In n8n, create new PageCrawl credentials and paste your token
The main node interacts with the PageCrawl.io API and supports the following resources:
#### Page Operations
- List All Pages - Get all tracked pages
- Get Page - Get specific page configuration
- Create Page - Create new tracked page with full configuration
- Create Simple Page - Quick page creation with minimal options
- Update Page - Update existing page configuration
- Delete Page - Remove a tracked page
- Run Check Now - Trigger an immediate check for a page
#### Check Operations (History)
- Get History - Retrieve check history for a page
- Get Text Diff HTML - Get text differences as HTML
- Get Text Diff Image - Get text differences as an image
- Get Text Diff Markdown - Get text differences as Markdown
#### Screenshot Operations
- Get Latest Screenshot - Get the most recent full-page screenshot
- Get Latest Screenshot Diff - Visual diff of latest vs previous
- Get Check Screenshot - Screenshot for specific check
- Get Check Screenshot Diff - Visual diff for specific check
#### Webhook Operations
- List Webhooks - Get all configured webhooks
- Create Webhook - Set up a new webhook
- Update Webhook - Modify webhook configuration
- Delete Webhook - Remove a webhook
- Test Webhook - Send test notification
The PageCrawl Trigger node is a webhook trigger that receives real-time notifications when changes are detected.
Features:
- Automatic webhook registration/deregistration
- Configurable payload fields
- Event filtering (changes, errors)
- Simplified output option
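Event filtering and the simplified output option can be pictured with a small sketch. This is illustrative only and is not the node's implementation: the `event` field, the event names `change` and `error`, and the helper names `filter_events` and `simplify` are all assumptions, and the real webhook payload may be shaped differently.

```python
from typing import Any

# Hypothetical event names; the real payload may use different values.
ALLOWED_EVENTS = {"change", "error"}


def filter_events(payloads: list[dict[str, Any]], wanted: set[str]) -> list[dict[str, Any]]:
    """Keep only payloads whose (assumed) "event" field is in `wanted`."""
    unknown = wanted - ALLOWED_EVENTS
    if unknown:
        raise ValueError(f"unsupported event types: {sorted(unknown)}")
    return [p for p in payloads if p.get("event") in wanted]


def simplify(payload: dict[str, Any], fields: list[str]) -> dict[str, Any]:
    """Return only the selected fields, skipping any that are absent."""
    return {name: payload[name] for name in fields if name in payload}
```

For example, `filter_events(payloads, {"change"})` drops error notifications, and `simplify(payload, ["url", "diff"])` keeps just those two fields when they are present.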
Example workflows:
```
[PageCrawl Trigger] → [Gmail Send Email]
```
Configure the trigger to monitor specific pages and send notifications via email when changes are detected.
```
[Schedule Trigger] → [PageCrawl Get Page] → [IF Price Changed] → [Slack Message]
```
Periodically check product prices and notify via Slack when they change.
```
[Schedule Trigger] → [PageCrawl Get Screenshot] → [Google Drive Upload]
```
Automatically save website screenshots to Google Drive for compliance or archival.
```
[PageCrawl Trigger] → [MySQL Insert]
```
Store all detected changes in a database for analysis and reporting.
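The `[IF Price Changed]` step in the price workflow comes down to comparing two extracted values. A minimal sketch of that comparison, assuming the tracked element yields price text in a US-style format such as `$1,299.00`. The function names are hypothetical, and in n8n itself this logic would normally live in an IF or Code node.

```python
import re
from decimal import Decimal
from typing import Optional


def parse_price(text: str) -> Optional[Decimal]:
    """Extract the first number from text such as "$1,299.00" (US-style formatting assumed)."""
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        return None
    return Decimal(match.group(0).replace(",", ""))


def price_changed(previous: str, current: str) -> bool:
    """True only when both values parse and the numeric amounts differ."""
    old, new = parse_price(previous), parse_price(current)
    if old is None or new is None:
        return False  # no usable price on one side, so do not notify
    return old != new
```

Comparing parsed `Decimal` values rather than raw strings avoids false alerts when only the formatting changes, for example `$1,299.00` versus `$1,299`.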
Configuration options:
- URL: The webpage to monitor
- Elements: Specific page elements to track (CSS/XPath selectors)
- Frequency: How often to check (3 minutes to weekly)
- Location: Server location for checks (US, UK, CA, DE)
- Authentication: HTTP Basic auth support
- Actions: Pre-check actions (scroll, click, wait, etc.)
- Rules: Conditional notifications based on content
- Channels: Email, Slack, Discord, Teams, Telegram
- Rule types: Text difference, content contains, number comparisons
- Advanced: Headers, proxies, user agent customization
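Some of these options have documented bounds that can be checked before a page is created. The sketch below validates a few of them. The `PageConfig` class, its field names, and its default values are hypothetical and are not the node's or the API's schema. Only the 3-minutes-to-weekly range and the four locations come from the list above.

```python
from dataclasses import dataclass, field

ALLOWED_LOCATIONS = {"US", "UK", "CA", "DE"}
MIN_FREQUENCY_MINUTES = 3
MAX_FREQUENCY_MINUTES = 7 * 24 * 60  # weekly


@dataclass
class PageConfig:
    """Hypothetical container for a subset of the page options."""

    url: str
    frequency_minutes: int = 60  # assumed default, for illustration only
    location: str = "US"         # assumed default, for illustration only
    elements: list[str] = field(default_factory=list)  # CSS/XPath selectors

    def validate(self) -> list[str]:
        """Return a list of human-readable problems; empty means valid."""
        problems = []
        if not self.url.startswith(("http://", "https://")):
            problems.append("url must start with http:// or https://")
        if not MIN_FREQUENCY_MINUTES <= self.frequency_minutes <= MAX_FREQUENCY_MINUTES:
            problems.append("frequency must be between 3 minutes and 1 week")
        if self.location not in ALLOWED_LOCATIONS:
            problems.append(f"location must be one of {sorted(ALLOWED_LOCATIONS)}")
        return problems
```

Returning every problem at once, rather than raising on the first, mirrors how validation errors are usually reported back to a workflow author.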
Rate limits:
- Default: 60 requests per minute
- Contact support for higher limits
- Rate limit errors return HTTP 429
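When calling the API from your own code, a 429 response is usually handled by waiting and retrying. A minimal sketch, assuming the request is wrapped in a caller-supplied `send` function that returns `(status, headers, body)`. Whether the API sends a `Retry-After` header is an assumption. If the header is absent, the sketch falls back to exponential backoff.

```python
import time
from typing import Callable, Mapping, Tuple

Response = Tuple[int, Mapping[str, str], str]  # (status, headers, body)


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call `send`, retrying on HTTP 429 until `max_attempts` is reached."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers, body = send()
        # Return anything that is not a rate-limit error, or the final 429.
        if status != 429 or attempt == max_attempts - 1:
            return status, headers, body
        # Prefer the server's hint if present (assumed header, whole seconds only).
        retry_after = headers.get("Retry-After", "")
        delay = float(retry_after) if retry_after.isdigit() else base_delay * 2 ** attempt
        sleep(delay)
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the retry logic can be exercised in tests without real waiting.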
Use cases:
1. E-commerce Price Monitoring - Track competitor prices and stock levels
2. Content Updates - Monitor news sites, blogs, or documentation
3. Compliance Monitoring - Ensure website content meets requirements
4. SEO Tracking - Monitor meta tags, titles, and content changes
5. Security Monitoring - Detect unauthorized website changes
6. Data Extraction - Regular scraping of structured data
The node includes comprehensive error handling:
- Validation errors (HTTP 422) with detailed messages
- Rate limiting (HTTP 429) with retry guidance
- Authentication errors with clear instructions
- Network errors with appropriate retry logic
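The categories above map naturally onto HTTP status codes. The sketch below shows one way to classify a failed request and decide whether to retry it. It is not the node's internal code. Treating 401 and 403 as authentication errors is an assumption, since the list above does not name the status codes used for them.

```python
from enum import Enum
from typing import Optional


class ErrorKind(Enum):
    VALIDATION = "validation"
    RATE_LIMIT = "rate_limit"
    AUTH = "authentication"
    NETWORK = "network"
    OTHER = "other"


def classify(status: Optional[int]) -> ErrorKind:
    """Map an HTTP status to an error category; None means no response was received."""
    if status is None:
        return ErrorKind.NETWORK
    if status == 422:
        return ErrorKind.VALIDATION
    if status == 429:
        return ErrorKind.RATE_LIMIT
    if status in (401, 403):  # assumed codes for authentication failures
        return ErrorKind.AUTH
    return ErrorKind.OTHER


def should_retry(kind: ErrorKind) -> bool:
    """Only transient failures are worth retrying."""
    return kind in {ErrorKind.RATE_LIMIT, ErrorKind.NETWORK}
```

Validation and authentication errors are not retried because repeating the same request would fail the same way until the input or the token is fixed.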
Resources:
- Documentation: PageCrawl.io Docs
- API Reference: API Documentation
- Issues: GitHub Issues
- Support: support@pagecrawl.io
License: MIT. See the LICENSE file for details.