Automate content creation, translation, and publishing with n8n, Firecrawl, OpenAI, and Ghost CMS. This tutorial will guide you through every step
System Architecture
Firecrawl: Crawls and extracts content from source blogs
OpenAI API: Translates the extracted content
Ghost CMS: Destination for translated content
n8n: Orchestrates the entire workflow
Prerequisites
n8n instance (self-hosted or cloud)
OpenAI API key
Ghost CMS admin access
Firecrawl API key
Setting Up Firecrawl
First, install and configure Firecrawl:
from firecrawl import FirecrawlApp
app = FirecrawlApp(api_key="YOUR_FIRECRAWL_API_KEY")
# Configure crawling parameters
crawl_params = {
'limit': 300,
'scrapeOptions': {'formats': ['markdown']}
}
n8n Workflow Setup
Create a New Workflow in n8n
Open your n8n instance
Click "Create New Workflow"
Name it "Blog Translation Automation"
Add Webhook Trigger Node
Add a webhook node as the trigger
Configure it to accept POST requests
Copy the webhook URL for later use
Add Firecrawl Node
Use HTTP Request node to call Firecrawl API
Configure the crawling parameters
Add OpenAI Translation Node
Configure OpenAI node with your API key
Set up translation parameters:
{
"model": "gpt-3.5-turbo",
"messages": [
{
"role": "system",
"content": "Translate the following content to [target language]"
},
{
"role": "user",
"content": "{{$node['Firecrawl'].json.markdown}}"
}
]
}
Add Ghost CMS Node
Configure Ghost credentials
Set up post creation parameters
Workflow Configuration
Create the following node sequence:
Webhook → Triggers the workflow
Firecrawl HTTP Request → Crawls the source blog
Function → Processes crawled content
OpenAI → Translates content
Ghost → Creates new post
Ghost CMS Integration
Configure the Ghost node with these settings:
{
"operation": "create",
"title": "{{$node['OpenAI'].json.title}}",
"content": "{{$node['OpenAI'].json.translated_content}}",
"status": "draft",
"tags": ["translated", "automated"]
}
Automation Schedule
Set Up Timing
Add Schedule node to trigger workflow automatically
Configure desired frequency (e.g., daily, weekly)
Error Handling
Add Error Trigger nodes
Configure notification settings for failed executions
Testing and Deployment
Test the Workflow
Use test data to verify each node
Check translation quality
Verify Ghost post creation
Monitor Performance
Set up workflow analytics
Track successful executions
Monitor error rates
Security Considerations
Store API keys securely in n8n credentials
Use environment variables for sensitive data
Implement rate limiting for API calls
Best Practices
Content Processing
Clean HTML before translation
Preserve formatting
Maintain metadata
Translation Quality
Review translated content periodically
Adjust OpenAI prompts as needed
Keep track of common translation issues
Resource Management
Implement caching for crawled content
Set up proper error handling
Monitor API usage limits
This automation will continuously crawl your specified blog, translate new content, and publish it to your Ghost CMS instance. Remember to regularly monitor the workflow and adjust parameters as needed for optimal performance.