Automate HN 'Who is Hiring' Posts into Structured Job Data with AI
Automatically extract and structure hundreds of Hacker News job postings monthly, reducing manual data entry by over 90% and accelerating talent sourcing.
Manually sifting through Hacker News 'Who is hiring?' posts for job opportunities and extracting relevant details is a time-intensive and inefficient process. This workflow automates the scraping of these monthly posts, leveraging AI to extract and structure critical job information, thereby significantly streamlining your recruitment and talent sourcing efforts.

Documentation
Automate Hacker News Job Scraping with AI
This workflow provides an automated solution for extracting job postings from Hacker News's monthly "Ask HN: Who is hiring?" threads. It's ideal for recruiters, talent acquisition specialists, and market researchers looking to efficiently track job opportunities and trends.
Key Features
- Automatically scrapes the latest 'Ask HN: Who is hiring?' posts from Hacker News.
- Filters for recent job postings, typically within the last 30 days, to ensure relevancy.
- Cleans raw HTML and unstructured text data from job descriptions for better processing.
- Utilizes advanced AI (GPT-4o-mini via LangChain) to parse unstructured text into a unified, structured JSON format (company, title, location, salary, etc.).
- Seamlessly stores all extracted and structured job data directly into an Airtable base for easy management and analysis.
How It Works
The workflow initiates by querying the Hacker News Algolia API to find the latest 'Ask HN: Who is hiring?' posts. It then retrieves the main post and all its child comments (individual job postings). A custom code node cleans the raw text of each job post by removing HTML tags and other artifacts. This cleaned text is then fed into an AI Chat Model (OpenAI GPT-4o-mini) via a LangChain node, which, guided by a structured output parser, extracts key job details such as company, title, location, salary, and description into a standardized JSON format. Finally, these structured job entries are written to your specified Airtable base, creating a searchable and organized database of Hacker News job leads.