Python Automation Engineer – Multi-Source Scraping & Data Pipeline Build

Remote Full-time
We are looking for a Python automation engineer to build a fully automated data pipeline that gathers AI company data from multiple sources (APIs + web scraping), deduplicates it intelligently, and outputs clean structured data to Airtable or Notion on a weekly schedule. You must have proven experience building production-grade scrapers, not basic scripts. Required: Strong Python (Scrapy, BeautifulSoup, requests) API integrations (REST, authenticated APIs) Experience automating recurring pipelines (cron jobs, scheduled tasks, etc.) Data cleaning, deduplication logic, CSV/JSON handling Ability to write clean, well-structured code Nice to have (not required): Selenium or Playwright Experience with Airtable/Notion API Experience with LLMs for data enrichment Deliverables: Scrapers for multiple AI-related sources (APIs + websites) Deduplication + merging logic across sources Weekly automated update pipeline Output to Airtable/Notion in structured columns Clear documentation so we can maintain it long-term This project should take 2–3 weeks to build, with optional monthly maintenance. If you’ve built multi-source scrapers before, please apply with examples. Apply tot his job
Apply Now

Similar Opportunities

Senior Marketing Data Engineer

Remote

Data Analyst/Engineer - Salesforce, Stripe, Snowflake & Hex Pipelines - Contract to Hire

Remote

Data Engineer- ETL / ELT - Hybrid / Remote (Columbus)

Remote

Principal Consultant (Data Protection SME)

Remote

Cyber Security Engineer (Data Loss Prevention) - Birmingham

Remote

Staff Product Manager, SaaS Data Protection - Salesforce

Remote

Data Security & Compliance Advisor

Remote

Data Privacy Officer

Remote

Data Protection & Classification Specialist

Remote

Technical Product Manager – Data and Infrastructure

Remote

Entry-Level Data Entry Clerk Administrator – Fully Remote Opportunity for Detail-Oriented Professionals to Join blithequark

Remote

Senior Manager, Software Engineering - Remote or Hybrid from MN or DC

Remote

Utilization Management Nurse Consultant - Work at Home

Remote

Python Developer - Remote Contract Job at Fusion Solutions, LLC in Plano

Remote

Experienced Implementation Specialist for Guest Engagement – Remote (USA) – Customer Experience & Project Management

Remote

Petroleum Transportation Logistics Dispatch Manager

Remote

**Experienced Home Depot Customer Support Specialist - Remote Customer Care and Sales Role with Competitive $25/Hour Salary**

Remote

**Experienced Full Stack Data Entry Associate – Remote Work Opportunity at arenaflex**

Remote

Biometrics Senior Consultant- Remote/Delivery Center Role

Remote

Entry-Level Data Entry Clerk – Full-Time Remote Opportunity for Career Growth and Development with blithequark

Remote
← Back to Home