Consultant

Founding Data Engineer

Remote, Work must be performed in or near Stanford, CA
Apply


  • Details

    Job Type:
    Contract / Freelance
    Start Date:
    September 8, 2025
    End Date:
    September 22, 2025
    Application Deadline:
    September 7, 2025
    Education:
    4-Year Degree Required
    Compensation:
    USD $40 - $62.50 / hour
    Cause Areas:
    Climate Change, Urban Areas, Civic Engagement, Community Development, Research & Social Science, Economic Development, Entrepreneurship, Environment & Sustainability

    Description

    Building the first comprehensive database and AI-powered platform that aggregates climate resilience insights from various planning documents across all US states. The platform will enable investment professionals to quickly research climate adaptation efforts for due diligence.

    • Trial: 1-3 weeks, 20 hrs/week starting immediately
    • $40-62.5/hr based on experience
    • Remote or hybrid SF Bay Area
    • Potential founding team role based on performance

    What You'll Build

    Transform our scrappy prototype into a scalable system that processes tens of thousands of municipal documents nationwide, extracts structured project data using AI, and provides fast location-based search for climate adaptation projects.

    Key Responsibilities

    Scale the Architecture

    • Migrate prototype to production cloud infrastructure (AWS/GCP)
    • Build distributed systems for parallel document processing and web scraping
    • Design scalable databases (relational + vector) with cost/performance optimization

    Production Data Pipeline

    • Create robust ETL with error handling, monitoring, and automated retries
    • Implement accurate geocoding across inconsistent municipal address formats
    • Standardize data validation across diverse state/municipal document types
    • Build RESTful APIs and efficient search functionality

    AI-Powered Extraction

    • Scale LLM-based PDF processing while managing API costs
    • Implement semantic search across infrastructure project databases
    • Enhance AI extraction accuracy from unstructured municipal documents

    Technical Requirements

    • Languages: Python, TypeScript, SQL
    • Cloud Platforms: AWS, GCP, or Azure with distributed systems experience
    • Databases: PostgreSQL, vector databases (Pinecone, Supabase)
    • AI/ML: LangChain, vector embeddings, RAG, conversational agents
    • Data Pipeline Tools: Apache Airflow (or similar tools)
    • Web Scraping: Scrapy, Selenium, Google Custom Search API
    • Geocoding: Google Maps API, OpenStreetMap, PostGIS
    • PDF Processing: Text extraction and document parsing libraries

    Nice to have

    Technical Mindset

    • Thinks in systems and can architect for scale from day one
    • Comfortable making technical decisions with limited guidance
    • Experience debugging production issues and optimizing performance
    • Wants to shape engineering culture and hiring as the team grows

    Enjoy working in start-up environment

    • Takes ownership and drives projects to completion
    • Adapts quickly to changing requirements and priorities
    • Direct communication style with both technical and non-technical stakeholders

    Passionate about urban climate adaptation and resilience

    • Background in municipal/government document analysis
    • Familiarity with infrastructure planning or environmental data
    • Interest in climate risk assessment or sustainability tech
    • Previous work with public sector or policy-related datasets

    Building the first comprehensive database and AI-powered platform that aggregates climate resilience insights from various planning documents across all US states. The platform will enable investment professionals to quickly research climate adaptation efforts for due diligence.

    • Trial: 1-3 weeks, 20 hrs/week starting immediately
    • $40-62.5/hr based on experience
    • Remote or hybrid SF Bay Area
    • Potential founding team role based on performance

    What You'll Build

    Transform our scrappy prototype into a scalable system that processes tens of thousands of municipal documents nationwide, extracts structured project data using AI, and provides fast location-based search for climate adaptation projects.

    Key Responsibilities

    Scale the Architecture

    • Migrate prototype to production cloud infrastructure (AWS/GCP)
    • Build distributed systems for parallel document processing and web scraping
    • Design scalable databases (relational + vector) with cost…

    Location

    Remote
    Work must be performed in or near Stanford, CA
    Associated Location
    Stanford, CA, USA

    Apply to This Job

    All fields are required
    Resume must be uploaded in PDF format
    Choose a file or drag it here
    No file chosen (maximum size: 10 MB)
    I acknowledge that use of the Idealist Applicant Tracking System is subject to Idealist's Privacy Policy and Terms of Service.

    Similar Jobs

    Illustration

    Take the Next Step in Your Career

    Match with social-impact hiring managers, explore the latest job opportunities, and get notified when new opportunities meet your search criteria.