Founding Data Engineer (Core Data Platform)

embedding-vc · OpenArt
🌍 Remote · 📍 San Francisco Bay Area · Full-time

About this role

🎨 About OpenArt

OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next generation of creative tools powered by cutting-edge AI, enabling anyone to create videos, visuals, characters, and stories with unprecedented speed and imagination. We believe the future of creativity is AI-native, and we’re shaping that future.

🚀 Why Join OpenArt

  • Own the entire data foundation of a fast-scaling AI company, from raw data to executive metrics.

  • Build from 0 → 1: define the architecture that powers product, finance, and company-wide decision-making.

  • High visibility and impact: your work directly informs leadership, product direction, and company strategy.

  • Founder-led, fast-moving culture: high ownership, low process, high trust.

  • AI-native company: help define how data supports AI systems, agents, and long-term intelligence.

  • 7–10X revenue growth over the past 2 years, now scaling the data layer to match.

🎯 About the Role

We’re looking for a Founding Data Engineer to build and own OpenArt’s core data platform and source of truth, supporting product, finance, and leadership decision-making.

This is a 0 → 1 role focused on data reliability, modeling, and long-term scalability, not just analytics or dashboarding.

You will define how data is structured, validated, and served across the company, ensuring that key metrics are consistent, trusted, and production-grade.

You’ll work closely with the Head of Data, engineering, and leadership to establish a robust data foundation that scales with the company.

🛠 What You’ll Do

  • Design and build core data pipelines (e.g., product events, payments, internal systems → BigQuery)

  • Define and maintain the data warehouse architecture, including schema design, data modeling, and table structure

  • Establish and own the single source of truth (SOT) for product and business metrics

  • Build and maintain core data models (user, subscription, revenue, engagement, etc.)

  • Ensure data consistency across systems (product analytics, billing, internal tools)

  • Lead data reconciliation efforts (e.g., Stripe vs internal systems vs reporting)

  • Implement data quality checks, validation, and monitoring systems

  • Build reliable reporting layers used by leadership and finance (not ad hoc dashboards)

  • Establish data standards and contracts (event naming, schema governance, tracking consistency)

  • Partner with engineering to improve instrumentation and data correctness at source

  • Support downstream teams (analytics, DS) by providing clean, well-documented datasets

  • Continuously improve data reliability, performance, and cost efficiency

🧑‍💻 What We’re Looking For

Core Requirements

  • 5+ years of experience in data engineering or analytics engineering

  • Proven experience building data platforms or warehouses from 0 → 1

  • Strong SQL and Python: you write clean, production-quality data code

  • Deep expertise in data modeling, ETL/ELT design, and warehouse architecture

  • Experience with the modern data stack:

    • BigQuery / Snowflake / Redshift

    • dbt or similar transformation tools

    • Workflow orchestration tools (Airflow / Prefect or similar)

  • Experience working with financial and product data (e.g., payments, subscriptions, usage data)

  • Strong understanding of data reliability, testing, and validation

  • Ability to translate business definitions into durable, consistent data models

  • High ownership: you can define and drive architecture decisions independently

  • Comfortable operating in ambiguous, fast-moving environments

Nice to Have

  • Experience building data systems for finance or revenue reporting

  • Experience with data reconciliation across multiple systems

  • Familiarity with BI tools (Metabase, Looker, etc.)

  • Experience designing semantic layers or metric definitions

  • Prior experience as an early or founding data hire

⚙ Tech Stack You’ll Work With

BigQuery, dbt (or similar), Airbyte/Fivetran (or custom pipelines), Metabase, Amplitude, Stripe, Python, SQL, GCP

💰 Compensation

  • Competitive base salary and bonus program

  • Equity: meaningful ownership in what you build

  • High autonomy, high growth environment

🌍 Work Setup

  • Bay Area preferred (hybrid allowed)

  • Visa sponsorship available

  • We’ll consider remote

Frequently Asked Questions

Is the salary disclosed for the Founding Data Engineer (Core Data Platform) position at embedding-vc?
The salary for this Founding Data Engineer (Core Data Platform) role at embedding-vc is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Is the Founding Data Engineer (Core Data Platform) job at embedding-vc remote?
Yes, this Founding Data Engineer (Core Data Platform) position at embedding-vc is remote, with team members based in the San Francisco Bay Area. You can work from home or anywhere in the supported regions.
Is the Founding Data Engineer (Core Data Platform) role at embedding-vc full-time or part-time?
This is listed as a full-time position. It is posted as a Founding Data Engineer (Core Data Platform) role in the OpenArt department at embedding-vc.
Which team or department does the Founding Data Engineer (Core Data Platform) at embedding-vc belong to?
This Founding Data Engineer (Core Data Platform) position is part of the OpenArt department at embedding-vc. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Founding Data Engineer (Core Data Platform) position at embedding-vc?
Click the "Apply Now" button on this page. You will be redirected to embedding-vc's official application portal, hosted on Ashby, where you can submit your application directly.
When was the Founding Data Engineer (Core Data Platform) job at embedding-vc posted?
This Founding Data Engineer (Core Data Platform) position at embedding-vc was posted on Mar 26, 2026. Apply as soon as possible; early applications are often reviewed first.