Overview
We're seeking a talented Data Platform Developer to join our Platform Engineering team and architect the data foundation that will power Newforma's next generation of AI-driven capabilities and analytics. You'll design and implement modern data architectures including medallion / lakehouse patterns, build event-driven data pipelines that process billions of project documents and communications in real-time, and create the analytics infrastructure that enables both business intelligence and AI / ML initiatives. This is a foundational role at an exciting time—as we migrate to AWS and invest heavily in AI, you'll establish the data practices and infrastructure that will serve the company for years to come.
Newforma manages billions of emails, documents, RFIs, submittals, drawings, and project files for thousands of construction projects worldwide. This rich dataset represents an incredible opportunity for AI-powered insights, intelligent automation, and advanced analytics. You'll build the data infrastructure to unlock this potential, creating pipelines that transform raw project data into clean, structured, and AI-ready datasets while also enabling real-time analytics and business intelligence. Working closely with our Director of AI Engineering and Platform Engineering team, you'll establish data architecture patterns that support everything from semantic search and RAG systems to executive dashboards and predictive analytics.
In this role, your responsibilities will include:
Data Architecture & Strategy
- Design and implement medallion architecture (bronze, silver, gold layers) or lakehouse patterns on AWS to organize and transform data at scale
- Establish data modeling standards, governance practices, and quality frameworks across the organization
- Define data retention, archival, and lifecycle management policies for massive volumes of project data
- Create reference architectures and best practices for data engineering across teams
- Partner with the Director of AI Engineering to design data pipelines optimized for AI / ML workloads including vector embeddings and model training
- Work with the Lead Software Architect to ensure data architecture aligns with overall platform strategy
- Design data schemas and structures that support both analytical queries and AI applications
- Design and implement event-driven data architectures using AWS EventBridge, Kinesis, MSK (Kafka), SNS, and SQS
- Build real-time data streaming pipelines that capture, process, and route project events across the platform
- Architect event schemas and patterns for domain events (document uploads, email filing, RFI submissions, etc.)
- Implement change data capture (CDC) patterns to stream database changes to data lakes and analytics systems
- Design event-driven workflows that trigger AI processing, notifications, and downstream system updates
- Establish event governance including versioning, documentation, and monitoring
- Optimize event processing for low latency and high throughput at scale
- Build robust, scalable ETL / ELT pipelines using AWS Glue, Step Functions, Lambda, and EMR
- Develop data transformation jobs that cleanse, enrich, and structure unstructured project data
- Implement data quality checks, validation rules, and monitoring throughout pipelines
- Create reusable pipeline components and frameworks that teams can leverage
- Optimize pipeline performance and cost efficiency for processing billions of documents
- Handle diverse data formats including emails, PDFs, CAD drawings, images, and structured databases
- Implement data lineage tracking and metadata management
- Design and build data warehouses and data marts using Amazon Redshift, Athena, or similar technologies
- Create dimensional models and star schemas optimized for analytical queries
- Build datasets and aggregations that power executive dashboards and operational reports
- Implement BI solutions using tools like QuickSight, Tableau, Power BI, or similar platforms
- Partner with product and business teams to understand analytics requirements and deliver insights
- Create self-service analytics capabilities that empower teams to explore data independently
- Establish KPIs, metrics, and reporting frameworks for product and business analytics
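The medallion flow described in the bullets above (raw "bronze" events cleansed into "silver", then aggregated into "gold" for dashboards) can be sketched in plain Python. This is a toy illustration only: the CDC record shape, field names (`rfi_id`, `project`, `status`), and quality rule are hypothetical stand-ins for a real AWS Glue / Kinesis implementation.

```python
from datetime import datetime

# Bronze: raw CDC-style events land as-is (hypothetical records from a project DB).
bronze = [
    {"op": "insert", "table": "rfi", "ts": "2024-03-01T12:00:00Z",
     "row": {"rfi_id": "R-101", "project": "bridge-7", "status": "OPEN "}},
    {"op": "insert", "table": "rfi", "ts": "not-a-date",
     "row": {"rfi_id": "R-102", "project": "bridge-7", "status": "open"}},
]

def to_silver(record):
    """Cleanse and validate one bronze record; return None if it fails quality checks."""
    try:
        ts = datetime.fromisoformat(record["ts"].replace("Z", "+00:00"))
    except ValueError:
        return None  # quarantine malformed timestamps rather than passing them downstream
    row = record["row"]
    return {
        "rfi_id": row["rfi_id"],
        "project": row["project"],
        "status": row["status"].strip().upper(),  # normalize inconsistent casing/whitespace
        "event_time": ts.isoformat(),
    }

# Silver: only records that pass validation survive.
silver = [s for r in bronze if (s := to_silver(r)) is not None]

# Gold: an aggregate ready for a dashboard (open RFI count per project).
gold = {}
for s in silver:
    if s["status"] == "OPEN":
        gold[s["project"]] = gold.get(s["project"], 0) + 1
```

In production, each layer would be a separate storage location (e.g. S3 prefixes or Delta/Iceberg tables), with the quarantine path feeding a data-quality dashboard rather than silently dropping records.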
AI / ML Data Infrastructure
- Prepare and structure data to support AI initiatives including document classification, semantic search, and intelligent agents
- Build pipelines for generating and storing vector embeddings for RAG (Retrieval-Augmented Generation) systems
- Create training datasets and feature stores for machine learning models
- Implement data versioning and experiment tracking for AI / ML workflows
- Design scalable inference pipelines that serve AI models with fresh, contextualized data
- Collaborate with the AI Engineering team to optimize data formats and access patterns for LLM applications
Data Operations & Monitoring
- Implement comprehensive monitoring, alerting, and observability for data pipelines and systems
- Build data quality dashboards and anomaly detection systems
- Create operational runbooks and documentation for data platform components
- Optimize costs across data storage, processing, and querying
- Ensure data security, encryption, and compliance with privacy regulations
- Participate in on-call rotation to support production data systems
- Collaborate with other platform engineering team members to accomplish tasks
- Participate in agile ceremonies including daily stand-ups, sprint planning, and retrospectives
- Work closely with development teams and with the software architect to establish good data engineering practices for newly developed features
Requirements
- 5+ years of experience in data engineering, analytics engineering, or related roles
- Strong hands-on experience with AWS data services including S3, Glue, Athena, Redshift, Kinesis, EventBridge, Lambda, and EMR
- Proven expertise designing and implementing event-driven architectures using streaming technologies (Kafka / MSK, Kinesis, EventBridge)
- Experience building medallion architectures, lakehouse platforms, or similar modern data architectures (bronze / silver / gold patterns, Delta Lake, Iceberg)
- Proficiency with SQL and database design, including both relational (PostgreSQL, MySQL) and analytical databases (Redshift, Snowflake)
- Strong programming skills in Python for data processing, transformation, and automation
- Experience with data orchestration tools such as Apache Airflow, AWS Step Functions, or Prefect
- Knowledge of data modeling techniques including dimensional modeling, star schemas, and data vault
- Experience with analytics and BI tools (QuickSight, Tableau, Power BI, Looker) and building reports / dashboards
- Understanding of data quality, data governance, and master data management principles
- Familiarity with infrastructure-as-code (Pulumi, Terraform, CloudFormation) for managing data infrastructure
- Strong problem-solving skills and ability to optimize complex data workflows
- Excellent communication skills with ability to explain technical concepts to diverse audiences
- Team player who collaborates effectively across engineering, product, and business teams
Nice-to-have qualifications
- AWS certifications (AWS Data Analytics - Specialty, AWS Solutions Architect, or similar)
- Experience with Azure data services (Data Factory, Synapse, Event Hubs) and Azure-to-AWS data migrations
- Knowledge of real-time stream processing frameworks (Apache Spark Streaming, Flink, Kafka Streams)
- Experience preparing data for AI / ML applications including vector databases (Pinecone, Weaviate, pgvector)
- Familiarity with document processing, OCR, and unstructured data extraction techniques
- Experience with data catalog and metadata management tools (AWS Glue Data Catalog, Alation, Collibra)
- Knowledge of .NET / C# and integrating data pipelines with .NET applications
- Understanding of SaaS multi-tenancy patterns in data architecture
- Experience with data privacy and compliance frameworks (GDPR, SOC 2, CCPA)
- Background in the AECO industry or project management domain
- Familiarity with graph databases (Neptune, Neo4j) for relationship modeling
- Experience with serverless data architectures and cost optimization strategies
- Knowledge of dbt (data build tool) or similar transformation frameworks
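For candidates curious about the vector-embedding and semantic-search work mentioned under AI / ML Data Infrastructure, here is a minimal, library-free sketch of the idea. The hash-based `embed` function is a toy stand-in for a real embedding model, and the in-memory list stands in for a vector database such as pgvector or Pinecone; all document IDs and texts are made up.

```python
import hashlib
import math

def embed(text, dim=16):
    """Toy embedding: hash word tokens into a fixed-size, L2-normalized vector.
    (A real pipeline would call an embedding model instead.)"""
    vec = [0.0] * dim
    for token in text.lower().split():
        bucket = int(hashlib.md5(token.encode()).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a, b):
    # Vectors are already normalized, so the dot product is the cosine similarity.
    return sum(x * y for x, y in zip(a, b))

# In-memory "vector store" standing in for a real vector database.
store = []
for doc_id, text in [
    ("rfi-101", "steel beam clarification for north elevation"),
    ("sub-204", "concrete mix design submittal review"),
]:
    store.append({"id": doc_id, "vector": embed(text), "text": text})

def search(query, k=1):
    """Rank stored documents by similarity to the query and return the top-k IDs."""
    q = embed(query)
    ranked = sorted(store, key=lambda d: cosine(q, d["vector"]), reverse=True)
    return [d["id"] for d in ranked[:k]]
```

The production version replaces `embed` with model inference, batches embedding generation in the data pipeline, and writes vectors to a dedicated store with metadata for filtering, but the retrieval shape (embed, compare, rank) is the same.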