Search jobs > Toronto, ON > Data engineer

Data Engineer (Data Bricks)

HCLTech
Toronto, Ontario, Canada
$60 an hour (estimated)
Full-time

Position : Data Engineer (Data Bricks)

Location : Toronto, Ontario / Remote

Skills Requirements : JD

Data Bricks :

  • Strong hands on in Pyspark and Apache Spark
  • Strong hands on in Medallion architecture
  • Experience in Native Spark Migration to Databricks.
  • Experience in Building Data Governance Solutions like Unity Catalog, Azure Purview etc.
  • Highly experienced in Usability Optimization (Auto Compaction, ZOrdering, Vaccuming), Cost Optimization and Performance Optimization.
  • Build Very Strong Orchestration Layer in Databricks / ADF . Workflows.
  • Build CICD for Databricks in Azure Devops.
  • Process near Real time Data thru Auto Loader, DLT Pipelines.
  • Implement Security Layer in Delta Lake.
  • Implement Massive Parallel Processing Layers in Spark SQL and PySpark.
  • Implement Cost effective Infrastructure in Databricks.
  • Experience In extracting logic and from on prem layers, SAP, ADLS into Pyspark / ADLS using ADF / Databricks.

Azure Synapse Analytics / Azure data Factory (ADF) :

  • Hands on Experience in Azure Synapse Analytics, Azure Data Factory and Data Bricks, Azure Storage, Azure Key Vault, SQL Pools CI / CD Pipeline Designing and other Azure services like functions, logic apps
  • Linked services, Various Runtimes, Datasets, Pipelines, Activities
  • Strong Hands on Experience in Various Activites like Control flow logic and conditions (For Each, if, switch, until), Lookup, Stored procedure, scripts, validations, Copy Data, Data flow, Azure functions, Notebooks, SQL Pool Stored procedures and etc
  • Strong hands on exp in deployment of code through out landscape (Dev ->

QA ->

Prod), Git Hub, CI / CD pipelines and etc

SQL Server stored procedures :

strong hands on creating the SQL stored procedures

  • Functions, Stored Procedures, how to call one SP into another, How to process record-by-record
  • Dynamic SQL

Python :

Must have strong background about the Python libraries like PySpark, Pandas, NumPy, pymysql, Oracle, Pyspark libraries

  • Must have strong hands on to get data through APIs
  • Must be able to install libraries and help users to troubleshoot issues
  • Must have knowledge to get the data through stored procedures via Python
  • Should be able to debug the Python code

Sparks :

  • Hands on experioence in Spark Pools, PySpark
  • Should be able to merge data / delta loads through Notebooks
  • Must have strong background about the Python libraries and PySpark
  • 11 days ago
Related jobs
Promoted
HCLTech
Toronto, Ontario
Full-time

Position. Data Engineer (Data Bricks) Location. Toronto, Ontario. Remote Skills Requirements. JD Data.. JD Data Bricks. Strong hands on in Pyspark and Apache Spark. Strong hands on in Medallion architecture..

New!
Tata Consultancy Services
Toronto, Ontario
Full-time

Skills Required. Azure Data Lake. Azure Data Factory (Primarily pipelines, data flows). Azure SQL.. Experience developing complex ETL mapping to extract data and create output files in multiple formats..

Promoted
New!
I-cube Software Llc
Toronto, Ontario
Full-time

The role. In collaboration with the Business Intelligence (BI) Manager, the Data Engineer will undertake.. This includes the design and implementation of the Data Lake, as well as the optimization of data..

Promoted
Procom Labs
Greater Toronto Area, Ontario
Full-time

The role. In collaboration with the Business Intelligence (BI) Manager, the Data Engineer will undertake.. This includes the design and implementation of the Data Lake, as well as the optimization of data..

Promoted
GalaxE.Solutions
Toronto, Ontario
Full-time

What You Will Do Work and develop in the Data Science and Data Engineering area Skills and Experience You Will Need Required 2 5 years of experience with the following..

Promoted
hireVouch
Toronto, Ontario
Full-time

Data Engineer. Reports to. Senior Manager Decision Support and Data Insights. Job Summary. We are.. This role will collaborate closely with our data scientist to enhance development practices, set up unit..