Software Engineer - Lakeflow PhD Candidates Mountain View, California
Company: Databricks Inc.
Location: Mountain View
Posted on: May 18, 2025
|
|
Job Description:
Databricks is radically simplifying the entire data lifecycle,
from ingestion to generative AI and everything in-between. We're
doing it cross-cloud with a unified platform, serving over 10k
customers, processing exabytes of data/day on 15+ million VMs, and
growing exponentially.The Lakeflow team is looking for recent PhD
graduates. The team includes products like Apache Spark Structured
Streaming, Delta Live Tables (DLT), and Materialized Views. Apache
Spark Structured Streaming is one of the world's most popular
streaming engines. DLT simplifies building and managing reliable
batch and streaming data pipelines on the Databricks Lakehouse
Platform, with features like declarative pipeline development,
automatic data testing, and deep monitoring and recovery
capabilities. DLT also optimizes pipeline execution through logical
and physical optimization techniques, including instance type
selection and autoscaling.Additionally, DLT features Eenzyme, a
catalyst optimization layer designed to accelerate ETL processes
and enable declarative ETL computation by incrementally computing
and materializing intermediate results. Eenzyme maintains
up-to-date materializations of query results stored in Delta tables
using a cost model that selects among various techniques inspired
by traditional materialized view maintenance, delta-to-delta
streaming, and manual ETL patterns.As part of the Lakeflow DLT
team, there are opportunities to innovate in areas such as:
#J-18808-Ljbffr
Keywords: Databricks Inc., Modesto , Software Engineer - Lakeflow PhD Candidates Mountain View, California, IT / Software / Systems , Mountain View, California
Click
here to apply!
|