Staff Data Engineer
staff · version v1-1PTHgX
c4b8c2d8-8d84-4c16-937a-a5a282ef8c81
Competencies
weights sum to 1.00

| Name | ID | Weight | Definition |
|---|---|---|---|
| Distributed Systems & Data Platform Architecture | distributed_systems_architecture | 0.30 | Design and own high-throughput data pipelines (10TB+/day scale) with schema evolution, partitioning strategies, and lineage tracking. Make principled decisions on tool selection (Kafka, Flink, Iceberg) and justify cost/latency/reliability tradeoffs for 800+ enterprise customers. |
| SQL Performance & Query Optimization | sql_query_optimization | 0.20 | Read EXPLAIN plans, diagnose inefficient joins and aggregations, and rewrite queries for production workloads. Optimize warehouse costs by identifying and fixing expensive queries across analytics, ML features, and billing pipelines. |
| Technical Leadership & Mentorship | technical_leadership | 0.25 | Guide 4 mid-level engineers through system design decisions; review complex architecture docs and production handoffs. Lift team velocity by establishing data contracts, backfill strategies, and operational patterns that prevent costly incidents. |
| Streaming & Batch Processing Systems | streaming_batch_systems | 0.15 | Hands-on expertise with one or more stream processors (Flink, Spark, Beam) to build fault-tolerant, low-latency ingestion. Handle backfills and schema migrations without downtime in a production warehouse serving live analytics and billing. |
| Data Infrastructure Cost Optimization | cost_optimization | 0.10 | Identify and execute initiatives that reduce six-figure monthly warehouse spend through smarter partitioning, storage formats (Iceberg), and query patterns. Model infrastructure costs and tradeoffs between compute, storage, and latency. |
Scoring Weights
Adjust how much each competency contributes to the final score. Changes apply to all future evaluations for this job spec.
- distributed_systems_architecture: 30% (0.30)
- sql_query_optimization: 20% (0.20)
- technical_leadership: 25% (0.25)
- streaming_batch_systems: 15% (0.15)
- cost_optimization: 10% (0.10)
Total weight: 1.00 ✓ balanced
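The page does not say how the weights are combined into the final score. As a minimal sketch, assuming a plain weighted sum of per-competency scores on a 0 to 100 scale, the balance check and the aggregation might look like the following. The function names `weights_balanced` and `final_score` are hypothetical and not part of the product.

```python
import math

# Weights copied from the Scoring Weights section of this job spec.
WEIGHTS = {
    "distributed_systems_architecture": 0.30,
    "sql_query_optimization": 0.20,
    "technical_leadership": 0.25,
    "streaming_batch_systems": 0.15,
    "cost_optimization": 0.10,
}


def weights_balanced(weights, tol=1e-9):
    """Return True when the weights sum to 1.00 within a small tolerance."""
    return math.isclose(sum(weights.values()), 1.0, abs_tol=tol)


def final_score(scores, weights=WEIGHTS):
    """Assumed aggregation: weighted sum of per-competency scores (0-100).

    This is an illustrative guess at the formula, not the documented one.
    """
    if not weights_balanced(weights):
        raise ValueError("weights must sum to 1.00")
    missing = weights.keys() - scores.keys()
    if missing:
        raise KeyError(f"missing scores for: {sorted(missing)}")
    return sum(weights[name] * scores[name] for name in weights)


if __name__ == "__main__":
    # Hypothetical candidate scores, for illustration only.
    example = {
        "distributed_systems_architecture": 80,
        "sql_query_optimization": 70,
        "technical_leadership": 90,
        "streaming_batch_systems": 60,
        "cost_optimization": 50,
    }
    print(round(final_score(example), 2))
```

Under this assumption, changing a weight shifts how much that competency moves the final score, which is why the weights must stay balanced at 1.00.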
Interview questions
AI-generated from the JD · auto-published · edit anytime
3 currently · candidate sees this many
Behavioral round (Round 02)
Voice Q&A · pre-generated from the JD · edit anytime
4 prompts · Cal walks through each in order during Round 02
Live performance
Aggregated from candidates who took this job spec.
Interviews: 1 (0 completed)
Avg score: — out of 100
Completion: 0% (0 of 1)
Avg score per competency
Distributed Systems & Data Platform Architecture · weight 0.30 · 0.17 · n=6
SQL Performance & Query Optimization · weight 0.20 · 0.17 · n=6
Must-haves
- required: 8+ years in data engineering or distributed systems
- required: Production experience designing for schema evolution and backfills without downtime
- required: Hands-on expertise with at least one streaming or batch processing framework (Spark, Flink, Beam, or dbt-at-scale)
- required: Demonstrated track record of mentoring engineers and improving team output