We are seeking a Senior Data Engineer to lead a critical pilot project focused on modernizing our enterprise customer data consolidation from SQL Server to our Databricks-based data lake. This role combines traditional Oracle / PL / SQL expertise with modern PySpark development to support our East region initiative across three billing platforms.
Key Responsibilities
Primary Project (Enterprise Customer Table Pilot)
- Design and implement data consolidation solutions moving from SQL Server to Databricks data lake
- Work with business stakeholders and cross-functional teams to define enterprise customer table specifications
- Determine optimal approach for data processing - either within existing Oracle systems or in the data lake environment
- Collaborate with enterprise data lake team to leverage existing PySpark resources and infrastructure
- Produce modified data inputs for the new enterprise customer table consolidation process
- Ensure data quality and consistency across three different billing platform feeds
Secondary Pilot (Code Conversion)
Convert existing Oracle / PL / SQL code to PySpark for data lake processingEvaluate feasibility of migrating current data warehouse operations to PySparkProvide proof-of-concept for future large-scale migration initiativesTest and validate converted code performance in the data lake environmentTeam Development & Knowledge Transfer
Train and mentor existing PL / SQL team members on PySpark technologiesWork independently with minimal supervision while collaborating effectively with stakeholdersProvide technical leadership and architectural guidance for data processing solutionsDocument best practices and create knowledge base for future PySpark implementationsRequired Technical Skills
Core Requirements
10+ yearsof back-end development experienceExpert-level PL / SQL and Oracledatabase development7+ years of PySparkexperience with data lake implementationsStrong experience withDatabricksplatformProficiency indata modeling and schema designExperience withdata pipeline developmentCustom data warehouse development experiencePreferred Technologies
Delta LakeexperienceApache Airflowfor job scheduling and pipeline orchestrationGCP (Google Cloud Platform)our primary cloud environmentAWS or Azurecloud experience (transferable)Data warehouse and ETL / ELT processesExperience with enterprise-scale data integration projectsTechnical Environment
Current StackOracle-based custom data warehouse, PL / SQL processingTarget StackDatabricks data lake, PySpark, Delta Lake, GCPIntegration PointsThree separate billing systems, SQL Server consolidation layerData VolumeEnterprise-scale customer data across multiple regionsBusiness Context & Domain Knowledge
Support for multiple regions : East, Texas (largest), and PaneraIntegration challenges across three separate billing systems with different data formatsEnterprise-level customer data consolidation and reporting requirementsMigration from legacy SQL Server data warehouse to modern data lake architectureSales reporting and sales count reporting focusExperience with acquired company data integration challenges preferredRequired Competencies
Technical Leadership
Ability to analyze existing systems and recommend architectural improvementsExperience designing scalable data processing solutionsStrong debugging and troubleshooting skills across multiple platformsCode review and quality assurance capabilitiesBusiness Acumen
Understanding of enterprise data warehouse conceptsExperience with customer data management and consolidationKnowledge of sales reporting and business intelligence requirementsFamiliarity with multi-system integration challengesCommunication & Collaboration
Excellent stakeholder management skillsAbility to translate technical concepts to business usersExperience working with cross-functional teamsStrong documentation and knowledge sharing abilitiesWork Arrangement
Hybrid role3 days on-site (Monday, Tuesday, Thursday) in Houston, TX2 days remote workCandidates willing to relocate to Houston will be consideredOffice LocationHouston, Texas (specific location to be provided)