Logo
Autodesk

Principal Software Engineer - Analytics Data

Autodesk, Snowflake, Arizona, United States, 85937

Save Job

**Job Requisition ID #**25WD91419**Position Overview****Responsibilities****Technical Leadership*** Define and drive the technical strategy for Batch Processing within ADP* Design and scale PySpark based distributed processing frameworks integrated with Airflow for orchestration* Champion Apache Iceberg-based data lakehouse adoption (versioning, branching, WAP, indexing)* Introduce performance optimization patterns for data pipelines (e.g., partition pruning, caching, resource tuning)* Establish patterns for metadata-driven, intent-based data processing aligned with AI-assisted pipelines**Architecture & Delivery*** Partner with architects to design multi-tenant, secure, and compliant (SOC2/GDPR/CCPA) batch processing services* Define reference architectures and reusable frameworks for cross-domain processing workloads* Lead technical reviews, solution designs, and architectural forums across ADP* Guide the evolution of Airflow APIs and SDKs to enable scalable pipeline self-service**Mentorship & Collaboration*** Mentor senior engineers, staff engineers, and tech leads within the Batch Processing and adjacent teams* Partner with the Batch Ingestion and Data Security teams to deliver unified ingestion + processing flows**Innovation & Impact*** Drive modernization initiatives to migrate away from legacy tools* Pioneer AI-augmented data engineering practices (e.g., pipeline optimization agents, anomaly detection)* Ensure scalability, cost-efficiency, and reliability for thousands of production pipelines across Autodesk* Influence company-wide data engineering strategy by contributing thought leadership and whitepapers**Minimum Qualifications*** 8+ years of experience in software/data engineering, with at least 3 years in big data platform leadership roles* Expert in distributed data processing (Spark, PySpark, Ray, Flink)* Deep experience with workflow orchestration (Airflow, Dagster, Prefect)* Strong hands-on expertise in Lakehouse technologies (Iceberg, Delta, Hudi) and cloud platforms (AWS/Azure/GCP)* Proven track record in architecting secure, multi-tenant, and compliant data platforms* Skilled in SQL/NoSQL databases, Snowflake, Glue, and modern metadata/catalog tools* Strong problem-solving, communication, and cross-geo collaboration skills* Experience mentoring engineers, building strong technical culture, and influencing at scale**Preferred Qualifications*** Exposure to AI/ML-driven data engineering (optimizers, anomaly detection, auto-scaling)* Experience with data governance, lineage, and observability tools (Atlan, Databand, Collibra, etc.)* Familiarity with streaming + batch hybrid architectures (Kappa/Lambda)#LI-KS2****Learn More******About Autodesk**Welcome to Autodesk! Amazing things are created every day with our software from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.We take great pride in our culture here at Autodesk its at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.**Salary transparency**Salary is one part of Autodesks competitive compensation package. Offers are based on the candidates experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.**Diversity & Belonging** We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here:Please search for open jobs and apply internally (not on this external site).We are seeking a highly experienced **Principal Engineer** to lead the design, development, and evolution of our **Batch Processing platform**, which powers Autodesks Analytics Data Platform (ADP). This role requires deep technical expertise in distributed data systems, large-scale pipeline orchestration, and hands-on leadership in shaping next-generation data platform capabilities. You will partner closely with Engineering Managers, Architects, Product teams, and Partner Engineering to modernize our data lakehouse architecture, deliver highly reliable data pipelines, and establish technical excellence across ingestion, processing, and governance. #J-18808-Ljbffr