Data Engineer
- Job ID: 64935
- Category: Ford Credit Services
- Location: Chennai, India
- Work Type: Hybrid
We’re seeking an experienced GCP Data Engineer who can build cloud analytics platforms to meet ever-expanding business requirements with speed and quality using lean Agile practices. You will analyze and manipulate large datasets, activating data assets to support Enabling Platforms and Analytics on Google Cloud Platform (GCP). You will be responsible for designing the transformation and modernization on GCP, as well as landing data from source applications in GCP.
Experience with large-scale solutioning and operationalization of data warehouses, data lakes, and decentralized data architecture (such as Data Mesh) on GCP is a must. You will also play a key role in designing and building the foundational data infrastructure required to power advanced AI, Machine Learning (ML), and Generative AI initiatives across the enterprise. We are looking for candidates who have a broad set of technology skills across these areas and who can demonstrate an ability to design the right solutions with the appropriate combination of GCP and third-party technologies.
You will:
- Implement and champion Data Mesh principles, treating data as a product, establishing clear data contracts, and enabling self-serve data platform capabilities across business domains.
- Design, build, and maintain high-performance analytical data models using robust dimensional modeling principles (Star Schema, Snowflake Schema) to support enterprise BI, analytics and AI/ML models.
- Develop exceptional Analytics data products using streaming and batch ingestion patterns in GCP.
- Collaborate with Data Science and AI teams to architect robust MLOps pipelines and integrate Generative AI capabilities (e.g., LLMs, Vector Search) into data workflows.
- Be the Subject Matter Expert in Data Engineering, AI integrations, and GCP services.
- Demonstrate technical knowledge/leadership skills and advocate for technical excellence.
- Work in a collaborative environment including pairing and mobbing with other cross-functional engineers.
- Work on agile teams to build modern data warehouses and deliver data products.
- Work effectively with data engineers, product owners, data stewards, and other technical experts.
Experience Required:
- Experience in analyzing complex data, organizing raw data, and integrating massive datasets from multiple data sources to build domain-specific, reusable, and secure "data products" within a Data Mesh framework.
- Expertise in modern data modeling methodologies, specifically dimensional modeling (Star Schema, Snowflake Schema), and optimizing schemas for cloud-native data warehouses like BigQuery.
- Experience working in an implementation team from concept to operations, providing deep technical subject matter expertise for successful deployment. Implement methods for automation of all parts of the data pipeline to minimize labor in development and production.
- Experience working with architects to evaluate and productionalize appropriate GCP tools for data ingestion, integration, presentation, and reporting.
- Experience working with all stakeholders to formulate business problems as technical data requirements, identifying and implementing technical solutions while ensuring key business drivers are captured in collaboration with product management.
- Experience designing and deploying pipelines with automated data lineage, robust data quality frameworks, and data observability. Identify, develop, evaluate, and summarize Proof of Concepts to prove solutions. Test and compare competing solutions and report out a point of view on the best solution.
- Experience with data governance, cataloging, and access control integration (e.g., GCP Dataplex, GCP Data Catalog) in a decentralized environment.
- In-depth understanding of Google Cloud Platform and underlying architectures.
- 6+ years of Data engineering and analytics application development experience.
- Experience working with Google Cloud Platform (GCP) services: BigQuery, Dataflow, Dataform, Astronomer, Data Fusion, Dataproc, Cloud Composer/Airflow, Cloud SQL, Compute Engine, Cloud Functions, Cloud Run, Artifact Registry, GCP APIs, Cloud Build, App Engine, and Dataplex, as well as real-time data streaming platforms such as Apache Kafka and GCP Pub/Sub.
- 5+ years of SQL development experience (including advanced optimization and analytical queries).
- 2+ years of professional development experience in Java or Python, and Apache Beam.
- 2+ years of experience designing and building Tekton or similar CI/CD pipelines.
- Extracting, Loading, Transforming (ELT/ETL), cleaning, and validating data.
- Designing pipelines and architecture for data processing.
Additional Experience Preferred:
- Experience building and productionalizing Machine Learning and Generative AI solutions using Vertex AI (including Gemini, Model Garden, Vector Search, and Vertex AI Pipelines), TensorFlow, BigQuery ML, and AutoML.
- Proficient in Machine Learning model architecture, data pipeline interaction, and metrics interpretation. This includes designing, deploying, and monitoring end-to-end MLOps pipelines, managing feature stores, and orchestrating pipelines for Generative AI (including Retrieval-Augmented Generation (RAG) and LLM orchestration).
- Experience in building solution architecture, provisioning infrastructure (IaC via Terraform), and securing reliable, compliant, data-centric services and applications in GCP.
- Experience implementing data quality, lineage, and governance policies using Dataplex or similar tools in a self-serve platform environment.
- Experience with development ecosystems such as Git, Jenkins, and CI/CD.
- Advanced experience with Analytics Engineering tools like dbt (data build tool) or Dataform for modeling and transforming data within BigQuery.
- Experience working with Agile and Lean methodologies.
- Team player with strong attention to detail.
- Performance tuning experience (query optimization, partitioning, clustering, and cost management in BigQuery).
Education Required:
- Bachelor’s degree in computer science or related scientific field.
- IT or related associated topics: data architect, data center, data integrity, data manager, data management, data scientist, data warehousing, SQL, AI and ML.
Additional Education Preferred:
- GCP Professional Data Engineer or Professional Machine Learning Engineer certification.
- Master’s degree in computer science or related field.