About the position
Capgemini is hiring a GCP Architect (GKE Lead) who will play a critical role in designing, implementing, and supporting cloud infrastructure and application deployments for the Merchandising team. This position requires hands-on expertise in Google Cloud Platform (GCP), a strong DevOps foundation, and close collaboration with cross-functional teams. We are open to candidates willing to work in a hybrid arrangement from any of the following locations: Chicago, Dallas, New York, New Jersey, Nashville, or Atlanta.
Responsibilities
• Configure log-based metrics such as error rate, latency, and memory usage, along with retention policies and export settings.
• Monitor health metrics including latency, error rate, CPU/memory utilization, and API response times.
• Track user traffic patterns and analyze system usage.
• Create unified dashboards in Looker Studio or Grafana to present consolidated views of multiple services.
• Integrate billing data, usage metrics, and error dashboards for holistic monitoring.
• Define Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs).
• Provide guidance and decision trees for choosing the appropriate GCP datastore (Cloud SQL, Cloud Spanner, Firestore, Bigtable, BigQuery) based on latency, consistency, schema flexibility, global scalability, and cost.
• Advise on choosing between Kubernetes and Platform-as-a-Service options (GKE, Cloud Run, App Engine, Cloud Functions), considering application complexity, control requirements, scalability, and team skill set.
Requirements
• Senior candidate with 10+ years of relevant experience
• Experience developing architecture patterns for web application frontends, RESTful APIs, backend services using Cloud SQL or Cloud Spanner, stateless app deployments utilizing GKE, Cloud Run, or App Engine, CI/CD integration points, caching layers such as Redis, and queue-based microservices with Pub/Sub or Cloud Tasks
• Set up comprehensive monitoring and logging solutions for Merchandising Apps, leveraging Cloud Monitoring, Logging, Error Reporting, and Trace.
• Implement centralized dashboards using Cloud Monitoring Workspace or Grafana with a BigQuery backend.
• Create log sinks to BigQuery or Pub/Sub for data processing and retention.
• Develop reusable Terraform modules for compute resources (GCE, Cloud Run, GKE), IAM policies, projects, service accounts, Cloud SQL and Spanner databases, VPCs, subnets, load balancers, and firewalls.
• Maintain GitOps workflows for Terraform.
• Design and deploy CI/CD pipelines using Cloud Build and GitHub Actions.
• Design and implement VPC and Shared VPC architectures.
• Configure and manage Load Balancers (Global, Regional, HTTPS, Internal).
• Set up VPC Service Controls to protect against data exfiltration.
Benefits
• Flexible work
• Healthcare including dental, vision, mental health, and well-being programs
• Financial well-being programs such as 401(k) and Employee Share Ownership Plan
• Paid time off and paid holidays
• Paid parental leave
• Family building benefits like adoption assistance, surrogacy, and cryopreservation
• Social well-being benefits like subsidized back-up child/elder care and tutoring
• Mentoring, coaching and learning programs
• Employee Resource Groups
• Disaster Relief