
Cloud Observability Engineer -VICE PRESIDENT
- Job Req Id:
- 25892242
- Location(s):
- Irving, Texas, United States
- Job Type:
- Hybrid
- Posted:
- Oct. 22, 2025
Discover your future at Citi
Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.
Job Overview
As a Cloud Observability Engineer, you will be a critical part of our Cloud Technology team, responsible for designing, building, and maintaining the foundational observability platform and underlying infrastructure across our multi-cloud environment. You will empower development, operations, and SRE teams by providing the robust capabilities they need to generate and consume key metrics, logs, and traces. Your expertise will be instrumental in architecting and evolving the systems that enable proactive issue detection, rapid troubleshooting, and continuous improvement of our cloud platform's reliability and developer experience.
Key Responsibilities:
Design, build, and maintain the end-to-end observability platform and infrastructure, covering monitoring, logging, tracing, and alerting capabilities for cloud-native applications and infrastructure.
Select, configure, and optimize core observability tools and technologies (e.g., Prometheus, OpenTelemetry, cloud-native monitoring services like CloudWatch, Google Cloud Monitoring) to form a robust and scalable platform foundation.
Develop and maintain the frameworks, tooling, and automation that enable engineering teams to create, manage, and consume their own dashboards, alerts, and reports, providing real-time visibility into system performance, availability, and resource utilization.
Architect and implement highly scalable, reliable, and cost-effective data ingestion pipelines and storage solutions for metrics, logs, and traces.
Ensure the observability platform itself is highly available, performant, and resilient, including disaster recovery strategies and security best practices for observability data.
Develop and maintain internal applications and tools to provide operational visibility into the observability platform's health and performance, and to orchestrate its deployment, configuration, and ongoing management.
Automate the deployment, configuration, and ongoing lifecycle management of observability tools and infrastructure components using Infrastructure as Code (IaC) principles.
Implement and manage the underlying infrastructure and services for synthetic monitoring and real user monitoring (RUM).
Mentor junior engineers and contribute to the overall technical growth of the team.
Stay up-to-date with emerging observability trends, tools, and technologies.
Qualifications:
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
6+ years of experience in a dedicated Observability, Monitoring, SRE, or DevOps role with a strong focus on building and managing cloud environments.
Proven expertise with at least one major cloud provider (AWS or GCP preferred).
Deep understanding of monitoring concepts, metrics collection, log aggregation, and distributed tracing.
Extensive experience with architecting and implementing observability platforms and tools (e.g., Prometheus, OpenTelemetry, Fluentbit, OpAMP).
Proficiency in scripting and automation (e.g., Python, Go).
Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
Strong understanding of containerization technologies (Docker, Kubernetes) and their observability challenges.
Excellent problem-solving skills and the ability to diagnose complex technical issues across distributed systems.
Strong communication and collaboration skills.
Preferred Qualifications:
Experience with Kafka, PubSub, Kinesis or other message queuing systems.
Familiarity with serverless architectures (AWS Lambda, Google Cloud Functions).
Knowledge of security best practices in cloud environments.
Education:
Bachelor’s degree/University degree or equivalent experience
------------------------------------------------------
Job Family Group:
Technology------------------------------------------------------
Job Family:
Systems & Engineering------------------------------------------------------
Time Type:
Full time------------------------------------------------------
Primary Location:
Irving Texas United States------------------------------------------------------
Primary Location Full Time Salary Range:
$125,760.00 - $188,640.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
------------------------------------------------------
Most Relevant Skills
Please see the requirements listed above.------------------------------------------------------
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.------------------------------------------------------
Anticipated Posting Close Date:
Oct 28, 2025------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Global Benefits
Discover the top benefits offered to our global workforce, designed to support your well-being, growth and work-life balance. Explore a few of the highlights that make working with us rewarding.

Explore More Jobs
-
Vice President - Operations Core Project Lead Analyst (Oracle ERP)
- Chennai, Tamil Nadu
-
Third Party Information Security Assessor
- Tampa, Florida, Jacksonville, Florida, Irving, Texas, New Castle, Delaware
-
Technology Product Management Lead Analyst
- London, England
-
Structurer - C11 - SAO PAULO
- São Paulo, São Paulo
-
Early Careers Talent Network
Sign up to receive personalized job matches based on your skills and interests. We'll help you discover opportunities that align with your goals.
-
Career Professionals Talent Network
Sign up to receive tailored job matches based on your skills and experience. Discover opportunities that align with your ambitions.