• Fix support escalation issues: Optimize on-call rotations and processes - Improve system reliability through the optimization of on-call processes. Add automation and context to alerts – leading to better real-time collaborative response from on-call responders. Additionally, update runbooks, tools and documentation to help prepare on-call teams for future incidents.
• Document “tribal” knowledge - Gain exposure to systems in production, and take part in work with software development, support, IT operations and on-call duties – to build up historical knowledge over time. Instead of silo-ing this knowledge, ensure constant upkeep of documentation and runbooks to ensure that teams get the information they need right when they need it.
• Conducting post-incident reviews - Thorough and transparent post-incident reviews to keep teams honest and ensure that everyone is conducting post-incident reviews, documenting their findings and taking action on their learnings. Take action items for building or optimizing parts of the SDLC or incident lifecycle to bolster reliability of the service.
• Develop automation for mission critical applications using scripts, programs
• Provide customer impact analysis and troubleshoot complex issues using domain knowledge of AT&T Sales & Ordering flows, applications, and downstream interfaces
• Support APIs in K8s environment
• Contribute to design and implementation of new system layers utilizing principles of high-complexity compute environments.
• Provide on-call support for Production customer facing issues
• Work with developers, environment teams to identify necessary resources and remove constraints to increase application availability.
• 16 x 7 Second level trouble shooting of incidents for mission critical high-performance applications
Primary / Mandatory skills:Prod support (online / Digital apps) level 2/3, Azure , Kubernetes, Microservices Architecture, Java knowledge.
• Overall Experience: - 7+ experience performing Production Support for Mission Critical, high performance applications • 4+ years of experience using Docker, Kubernetes and Cloud environments preferably Azure • Strong experience in Unix, Networking and troubleshooting knowledge, Docker, Kubernetes and Cloud environments • Experience in Java Support, Python, Shell Scripts • Knowledge of Java, ReactJS, Spring & Spring Boot framework, microservices & RESTful API architecture
• Solid understanding and experience in Application Performance Monitoring tools like Dynatrace, AppDynamics, Introscope, etc.
• Experience with Log tools like Splunk, Kibana and Grafana (EFK stack experience preferred) • Experience with Site Reliability Engineering preferred
• Experience with automation, AI/ML is preferred • Experience with web-based applications, http, https, SSL/TLS
Secondary / Desired skills:
• Experience in Relational & NoSQL databases like Oracle & Cassandra. Excellent knowledge of SQL: Excellent written and verbal English communication skills to work in a Global team
• understanding of security principles
• Agile, Lean Agile and/or Scaled Agile methodologies • Familiarity with version control systems (Git, Bitbucket) and modern version control for use in continuous deployments
• Experience working in a large scale technically diverse organization • Knowledge about automated CI and CD pipelines using technologies such as Azure DevOps Server, Jenkins, Maven, Ansible, Chef, SonarQube, Puppet, etc
It is the policy of AT&T to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, AT&T will provide reasonable accommodations for qualified individuals with disabilities. AT&T is a fair chance employer and does not initiate a background check until an offer is made.
This one's for the grads and early careerists: Our leading internship and development program recruiters weigh in on how to prepare for and handle your interview.
Learn more
September 19, 2024ArticleCareer AdviceRelated Content
T&T’s India Development Centers (IDC) plays a pivotal role in AT&T’s connectivity strategy, and no one is better suited to speak to that importance more than Santosh Bijur, Vice President of the India Development Center
In our India Development Center (IDC), we’re building a talented technology team. By offering essential resources and the chance to work alongside industry leaders, our goal is to support the next generation of innovators in India.