Software Developer
Software Developer, IBM Corporation, Austin, TX: Develop metrics as needed to measure characteristics of Cloud Infrastructure Services. Work with cross-service teams on the design and automation of generation and collection of metrics for status of cloud infrastructure services and their operation. Monitor metrics related to performance, availability, and stability of cloud infrastructure services. Assist in the creation of reports of these metrics using diverse data sources in cloud infrastructure operations. Develop systems and infrastructure for continuous and periodic collection and display of these metrics. Develop systems and infrastructure for periodic and on demand testing of cloud infrastructure service availability and performance. Develop dashboarding and alerting systems that consume generated metrics and integrate them with functions of cloud operations. Apply metrics related functionality to cover more areas within Cloud Infrastructure Services. Develop and operationalize data pipelines for AIOps (applying AI and data driven methods to IT operations) solutions. Implement metrics to gauge performance and usefulness of AIOps solutions. Work with Cloud AIOPS team getting involved in process of planning, designing and developing solutions to generate performance and stability metrics across Cloud Infrastructure Services such as availability based on provisioning workflow logs and customer impacting events from real customer data to be able to achieve better visibility of stability metric across all Cloud's infrastructure services and provide performance data to clients using platforms such as Spark, Kafka and zookeeper services. Lead monitoring system developing efforts using Grafana, Slack and Cloud Kubernetes platforms to support Cloud Infrastructure Operation team to achieve better visibility into customer experience through creating smart workload patterns using python, bash and terraform platforms to maintain reliability of the platform and maintain 99.99% availability of Cloud Infrastructure system. Utilize: Python, Automation, SQL, and Event Recognition. Required: Master’s degree or equivalent in Computer Science, Engineering or related (employer will accept a Bachelor's degree plus five (5) years of progressive experience in lieu of a Master’s degree) and one (1) year of experience as a Client Technical Specialist or related. One (1) year of experience must include utilizing Python, Automation, SQL, and Event Recognition. Please send resumes to recruitad@us.ibm.com. Applicants must reference K168 in the subject line.