Site Reliability Engineer
Description
Reviewing hardware, software infrastructure and application functionality for identifying and optimizing performance bottlenecks.
Developing and maintaining application services and driving incident management.
Designing and implementing comprehensive monitoring for applications, integrations and anomalies
Involved in implementation and rollout of high performance, large scale security platforms
Onboard, maintain expansive data pipelines for various security platforms
Involved in analysis and troubleshooting of security detections to minimize false positives and improve detection
Working closely with Cross functional IT organization, Business group, Apple's production support team, application engineers, systems engineers, database administrators and QA team to effectively ensure implementation and reliability of Platforms/Applications.
Minimum Qualifications
10+ years of experience in software engineering.
Hands on experience in at least one of the object oriented language. Preferably Java/JEE
Hands on experience with automation tools (e.g., Ansible, Terraform)
Strong programming and scripting fundamentals (Python/Bash/LUA)
Strong relational and Non-relation Database fundamentals with hands on PL/SQL experience.
Preferred Qualifications
Strong analytical skills
Java and JVM technologies runtime configurations and troubleshooting is a plus
Familiarity with Modern web services architectures, cloud platforms such as AWS, GCP, Azure and distributed storage systems (ScaleIO, Amazon S3)
Experience with monitoring and logging tools (e.g., Prometheus,Splunk, Grafana, Cloudwatch) is a plus
Understanding of CI/CD, Release Engineering and DevOps.
Good understanding of various machine language algorithms and patterns is desired
Cryptographic algorithms
In-depth experience in writing, understanding and reverse engineering regular expressions to detect patterns
Strong understanding of how TLS, mTLS and various industry standards for secure communications over the wire
Skilled in researching and understanding vulnerabilities and threats posted in open forum and translate them into system design/implementation to detect and prevent them