Al Calderon
(818)429-9902 | al@alcal.work |LinkedIn
(818)429-9902 | al@alcal.work |LinkedIn
Results-driven IT leader with extensive experience in customer support and managed cloud security, performing key Site Reliability Engineering (SRE) functions in high-stakes enterprise environments. Proven ability to enhance service reliability, lead cross-functional teams, and drive operational efficiency through strategic execution and automation.
Skilled in leveraging data-driven insights, KPIs, and real-time monitoring solutions to optimize performance and minimize downtime. Deep knowledge of ITIL Incident, Problem, and Change Management best practices, enabling proactive risk mitigation and streamlined incident response.
Experienced in supporting reliability efforts by collaborating with SRE teams to improve automation frameworks and observability solutions using tools such as Grafana, Power BI, and ServiceNow. Passionate about cross-functional collaboration, continuous improvement, and aligning technology with business goals to deliver impactful results and drive strategic success.
The 'Microscope,' as the team affectionately calls it, is a dashboard I built using Google App Script and Google Sheets. This script extracts data from various tables within our ticketing system, such as the 'ticket details' and 'assignment' tables, containing ticket details and timestamps, and uses query functions to organize it within a spreadsheet. The primary aim of the Microscope is to enable the team to monitor their progress against pre-established thresholds, other shifts, and historical data. This dashboard directly contributed to a 10% improvement in SLA adherence and a 13% improvement in incident response times.
I utilized Google App Script and Google Sheets to develop a management reporting tool that tracked the types of work (e.g., bug fixes, feature requests) being handled by individual team members. This enabled analysis of potential cherry-picking, identified skill gaps, and exposed an uneven distribution of work, revealing that some team members were handling 15% fewer bug fixes than others.
I built an API to automate the retrieval of CDN response time data from a third-party source. This data was then analyzed to benchmark our performance against the top 10 competitors, leading to the initiation of several projects focused on performance enhancement, including optimizing image compression and implementing caching improvements.
I developed an API to automate the collection of server status information across our global network. Using queries along with data aggregation functions, I calculated the percentage of servers that were in a 'Down For Repair' state per region and per data center. This reporting identified specific data center locations that were impacting overall network performance, leading to a 15% reduction in overall server downtime. This led to the launch of projects specifically aimed at resolving issues in all data center locations that exceeded 20% of their servers in a 'Down For Repair' state.
Developed and conducted comprehensive company-wide training sessions on:
Proactively developed and implemented workflows, processes, and best practices to enhance team performance. Established Key Performance Indicators (KPIs) to track progress, measure success, and identify improvement areas, with results shared with upper management to highlight team achievements. Implemented processes included:
Instrumental in establishing and scaling the Managed Cloud Security team, implementing essential security measures. Led initiatives for proactive log monitoring, rate limiting, and script execution workflows to mitigate DDoS attacks, significantly improving security posture and operational efficiency.
Successfully unified three diverse teams into a cohesive unit managing clients across various verticals. Implemented Key Performance Indicators (KPIs), driving significant improvements in response, escalation, and acknowledgment times while increasing overall case volume.
Al Calderon Resume (pdf)
DownloadWe use cookies to analyze website traffic and optimize your website experience. By accepting our use of cookies, your data will be aggregated with all other user data.