Michael Vivirito
Lead Site Reliability Engineer · Enterprise Platforms Integration
Contingent Engineer (Managed Service Provider) · Top-tier technology company serving billions of users · Remote (United States) · 2022 - Present
$ cat ~/career/stats
tenure: 4 years
code_changes: 750+
tasks_completed: 500
applications_managed: 7+
team_size_led: 4 engineers
team_role: Lead SRE, Enterprise Platforms
Professional Summary
Results-driven Site Reliability Engineering Lead with extensive experience managing enterprise platform operations at a top-tier technology company serving billions of users. Proven track record of leading cross-functional initiatives spanning application lifecycle management, security compliance, zero-trust / VPN-less architecture, and AI-powered tooling. Skilled at translating complex technical challenges into actionable solutions by bridging engineering teams, stakeholders, and leadership. Currently leading a team of 4 engineers supporting third-party application integration, reliability, and automation across the client's enterprise infrastructure.
Engaged as a contingent engineer through a Managed Service Provider. Specific client name, internal-platform names, and third-party product names redacted out of respect for the client. Specifics happily shared under NDA on request.
Experience
Lead SRE, Enterprise Platforms Integration
Contingent Engineer via Managed Service Provider · Top-tier technology company · Remote · 2022 - Present
Team Leadership & Operations
- Lead a team of 4 SRE engineers responsible for the lifecycle management of third-party enterprise applications across the client's global infrastructure.
- Serve as Tier 2 escalation point for application lifecycle operations, coordinating incident response and resolution across multiple stakeholder groups.
- Manage NORAM office-hours rotation, providing real-time operational support and cross-functional troubleshooting for enterprise platform issues.
Enterprise Analytics: AI Agent Integration & Auth Infrastructure
- Architected and implemented a Thrift-based authentication proxy service for an enterprise data analytics platform, enabling secure integration with the client's internal AI agent platform.
- Delivered end-to-end implementation across 8+ changesets: Thrift IDL design, internal service-routing configuration, handler testing, client migration, and codegen compatibility fixes.
- Created a conversational-AI skillbook for analytics queries, enabling enterprise users to query dashboards through natural language.
- Authored user documentation and organized wiki resources to support team adoption of the new auth architecture.
- Resolved complex cross-tier authentication challenges, including bringing per-controller identity authentication into the async tier, coordinating with production-engineering teams across the company.
Zero-Trust / VPN-Less Architecture: Application Migration
- Led the migration of enterprise applications to the client's VPN-less infrastructure, reducing VPN dependency and improving access reliability for global users.
- Implemented forward-proxy filter configurations, DNS CNAME entries, and backend reverse-proxy service entries for an EHS / sustainability platform's zero-trust integration across production and dev environments.
- Enabled affiliate-user access to zero-trust enterprise endpoints, expanding secure access to the client's extended workforce.
Enterprise Endpoint Backup Platform Management
- Managed the full lifecycle of an enterprise endpoint backup solution across 15+ changesets spanning deployment, vendor rebranding, disaster recovery, and compliance operations.
- Led a major vendor rebranding migration: engineered the client-swap method including post-install scripts, identity-file preservation, incremental rollout (2% → 60%), and cross-platform uninstall automation (Windows and Mac).
- Built region-based deployment automation (EMEA / global) using packaged distribution and feature-flag-controlled rollouts.
- Developed a custom metrics pipeline: wrote a script controller to send organizational metrics to the client's internal time-series platform for real-time monitoring and capacity planning.
- Executed disaster-recovery operations including failover from DR replica to primary and secrets-management cluster migration fixes.
- Integrated with a third-party e-discovery platform via internal service calls for legal-hold compliance.
- Managed ad-hoc legal-matter operations and user-deployment scripting for Linux endpoints.
Legal-Hold Integration: Compliance Infrastructure
- Owned the legal-hold status agent infrastructure, managing rollout progression and cross-platform packaging.
- Migrated package ACLs to the client's internal application-platform services for improved security posture.
- Deprecated legacy controllers and cleaned up logging configurations as part of infrastructure modernization.
- Built and maintained internal package-build pipelines for cross-platform agent distribution.
Manufacturing Execution System
- Executed a major version upgrade for a production manufacturing-execution system.
- Led the full decommission effort: removed configuration-management cookbooks, deleted internal workflow configurations, and cleaned up DNS aliases, demonstrating disciplined infrastructure lifecycle management.
Recruiting Whiteboard Application: Internal Tool Development
- Fixed critical React bugs and XSS vulnerabilities in a real-time collaborative whiteboard application used in technical interviews.
- Managed the production container deployment pipeline for the recruiting whiteboard tool.
Developer Experience & Knowledge Sharing
- Published an internal engineering guide on developing client-platform applications directly in VS Code, enabling engineers across the company to develop locally with shareable preview URLs.
- Created comprehensive developer-setup documentation including yarn path configuration, npmrc proxy fixes, and per-app workflow guides.
- Built data pipelines using the client's internal data-pipeline platform for automated data-processing workflows.
Infrastructure & Cloud Operations
- Managed cloud infrastructure using Terraform, updating deprecated attributes and maintaining configuration as code.
- Configured ACLs on the client's internal key-value store, configuration-management cookbooks, and config-as-code settings across multiple enterprise services.
- Maintained DNS configurations, FQDN aliases, and service routing for enterprise application availability.
Prior Experience
Pre-MSP career, direct-hire roles spanning cloud migration, systems administration, and IT infrastructure across financial services, payments, and telecom.
Network Administrator II: Atlantic Pacific Processing Systems
Fountain Valley, CA · May 2021 - May 2022
- Led three concurrent cloud-migration projects in a rapidly expanding payments organization, consolidating workloads from Azure, Eukhost, and Linode into a primary AWS environment while maintaining 99% uptime.
- Authored the company-wide migration plan and executed workload migration via the AWS Migration Acceleration Program (MAP), saving 25% on newly migrated assets.
- Drove the AWS Control Tower implementation across multi-account production environments, delivered ~30% under budget and 7 days ahead of schedule.
- Built CI/CD pipelines with Atlassian Bamboo and self-service products in AWS Service Catalog to streamline developer workflows.
- Stood up an ECS-hosted internal operations dashboard, reducing tooling costs and improving maintainability.
- Orchestrated site-to-site VPN tunnels with the TSYS global payment processor for production card-processing connectivity.
- Upgraded on-site firewalls to Fortinet FortiGate 100F units and deployed Palo Alto Cortex XDR endpoint protection across workstations and production servers.
- Partnered with Accounting, Compliance, IT, and Risk teams to scope migrations, resolve outstanding issues, and quantify migration timelines based on data size and complexity.
Systems Administrator: AT&T / DirecTV
El Segundo, CA · March 2020 - May 2021
- Supported 50+ developers and testers; managed 600+ servers and workstations including patching, migration, and greenfield environment builds.
- Authored Ansible playbooks and CFEngine policies to configure and maintain workload-management systems, saved ~$12k/year via automated monitoring and cut ticket volume by 16%.
- Resolved 500+ tickets requiring reverse engineering and advanced troubleshooting across Python, PowerShell, vSphere, NetApp, and Grafana.
- Built Python and PowerShell automation tooling that empowered multiple internal teams.
IT Administrator: Centek Capital Group
Beverly Hills, CA · August 2015 - March 2020
- Stood up a Windows Server 2016 domain controller managing 15 desktop workstations, 99.9% uptime sustained for 5 consecutive years.
- Built and maintained a 12-node Debian Linux workstation network; learned Ansible on the job to manage the fleet at scale, reducing tickets 12%.
- Built custom PHP and Java line-of-business applications, pre-approval form generators, a physical-file tracking system, and internal monitoring tooling, recapturing ~$7k/year in time and materials.
- Took over and reverse-engineered a legacy C#.NET marketing application from the original developer; established maintenance practices and shipped several new features.
Key Achievements
Built and Operated an Enterprise Endpoint Backup Platform
374 code changes
What: Owned the full lifecycle of an enterprise endpoint backup solution protecting corporate devices across a global fleet. Built automated employee-transition workflows, international rollout infrastructure, secrets-management cluster with disaster recovery, data pipeline for user-lifecycle automation, and fleet-wide metric collection.
Scale: Fleet of tens of thousands of corporate devices (Mac / Windows) across dozens of countries.
Technologies: PHP / Hack, Python, Chef, Terraform, HashiCorp Vault, AWS EKS, data-pipeline orchestration, distributed graph databases, dynamic configuration, feature flags.
Result: Achieved reliable enterprise backup coverage across the global corporate device fleet with automated onboarding / offboarding, multi-region DR, and compliance-driven legal-hold integration.
Pioneered AI Tool Integration Across Four Internal AI Platforms
95+ code changes
What: Led the integration of an enterprise data analytics platform with four of the client's internal AI / ML agent platforms. Rewrote the tool server from Python to a typed server-side language for platform compatibility. Designed self-service authentication architecture replacing admin-managed credentials.
Result: Delivered working AI tool integration across all four platforms. Navigated a month-long platform outage by rewriting the entire server in a different language to bypass the blocked path. Established reusable patterns for the broader team.
Led Infrastructure Takeover of a Mission-Critical Recruiting Application
38 code changes
What: Completed full infrastructure takeover of a real-time collaborative whiteboard application used in technical interviews. Migrated from legacy VMs to Kubernetes with multi-datacenter high availability.
Scale: ~500 daily active users on mission-critical interview infrastructure.
Result: Deprovisioned 14 legacy VMs, removed 8+ unused load-balancer records, established comprehensive documentation. Reduced incident frequency from ~1 SEV / month through proactive monitoring. Completed one month ahead of schedule.
Built Legal-Hold Compliance Integration System
79 code changes
What: Designed and maintained a compliance system connecting legal-hold processes with enterprise backup infrastructure. Managed third-party e-discovery platform integration, custodian lifecycle syncing, and data-preservation workflows.
Result: Ensured continuous legal compliance for data-preservation requirements with automated custodian management, handling edge cases like employee-leave status and identity transitions.
Enabled Native AI Capabilities on an Enterprise Analytics Platform
What: Led the integration of a cloud AI service with an enterprise analytics platform, allowing users to use natural language to generate visualizations, create calculated fields, and build data flows.
Result: Successfully delivered AI-powered analytics capabilities to beta users with full project closure documentation and architecture diagrams for ongoing operations.
Career Timeline
2022: 78 code changes
Joined as an enterprise infrastructure engineer focused on endpoint backup solutions. Ramped quickly on the corporate device-management stack, building configuration-management cookbooks, API controllers, and package-management automation.
2023: 171 code changes
Expanded scope significantly. Took ownership of the legal-hold compliance integration, built secrets-management disaster-recovery infrastructure, and deepened cloud-operations expertise. Became an active knowledge-sharing contributor across engineering communities.
2024: 233 code changes
Peak productivity year. Broadened portfolio to include data-analytics platform management. Began managing multiple enterprise application environments and incident response. Drove major refactoring efforts to modernize legacy codebases.
2025: 196 code changes
Shifted toward higher-impact project leadership. Led analytics Tier 2 enablement, managed platform upgrades, and supported data-center infrastructure tooling. Took on the recruiting-whiteboard infrastructure takeover project.
2026: 75 code changes (YTD)
Transitioned into AI-focused technical leadership. Pioneered MCP tool integration across four internal AI platforms, completed recruiting-infrastructure takeover ahead of schedule, and established reusable AI-onboarding patterns for the broader team.
Technical Skills
Languages & Frameworks
- PHP / Hack (typed server-side language): Expert
- Chef / configuration management: Expert
- Python: Advanced
- Terraform / Infrastructure as Code: Advanced
- Shell scripting (Bash / Zsh): Advanced
- HCL (Terraform): Advanced
- Ruby (Chef DSL): Advanced
- JavaScript / React: Working
- Thrift IDL: Working
Infrastructure & Systems
- Kubernetes / Helm: Advanced (container CI / CD, multi-DC)
- AWS (EKS, IAM, multi-region): Advanced
- HashiCorp Vault (HA, DR, auto-unseal): Advanced
- Nginx, reverse proxies, DNS: Advanced
- Monitoring & observability: Intermediate
- Data pipelines / ETL: Intermediate
Domains
- Enterprise application lifecycle management
- Endpoint security & backup at scale
- Legal compliance & e-discovery integration
- AI / ML tool integration (MCP: Model Context Protocol)
- Incident response & SEV management
- Zero-trust / VPN-less architecture & secure access
Practices
- Infrastructure as Code
- Cross-platform fleet management
- Feature-flag-driven rollouts
- Documentation-first delivery
- Multi-datacenter HA architecture
Internal-Tooling Experience
The client operates a large internal-platform suite. Comfortable with their equivalents of: configuration-as-code platforms, service mesh / routing layers, container orchestration, time-series metrics & real-time analytics, CI / CD pipelines, feature-flag rollout systems, internal package management, GraphQL / graph services, and internal AI-agent / assistant platforms. Specific product names withheld; available on request.
Categories of Platforms Owned or Operated
Enterprise data analytics · enterprise endpoint backup · manufacturing execution systems · EHS / sustainability platforms · recruiting and collaborative-whiteboard tools · internal application-development platforms.
Architecture & Security
Zero-trust / VPN-less infrastructure · Thrift services · MCP (Model Context Protocol) · AI-agent integration · authentication-proxy design · legal hold & e-discovery · secrets-management migration · ACL management
Leadership & Soft Skills
- Cross-functional Mediation: Proven ability to align engineering teams, PMs, and stakeholders around solutions, even when not the deepest technical expert in the room.
- Problem Decomposition: Skilled at breaking complex, ambiguous problems into actionable work streams and guiding teams to resolution.
- Knowledge Sharing: Proactive documentation author and community contributor (internal engineering publications, wiki authoring, developer-workflow guides).
- Team Building: Experience managing direct reports, running operational rotations, and mentoring engineers across skill levels.
Selected Community & Documentation Contributions
- Authored an internal Tier 2 application-operations wiki resource.
- Authored an internal AI-enablement / MCP analysis portfolio document; drove participation in the company's AI initiatives; proposed AI-assistant integrations for the team's portfolio.
- Published an internal engineering guide on developing client-platform applications directly in VS Code with shareable preview URLs.
- Ran Kubernetes and analytics-platform learning sessions for engineering communities across the company.
Education & Certifications
Education
- Bachelor of Science, Computer Science: Western Governors University, 2022
- Associate of Arts, Computer Science / Math & Physics: West Los Angeles College, 2020
Certifications
Beyond the Day Job
Outside of the client engagement I run a self-built FreeBSD edge router and write about the infrastructure choices behind it. The deeper write-ups live in the homelab tour and the blog. I'm also building OpenWorld, a platform for proposing and collaborating on real-world infrastructure improvements.