Incident response
Recognizing common failure modes
Symptom: User can’t connect to desktop Possible causes: VM not running, VDA not registered with Citrix, network connectivity issue, Citrix Cloud issue.Triage decision tree
Escalation procedures
| Level | Who | When |
|---|---|---|
| Level 1: Team Lead / Senior Admin | Handle yourself | Single user issues, VM restarts, minor performance tuning, user account management |
| Level 2: Infrastructure Team | Escalate | Multiple users affected, suspected host hardware failure, network infrastructure involved, capacity planning needed |
| Level 3: MacStadium Support | Escalate | Orka Engine failures, host hardware failures, datacenter network failures, new host provisioning |
| Level 4: Citrix Support | Escalate | Citrix Cloud outage, VDA registration failures across all VMs, HDX protocol issues, Citrix policy problems |
Post-incident review template
Complete this for any incident affecting more than 10 users or lasting more than an hour.Change management
Pre-change checklists
Before any production change, verify:- Change window scheduled and communicated to users
- Backup/snapshot of current image state available
- Rollback plan documented and tested
- Testing completed in a non-production environment
- Required approvals obtained
- Monitoring in place to detect issues
- Team available for the duration of the change
- New golden image tested on at least one VM
- Citrix VDA registration verified
- HDX features tested (clipboard, file transfer, USB)
- Applications tested and functional
- Pilot group identified
- Previous image version retained for rollback
Testing procedures
For new golden images:- Deployment test: Deploy one VM, verify it boots within 3 minutes and gets network connectivity.
- VDA registration test: Check System Preferences → Citrix VDA shows “Registered”; verify VM appears as “Available” in Citrix Cloud Console.
- User connectivity test: Assign a test user, launch desktop from Citrix Workspace, verify connection.
- HDX feature test: Test clipboard, file transfer (if enabled), printing (if enabled), application launching.
- Application functionality test: Launch each business-critical application, perform a basic workflow, check for errors.
- Performance test: Measure login time (target: under 30 seconds), check CPU/memory at idle, check responsiveness during typical tasks.
Rollback plans
Write rollback procedures before starting any change. Example: Image update rollback If a new image causes issues within the first 24 hours:- Stop new deployments immediately.
- Revert affected VMs:
- Verify users can connect to rolled-back VMs.
- Document what went wrong for post-incident review.
- Revert the policy in Citrix Cloud Console: Policies → Select policy → Edit → Restore previous settings.
- Force policy refresh: have users log out and back in, or wait 30 minutes for automatic refresh.
Communication templates
Planned maintenance (send 3-5 business days in advance):Metrics and reporting
Key performance indicators
| KPI | Target | Measurement |
|---|---|---|
| Availability | 99.5% uptime during business hours | % of time VMs are registered and available in Citrix |
| Login time | under 30 seconds | Desktop launch to usable desktop |
| Session latency | under 100ms round-trip | HDX session latency |
| Frame rate | 30 FPS for typical office workloads | HDX session frame rate |
| Capacity utilization | 70–80% at peak | VMs in use / total VMs |
| Capacity headroom | 20–30% spare | Spare VMs / total VMs |
| Support ticket volume | Track trend | VDI-related tickets per month |
| User satisfaction | 4.0/5.0+ | Quarterly survey |
| Cost per user | Track trend | Total infrastructure cost / active users |
User satisfaction tracking
Quarterly survey questions:- Rate your overall satisfaction with the macOS virtual desktop (1–5)
- How often do you experience connectivity issues? (Never / Rarely / Sometimes / Often)
- How would you rate desktop performance for your daily tasks? (Poor / Fair / Good / Excellent)
- What applications or features would improve your experience?
- Any other feedback?
Cost analysis and optimization
Review quarterly:- Right-size VMs: Are all users on high-spec VMs when they only need basic?
- Eliminate unused capacity: VMs deployed but not assigned to users?
- Image efficiency: Unnecessary applications in golden images? Can you consolidate?
- Licensing: Citrix licenses for inactive users? Remove inactive accounts quarterly.
Quarterly business review outline
Present to leadership/stakeholders each quarter:- Service overview: Total users, total VMs, uptime %, support ticket trend
- Highlights: Major improvements, issues resolved, user feedback summary
- Challenges: Pain points, resource constraints, technical debt
- Roadmap: Upcoming improvements, capacity planning, technology upgrades
- Financials: Cost per user, budget vs. actual, cost optimization initiatives
Reference
Vendor contacts
| Vendor | Purpose | Contact | SLA |
|---|---|---|---|
| MacStadium | Host hardware, Orka Engine, network | support@macstadium.com | 1 business day response |
| Citrix Support | Citrix Cloud, VDA, licensing | support.citrix.com (1-800-424-8749) | Varies by license tier |
Compliance checklist
Review quarterly. Security:- VMs patched monthly (macOS updates)
- Citrix VDA is current (or within 2 releases)
- Access logging enabled in Citrix Cloud
- User access reviewed quarterly, inactive users offboarded
- Network segmentation enforced
- Registry credentials rotated every 90 days
- User data not stored on VMs (network storage only)
- Golden images backed up (at least 3 versions retained)
- Disaster recovery plan documented and tested annually
- VM deletion policy enforced (no orphaned VMs)
- Capacity headroom maintained (20–30% spare VMs)
- Monitoring in place for VM availability
- Change management process followed for all production changes
- Post-mortems completed for major outages
- Documentation kept current
- Chargeback reporting in place (if multi-tenant)
- Monthly cost tracking vs. budget
- Unused licenses identified and reclaimed
- Quarterly cost optimization review

