Design Decision: Autoscale Design

Overview

The goal of the document is to help answer FAQs on Autoscale to achieve best cost optimization. It provides guidance for configuring Autoscale for different admin use cases and their infrastructure and technical requirements. This document will keep evolving as more use cases and capabilities are introduced within Autoscale.

管理员可以配置自动定量第一排吗on-premises resources and only then on-demandburst to the cloud?

  • The Autoscaleselective power managementfeature is useful here. The admin must tag the cloud workloads within the delivery group that will be utilized for burst. Then configure Autoscale to power manage only these tagged workloads. The admin can then leverage zone preference to prefer on-premises workloads (which are always powered on) for incoming requests. After this configuration, incoming requests will first be brokered to on-premises resources until they are fully utilized. Cloud resources will be powered on only after the on-premises capacity is utilized.

Can an admin use their on-premises resources for normal operations and Cloud resources only in abusiness continuity or disaster recoveryscenario?

  • Use the preceding configuration

管理员可以配置为用户自动定量吗ramping up slowly / seasonal users / not worry about capacity planning upfront?

  • When an organization has users onboarding slowly and does not want to provision the entire capacity ahead of time, then thedynamic provisioningfeature with Autoscale can help. This feature constantly evaluates the load on the delivery group and creates or deletes machines accordingly. An admin must define the high and low watermark for load on the delivery group to create machines. Admins can optionally define the maximum number of machines to be provisioned and tags to be applied to these machines.

How can theadmin of an universitywith varying schedules and lectures during the day configure Autoscale?

  • The admin needs to understand the capacity utilization and lecture schedule and feed it into Autoscaleschedule-based scalingto step up and step-down capacity during the day. Autoscale allows admins to vary the powered-on capacity with 30 min granularity and allows for different schedules on different days. If volatile workload is expected, it is advised to configure the capacity buffer to avoid users having to wait for machines to power on.

How can the admin of anoutsourcing firm with agents logging in at fixed timingsconfigure Autoscale?

  • Utilizeschedule-based scalingdepending on the shift schedules. Autoscale has 30 mins granularity. Admins can configure Autoscale to power on different workloads at separate times of the day. If the number of users is unexpected, then admins also define the buffer capacity. Some platforms can take several minutes to switch on machines so do factor that while creating these schedules.

How can the admin configure Autoscale if they areusing heavy machine instances?

  • If the organization provides static desktops to its users, there might be a use case where the admin wants the machines to be started when the user clicks the desktop icon (say for knowledge workers who occasionally work on those large machines) but shut them down at a specific time of the day say at 6 PM. In this case the admin can define the propertyAutomaticPowerOnForAssignedto false.

How can an admin configure Autoscale to save storage costs apart from compute?

  • With dynamic provisioning machines can be created and deleted using Citrix Machine Creation Services. This provides additional storage cost savings by de-allocating the disk when the machine is not needed.

Infrastructure questions

How can the admin choose the number ofreserved instances (RI)to purchase?

  • Observe thecapacity utilizationdata under监控趋势> >机使用over a time period (that covers all variations such as seasonality, work force changes and so on.) in the environment. The minimum capacity that is powered on for this time period can be a good indication of the minimum reserved instances you might buy. Some other parameters that define this are:

    • Is there a fixed schedule say 9 AM - 5 PM? That schedule would mean machines can be shut down 66% of the time and achieve close to reserved instances cost savings of ~70%.
    • More RIs than the minimum capacity usage can be bought depending on workload usage and savings. For instance, say if there are 10% more machines running more than 50% of the time.
    • The organizations expansion plan must also be factored in.
  • Using Reserved Instances can save up to 70% costs compared to pay as you go machines over a three year duration.

How can the admin choose the number ofburstable instancesto purchase?

  • For virtual desktop users with high memory consumption (For instance applications such as Chrome browser, Office applications) but low CPU consumption, admins can consider using burstable VMs. These VMs might be half the cost of their peer machines (as is the case in Azure for B series and D series). In Citrix Director, check Resource utilization and average CPU consumption and match it with the base CPU% guarantee of the burstable VMs. It is recommended to use these machines for single session machines only. The reason for this is that the impact is limited compared to a multi-session machine where many users can be impacted if the CPU gets throttled.
  • This mechanism can be used to lower costs by ~50% from the regular pay as you go model.

How can the admin choose theright instance size?

  • Admins can optimize costs if they right size the machine instances in public clouds. Smaller instances host fewer user sessions than larger instances. Therefore, Autoscale powers off machines faster because it takes less time for the last user session to be logged off, thus reducing costs. Citrix recommends provisioning smaller instances if they match workload performance and capacity requirements. An effective way to understand this would be to have no more than 15-20 sessions per machine.

Whatscale limitscan the admin operate under?

  • It depends on the limits of the chosen cloud platform and the provisioning technique. For example, if the cloud platform is Azure and the provisioning technique is MCS (Machine Creation Services) then theguidanceis ~1200 machines per subscription.
  • It is also recommended to evaluate other limits like # of machines in a resource location, hosting connections and so on. Visit theLimits pagefor more info. Note that this is an evolving page.

Technical questions

How can the admin开始new machines based on session count?

  • Look at theload index. By default, it is based on session count (250) and is the factor considered for powering on more machines.

How does Autoscaleturn off machines?

  • Autoscale always attempts to scale down the number of powered-on machines in the delivery group to the configured pool size and capacity buffer. It does so by putting the excess machines with the fewest sessions into “drain state” and powering them off when all sessions are logged off. This occurs when session demand lessens, and the schedule requires fewer machines than are powered on.More info.

How does Autoscaleturn on machines?

  • It powers on machines based on the schedule and load-based settings. If the current machines cannot accommodate the incoming load and respect the capacity buffer configured, then it will power on more machines. See thisworking examplefor better understanding of how it works.

Where do I seemachines which are candidates for shutting down?

  • Just like maintenance mode, a list of machines in drain state is maintained. If a machine is in drain state, then it is a candidate to be shut down. Once all the sessions are drained off, the machine is shut down. To find out which machines are in drain state, use the PowerShell command: Get-BrokerMachine -DrainingUntilShutdown $true.More info

我怎么看到我的cost savings?

  • The监控趋势> >机使用page also displays the estimatedcost savingsachieved by enabling Autoscale in the selected delivery group. Both $ saved and % savings using Autoscale are displayed. These savings can be sent as monthly savings reports to higher management.
Design Decision: Autoscale Design