sveska | AWS Fundamentals ELB, ASG

AWS Fundamentals ELB, ASG

[last update: 17 Sep 2020]

AWS Fundamentals: ELB, ASG

Scalability & High Availability

Scalability means that an application / system can handle greater loads by adapting.
There are two kinds of scalability: • Vertical Scalability • Horizontal Scalability (= elasticity)
Scalability is linked but different to High Availability
Let’s deep dive into the distinction, using a call center as an example
Vertical Scalability
Vertically scalability means increasing the size of the instance
For example, your application runs on a t2.micro
Scaling that application vertically means running it on a t2.large
Vertical scalability is very common for non distributed systems, such as a database.
RDS, ElastiCache are services that can scale vertically.
There’s usually a limit to how much you can vertically scale (hardware limit)
Horizontal Scalability
Horizontal Scalability means increasing the number of instances / systems for your application
Horizontal scaling implies distributed systems.
This is very common for web applications / modern applications
It’s easy to horizontally scale thanks the cloud offerings such as Amazon EC2
High Availability
High Availability usually goes hand in hand with horizontal scaling
High availability means running your application / system in at least 2 data centers (== Availability Zones)
The goal of high availability is to survive a data center loss
The high availability can be passive (for RDS Multi AZ for example)
The high availability can be active (for horizontal scaling
High Availability & Scalability For EC2
Vertical Scaling: Increase instance size (= scale up / down)
From: t2.nano - 0.5G of RAM, 1 vCPU
To: u-12tb1.metal – 12.3 TB of RAM, 448 vCPUs
Horizontal Scaling: Increase number of instances (= scale out / in)
Auto Scaling Group
Load Balancer
High Availability: Run instances for the same application across multi AZ
Auto Scaling Group multi AZ
Load Balancer multi AZ
Why use a load balancer?
Spread load across multiple downstream instances
Expose a single point of access (DNS) to your application
Seamlessly handle failures of downstream instances
Do regular health checks to your instances
Provide SSL termination (HTTPS) for your websites
Enforce stickiness with cookies
High availability across zones
Separate public traffic from private traffic
Why use an EC2 Load Balancer?
An ELB (EC2 Load Balancer) is a managed load balancer
AWS guarantees that it will be working
AWS takes care of upgrades, maintenance, high availability
Types of load balancer on AWS
AWS has 3 kinds of managed Load Balancers
Classic Load Balancer (v1 - old generation) – 2009
HTTP, HTTPS, TCP
Application Load Balancer (v2 - new generation) – 2016
HTTP, HTTPS, WebSocket
Network Load Balancer (v2 - new generation) – 2017
TCP, TLS (secure TCP) & UDP
Overall, it is recommended to use the newer / v2 generation load balancers as they provide more features
You can setup internal (private) or external (public) ELBs
Load Balancer Good to Know
LBs can scale but not instantaneously – contact AWS for a “warm-up”
Troubleshooting
4xx errors are client induced errors
5xx errors are application induced errors
Load Balancer Errors 503 means at capacity or no registered target
If the LB can’t connect to your application, check your security groups!
Monitoring
ELB access logs will log all access requests (so you can debug per request)
CloudWatch Metrics will give you aggregate statistics (ex: connections count)
AWS provides only a few configuration knobs
It costs less to setup your own load balancer but it will be a lot more effort on your end.
It is integrated with many AWS offerings / services
Health Checks
Health Checks are crucial for Load Balancers
They enable the load balancer to know if instances it forwards traffic to are available to reply to requests
The health check is done on a port and a route (/health is common)
If the response is not 200 (OK), then the instance is unhealthy
Classic Load Balancers (v1)

Client CLB EC2 listener internal
Supports TCP (Layer 4), HTTP & HTTPS (Layer 7)
Health checks are TCP or HTTP based
Fixed hostname XXX.region.elb.amazonaws.com
Application Load Balancer (v2)
Application load balancers is Layer 7 (HTTP)
Load balancing to multiple HTTP applications across machines (target groups)
Load balancing to multiple applications on the same machine (ex: containers)
Support for HTTP/2 and WebSocket
Support redirects (from HTTP to HTTPS for example)
Routing tables to different target groups:
Routing based on path in URL (example.com/users & example.com/posts)
Routing based on hostname in URL (one.example.com & other.example.com)
Routing based on Query String, Headers (example.com/users?id=123&order=false)
ALB are a great fit for micro services & container-based application (example: Docker & Amazon ECS)
Has a port mapping feature to redirect to a dynamic port in ECS
In comparison, we’d need multiple Classic Load Balancer per application
Application Load Balancer (v2)

Target Groups
EC2 instances (can be managed by an Auto Scaling Group) – HTTP
ECS tasks (managed by ECS itself) – HTTP
Lambda functions – HTTP request is translated into a JSON event
IP Addresses – must be private IPs
ALB can route to multiple target groups
Health checks are at the target group level
Application Load Balancer (v2)

Good to Know
Fixed hostname (XXX.region.elb.amazonaws.com)
The application servers don’t see the IP of the client directly
The true IP of the client is inserted in the header X-Forwarded-For
We can also get Port (X-Forwarded-Port) and proto (X-Forwarded-Proto)
Network Load Balancer (v2)
Network load balancers (Layer 4) allow to:
Forward TCP & UDP traffic to your instances
Handle millions of request per seconds
Less latency ~100 ms (vs 400 ms for ALB)
NLB has one static IP per AZ, and supports assigning Elastic IP (helpful for whitelisting specific IP)
NLB are used for extreme performance, TCP or UDP traffic
Not included in the AWS free tier
Load Balancer Stickiness
It is possible to implement stickiness so that the same client is always redirected to the same instance behind a load balancer
This works for Classic Load Balancers & Application Load Balancers
The “cookie” used for stickiness has an expiration date you control
Use case: make sure the user doesn’t lose his session data
Enabling stickiness may bring imbalance to the load over the backend EC2 instances
Cross-Zone Load Balancing
With Cross Zone Load Balancing: each load balancer instance distributes evenly across all registered instances in all AZ
Otherwise, each load balancer node distributes requests evenly across the registered instances in its Availability Zone only.
Cross-Zone Load Balancing
Classic Load Balancer
Disabled by default
No charges for inter AZ data if enabled
Application Load Balancer
Always on (can’t be disabled)
No charges for inter AZ data
Network Load Balancer
Disabled by default
You pay charges ($) for inter AZ data if enabled
SSL/TLS - Basics
An SSL Certificate allows traffic between your clients and your load balancer to be encrypted in transit (in-flight encryption)
SSL refers to Secure Sockets Layer, used to encrypt connections
TLS refers to Transport Layer Security, which is a newer version
Nowadays, TLS certificates are mainly used, but people still refer as SSL
Public SSL certificates are issued by Certificate Authorities (CA)
Comodo, Symantec, GoDaddy, GlobalSign, Digicert, Letsencrypt, etc…
SSL certificates have an expiration date (you set) and must be renewed
Load Balancer - SSL Certificates
The load balancer uses an X.509 certificate (SSL/TLS server certificate)
You can manage certificates using ACM (AWS Certificate Manager)
You can create upload your own certificates alternatively
HTTPS listener: • You must specify a default certificate • You can add an optional list of certs to support multiple domains • Clients can use SNI (Server Name Indication) to specify the hostname they reach • Ability to specify a security policy to support older versions of SSL / TLS (legacy clients)
SSL – Server Name Indication (SNI)
SNI solves the problem of loading multiple SSL certificates onto one web server (to serve multiple websites)
It’s a “newer” protocol, and requires the client to indicate the hostname of the target server in the initial SSL handshake
The server will then find the correct certificate, or return the default one Note:
Only works for ALB & NLB (newer generation), CloudFront
Does not work for CLB (older gen)
Elastic Load Balancers – SSL Certificates
Classic Load Balancer (v1)
Support only one SSL certificate
Must use multiple CLB for multiple hostname with multiple SSL certificates
Application Load Balancer (v2)
Supports multiple listeners with multiple SSL certificates
Uses Server Name Indication (SNI) to make it work
Network Load Balancer (v2)
Supports multiple listeners with multiple SSL certificates
Uses Server Name Indication (SNI) to make it work
ELB – Connection Draining
Feature naming:
CLB: Connection Draining
Target Group: Deregistration Delay (for ALB & NLB)
Time to complete “in-flight requests” while the instance is de-registering or unhealthy
Stops sending new requests to the instance which is de-registering
Between 1 to 3600 seconds, default is 300 seconds
Can be disabled (set value to 0)
Set to a low value if your requests are short
What’s an Auto Scaling Group?
In real-life, the load on your websites and application can change
In the cloud, you can create and get rid of servers very quickly
The goal of an Auto Scaling Group (ASG) is to:
Scale out (add EC2 instances) to match an increased load
Scale in (remove EC2 instances) to match a decreased load
Ensure we have a minimum and a maximum number of machines running
Automatically Register new instances to a load balancer
ASGs have the following attributes
A launch configuration
AMI + Instance Type
EC2 User Data
EBS Volumes
Security Groups
SSH Key Pair
Min Size / Max Size / Initial Capacity
Network + Subnets Information
Load Balancer Information
Scaling Policies
Auto Scaling Alarms
It is possible to scale an ASG based on CloudWatch alarms
An Alarm monitors a metric (such as Average CPU)
Metrics are computed for the overall ASG instances
Based on the alarm:
We can create scale-out policies (increase the number of instances)
We can create scale-in policies (decrease the number of instances)
Auto Scaling New Rules
It is now possible to define ”better” auto scaling rules that are directly managed by EC2
Target Average CPU Usage
Number of requests on the ELB per instance
Average Network In
Average Network Out
These rules are easier to set up and can make more sense
Auto Scaling Custom Metric
We can auto scale based on a custom metric (ex: number of connected users) • 1. Send custom metric from application on EC2 to CloudWatch (PutMetric API) • 2. Create CloudWatch alarm to react to low / high values • 3. Use the CloudWatch alarm as the scaling policy for ASG
ASG Brain Dump
Scaling policies can be on CPU, Network… and can even be on custom metrics or based on a schedule (if you know your visitors patterns)
ASGs use Launch configurations or Launch Templates (newer)
To update an ASG, you must provide a new launch configuration / launch template
IAM roles attached to an ASG will get assigned to EC2 instances
ASG are free. You pay for the underlying resources being launched
Having instances under an ASG means that if they get terminated for whatever reason, the ASG will automatically create new ones as a replacement. Extra safety!
ASG can terminate instances marked as unhealthy by an LB (and hence replace them)
Auto Scaling Groups – Scaling Policies
Target Tracking Scaling
Most simple and easy to set-up
Example: I want the average ASG CPU to stay at around 40%
Simple / Step Scaling
When a CloudWatch alarm is triggered (example CPU > 70%), then add 2 units
When a CloudWatch alarm is triggered (example CPU < 30%), then remove 1
Scheduled Actions
Anticipate a scaling based on known usage patterns
Example: increase the min capacity to 10 at 5 pm on Fridays
Auto Scaling Groups - Scaling Cooldowns
The cooldown period helps to ensure that your Auto Scaling group doesn’t launch or terminate additional instances before the previous scaling activity takes effect.
In addition to default cooldown for Auto Scaling group, we can create cooldowns that apply to a specific simple scaling policy
A scaling-specific cooldown period overrides the default cooldown period.
One common use for scaling-specific cooldowns is with a scale-in policy—a policy that terminates instances based on a specific criteria or metric. Because this policy terminates instances, Amazon EC2 Auto Scaling needs less time to determine whether to terminate additional instances.
If the default cooldown period of 300 seconds is too long—you can reduce costs by applying a scaling-specific cooldown period of 180 seconds to the scale-in policy.
If your application is scaling up and down multiple times each hour, modify the Auto Scaling Groups cool-down timers and the CloudWatch Alarm Period that triggers the scale in
Questions
Load Balancers provide a: static DNS name we can use in our application
You are running a website with a load balancer and 10 EC2 instances. Your users are complaining about the fact that your website always asks them to re-authenticate when they switch pages. You are puzzled, because it’s working just fine on your machine and in the dev environment with 1 server. What could be the reason?:The Load Balancer does not have stickiness enabled
Your application is using an Application Load Balancer. It turns out your application only sees traffic coming from private IP which are in fact your load balancer’s. What should you do to find the true IP of the clients connected to your website?:Look into the X-Forwarded-For header in the backend
You quickly created an ELB and it turns out your users are complaining about the fact that sometimes, the servers just don’t work. You realise that indeed, your servers do crash from time to time. How to protect your users from seeing these crashes?-Enable Health Checks
You are designing a high performance application that will require millions of connections to be handled, as well as low latency. The best Load Balancer for this is:Network Load Balancer
Application Load Balancers handle all these protocols except:TCP
The application load balancer can redirect to different target groups based on all these except:Client IP
You are running at desired capacity of 3 and the maximum capacity of 3. You have alarms set at 60% CPU to scale out your application. Your application is now running at 80% capacity. What will happen?:Nothing
I have an ASG and an ALB, and I setup my ASG to get health status of instances thanks to my ALB. One instance has just been reported unhealthy. What will happen?:The ASG will terminate the EC2 Instance
Your boss wants to scale your ASG based on the number of requests per minute your application makes to your database:You create a CloudWatch custom metric and build an alarm on this to scale your ASG
Running an application on an auto scaling group that scales the number of instances in and out is called:Horizontal Scalability
You would like to expose a fixed static IP to your end-users for compliance purposes, so they can write firewall rules that will be stable and approved by regulators. Which Load Balancer should you use?:ALB with Elastic IP is not technically feasible.Network Load Balancers expose a public static IP, whereas an Application or Classic Load Balancer exposes a static DNS (URL).
A web application hosted in EC2 is managed by an ASG. You are exposing this application through an Application Load Balancer. The ALB is deployed on the VPC with the following CIDR: 192.168.0.0/18. How do you configure the EC2 instance security group to ensure only the ALB can access the port 80?:Open up the EC2 security on port 80 to the ALB’s security group.This is the most secure way of ensuring only the ALB can access the EC2 instances. Referencing by security groups in rules is an extremely powerful rule and many questions at the exam rely on it. Make sure you fully master the concepts behind it!
Your application load balancer is hosting 3 target groups with hostnames being users.example.com, api.external.example.com, and checkout.example.com. You would like to expose HTTPS traffic for each of these hostnames. How do you configure your ALB SSL certificates to make this work?:SNI (Server Name Indication) is a feature allowing you to expose multiple SSL certs if the client supports it. Read more here: https://aws.amazon.com/blogs/aws/new-application-load-balancer-sni/
The Application Load Balancers target groups can be all of these EXCEPT:Network Load Balancer. Can be lambda or IP.
You are running an application in 3 AZ, with an Auto Scaling Group and a Classic Load Balancer. It seems that the traffic is not evenly distributed amongst all the backend EC2 instances, with some AZ being overloaded. Which feature should help distribute the traffic across all the available EC2 instances?:Cross Zone Load Balancing
Your Application Load Balancer (ALB) currently is routing to two target groups, each of them is routed to based on hostname rules. You have been tasked with enabling HTTPS traffic for each hostname and have loaded the certificates onto the ALB. Which ALB feature will help it choose the right certificate for your clients?Server Name Indication (SNI)
An application is deployed with an Application Load Balancer and an Auto Scaling Group. Currently, the scaling of the Auto Scaling Group is done manually and you would like to define a scaling policy that will ensure the average number of connections to your EC2 instances is averaging at around 1000. Which scaling policy should you use?:Target Tracking ????

AWS Fundamentals ELB, ASG

AWS Fundamentals: ELB, ASG

Scalability & High Availability

Vertical Scalability

Horizontal Scalability

High Availability

High Availability & Scalability For EC2

Why use a load balancer?

Why use an EC2 Load Balancer?

Types of load balancer on AWS

Load Balancer Good to Know

Health Checks

Classic Load Balancers (v1)

Application Load Balancer (v2)

Application Load Balancer (v2)

Application Load Balancer (v2)

Network Load Balancer (v2)

Load Balancer Stickiness

Cross-Zone Load Balancing

Cross-Zone Load Balancing

SSL/TLS - Basics

Load Balancer - SSL Certificates

SSL – Server Name Indication (SNI)

Elastic Load Balancers – SSL Certificates

ELB – Connection Draining

What’s an Auto Scaling Group?

ASGs have the following attributes

Auto Scaling Alarms

Auto Scaling New Rules

Auto Scaling Custom Metric

ASG Brain Dump

Auto Scaling Groups – Scaling Policies

Auto Scaling Groups - Scaling Cooldowns

Questions