<h1>Implementing Application Layer Encryption in Ruby on Rails applications with Asherah</h1>
<p><em>Dalibor Nasevic · 2023-05-23</em></p>
<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2023/05/23/application-layer-encryption-in-ruby-on-rails-with-asherah/">GoDaddy Engineering Blog</a>.</em></p>
<p style="text-align: center">
<img src="/images/rails-application-layer-encryption-asherah/cover.jpg" alt="Blue, red and black padlocks" />
</p>
<p>The public cloud revolutionized the way we store and access data, but it also introduced new security challenges: because resources and infrastructure are shared among multiple users, there is a risk of unauthorized access and data breaches. When we migrate our web services to the public cloud, in addition to storage layer data encryption and end-to-end encryption in transit, we implement application-layer encryption to protect sensitive customer data such as Personally Identifiable Information (PII). This article explores how the <a href="https://github.com/godaddy/asherah-ruby">Asherah</a> application encryption SDK works and how we encrypt PII data in our Ruby on Rails applications.</p>
<h2 id="what-is-application-layer-encryption-and-why-do-we-need-it">What is Application Layer Encryption and why do we need it?</h2>
<p>Application Layer Encryption is the practice of encrypting data within the application that received or generated it. The data is encrypted before it is transported over a network or saved to a database, so the plaintext exists only within the application’s memory space. This differs from storage layer encryption, which protects data stored in a database when the server is powered off or the storage media is stolen. However, once the database server is running and authorized users or applications access the data, encryption at the storage layer alone is not sufficient to protect it.</p>
<h2 id="what-is-asherah-and-how-does-it-work">What is Asherah and how does it work?</h2>
<p>Asherah is an <a href="https://github.com/godaddy/asherah">application-layer encryption SDK</a> developed by GoDaddy that uses envelope encryption and has a hierarchical data encryption model. At the top of the hierarchy, the master key is managed by a Hardware Security Module (HSM) or <a href="https://github.com/godaddy/asherah/blob/master/docs/KeyManagementService.md#aws-kms">Key Management Service (KMS)</a>. Below that, there are system and intermediate keys. At the lowest level, there are data row records that represent the individual encrypted rows.</p>
<p><img src="/images/rails-application-layer-encryption-asherah/key_hierarchy.png" alt="Key Hierarchy" /></p>
<p>The following is a brief overview of how the data and encrypted keys are stored at the data layer using a few sample data structures to illustrate the encryption pattern.
Note: Go to the <a href="https://github.com/godaddy/asherah/blob/master/docs/DesignAndArchitecture.md">Asherah design and architecture page</a> for more information.</p>
<p>Let’s say we have PII data that we want to encrypt, starting at the row level (or in Ruby on Rails terminology, at the model level). The Asherah SDK generates a data row key to encrypt that row’s data. The final payload stored at the row level is called the data row record. It contains a reference to its parent key, the intermediate key, which is used to encrypt the data row key:</p>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"Data"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(encrypted_data)>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"Key"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"Created"</span><span class="p">:</span><span class="w"> </span><span class="mi">1534553138</span><span class="p">,</span><span class="w">
</span><span class="nl">"Key"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(encrypted_key)>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"ParentKeyMeta"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"KeyId"</span><span class="p">:</span><span class="w"> </span><span class="s2">"_IK_123_marketing_email"</span><span class="p">,</span><span class="w">
</span><span class="nl">"Created"</span><span class="p">:</span><span class="w"> </span><span class="mi">1534553075</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
<p>Asherah generates an intermediate key unless one already exists for the given partition. Partitions create a distinct chain of encryption keys and are a way to isolate the encrypted data and limit the blast radius. Usually, we choose the primary resource id for a partition id (i.e., <code class="language-plaintext highlighter-rouge">user_id</code>). The intermediate key envelope points to its parent key (the system key):</p>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"Id"</span><span class="p">:</span><span class="w"> </span><span class="s2">"_IK_123_marketing_email"</span><span class="p">,</span><span class="w">
</span><span class="nl">"Created"</span><span class="p">:</span><span class="w"> </span><span class="mi">1534553075</span><span class="p">,</span><span class="w">
</span><span class="nl">"Key"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(encrypted_key)>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"ParentKeyMeta"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"KeyId"</span><span class="p">:</span><span class="w"> </span><span class="s2">"_SK_marketing_email"</span><span class="p">,</span><span class="w">
</span><span class="nl">"Created"</span><span class="p">:</span><span class="w"> </span><span class="mi">1534553054</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
<p>Asherah generates a system key unless an unexpired one already exists. By default, system keys have a lifespan of 90 days, after which Asherah generates a new key; this also triggers the creation of new intermediate keys. The <code class="language-plaintext highlighter-rouge">key_meta</code> embedded in the system key envelope identifies the master key used to encrypt it.</p>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"Id"</span><span class="p">:</span><span class="w"> </span><span class="s2">"_SK_marketing_email"</span><span class="p">,</span><span class="w">
</span><span class="nl">"Created"</span><span class="p">:</span><span class="w"> </span><span class="mi">1534553054</span><span class="p">,</span><span class="w">
</span><span class="nl">"Key"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(key_meta)>"</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
<p>The parent key of a system key can be either:</p>
<ul>
<li>A static key (used for testing only), or</li>
<li>An HSM or KMS</li>
</ul>
<p>When using AWS KMS, Asherah first generates a data key with it. This data key is the master key used to encrypt the system keys. The data key itself is encrypted by the KMS and stored in the <code class="language-plaintext highlighter-rouge">encryptedKek</code> field. During a decrypt operation, the KMS first decrypts the data key, which in turn decrypts the system key. The system key then decrypts the intermediate key, and the intermediate key decrypts the data row key. The data key is encrypted with KMS keys in multiple AWS regions to support fallback when a region is unavailable.</p>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"encryptedKey"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(encrypted_key)>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"kmsKeks"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"region"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<aws_region>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"arn"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<arn>"</span><span class="p">,</span><span class="w">
</span><span class="nl">"encryptedKek"</span><span class="p">:</span><span class="w"> </span><span class="s2">"<base64(key_encrypted_key)>"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="err">...</span><span class="w">
</span><span class="p">]</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
<p>The default cipher that Asherah uses for encryption is AES-256-GCM.</p>
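<p>To make the envelope pattern concrete, here is a minimal, hypothetical sketch of envelope encryption with AES-256-GCM using Ruby’s OpenSSL bindings. The helper names are ours for illustration only and are not part of the Asherah API:</p>

```ruby
require 'openssl'

# Encrypt plaintext with AES-256-GCM, returning everything needed to decrypt.
def aes256gcm_encrypt(key, plaintext)
  cipher = OpenSSL::Cipher.new('aes-256-gcm').encrypt
  cipher.key = key
  iv = cipher.random_iv
  data = cipher.update(plaintext) + cipher.final
  { iv: iv, data: data, tag: cipher.auth_tag }
end

def aes256gcm_decrypt(key, payload)
  cipher = OpenSSL::Cipher.new('aes-256-gcm').decrypt
  cipher.key = key
  cipher.iv = payload[:iv]
  cipher.auth_tag = payload[:tag]
  cipher.update(payload[:data]) + cipher.final
end

# Envelope encryption: a random data row key encrypts the row data,
# and a parent (intermediate) key encrypts the data row key itself.
intermediate_key = OpenSSL::Random.random_bytes(32)
data_row_key     = OpenSSL::Random.random_bytes(32)

encrypted_data = aes256gcm_encrypt(data_row_key, 'user@example.com')
encrypted_key  = aes256gcm_encrypt(intermediate_key, data_row_key)

# Decryption walks back up the chain: recover the data row key first,
# then use it to recover the row data.
recovered_key = aes256gcm_decrypt(intermediate_key, encrypted_key)
plaintext     = aes256gcm_decrypt(recovered_key, encrypted_data)
```

<p>In Asherah the same pattern repeats one level up: the intermediate key is itself encrypted by the system key, and the system key by the master key.</p>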
<h2 id="why-not-use-aws-kms-directly">Why not use AWS KMS directly?</h2>
<p>You might wonder why we don’t use AWS KMS directly for each encrypt and decrypt operation. We could, but consider the following:</p>
<ul>
<li>Performance - each encrypt or decrypt call to AWS KMS adds network latency.</li>
<li>Pricing - AWS KMS costs grow with every call, so we want to cache system and intermediate keys in memory and minimize KMS requests.</li>
</ul>
<h2 id="what-is-a-secure-memory">What is Secure Memory?</h2>
<p>Asherah implements <a href="https://github.com/godaddy/asherah/blob/master/docs/Internals.md#secure-memory">Secure Memory</a> to safely generate, store, and cache encryption keys. By using a secure memory heap, it guards against leaking secrets through swapping, core dumps, debugger memory scans, and CPU vulnerabilities like Spectre. A secure memory heap is not part of the language’s managed memory, but it can be implemented with a few well-known native calls.</p>
<p>To allocate secure memory, the following steps must be performed:</p>
<ul>
<li>check memory lock limit (getrlimit)</li>
<li>allocate memory (mmap)</li>
<li>disable swap (mlock)</li>
<li>disable core dumps (madvise)</li>
<li>write secret bytes to memory location</li>
<li>set no access (mprotect)</li>
<li>wipe secret bytes from managed memory</li>
</ul>
<p>To read from secure memory, the following steps must be performed:</p>
<ul>
<li>change memory address to read-only mode (mprotect)</li>
<li>read secret bytes from memory location</li>
<li>change memory address to no access (mprotect)</li>
<li>encrypt or decrypt with the secret</li>
<li>wipe secret bytes from managed memory</li>
</ul>
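<p>As a rough illustration of a few of the native calls listed above, here is a hypothetical Ruby sketch using Fiddle on Linux/macOS. It only demonstrates the lock, write, wipe, and unlock steps; Asherah’s real Secure Memory also uses <code class="language-plaintext highlighter-rouge">mmap</code>, <code class="language-plaintext highlighter-rouge">madvise</code>, and <code class="language-plaintext highlighter-rouge">mprotect</code>, which are omitted here for brevity:</p>

```ruby
require 'fiddle'

# check memory lock limit (getrlimit)
soft_limit, _hard_limit = Process.getrlimit(:MEMLOCK)

# Resolve mlock/munlock from libc in the current process.
libc    = Fiddle.dlopen(nil)
mlock   = Fiddle::Function.new(libc['mlock'],   [Fiddle::TYPE_VOIDP, Fiddle::TYPE_SIZE_T], Fiddle::TYPE_INT)
munlock = Fiddle::Function.new(libc['munlock'], [Fiddle::TYPE_VOIDP, Fiddle::TYPE_SIZE_T], Fiddle::TYPE_INT)

secret = 'super-secret-key-bytes'
buffer = Fiddle::Pointer.malloc(secret.bytesize)

mlock.call(buffer, secret.bytesize)                   # disable swap for this region
buffer[0, secret.bytesize] = secret                   # write secret bytes to memory location
# ... encrypt or decrypt with the secret ...
buffer[0, secret.bytesize] = "\0" * secret.bytesize   # wipe secret bytes
munlock.call(buffer, secret.bytesize)
```

<p>The key point is that the secret lives in native memory the runtime never copies or swaps, and it is zeroed as soon as it is no longer needed.</p>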
<h2 id="how-to-use-asherah-in-ruby-on-rails-applications">How to use Asherah in Ruby on Rails applications</h2>
<p><a href="https://github.com/godaddy/asherah-ruby">Asherah-Ruby</a> is a Ruby FFI wrapper around the <a href="https://github.com/godaddy/asherah/tree/master/go">Asherah Go</a> implementation of the application-layer encryption SDK. The Asherah Go implementation is exposed to Ruby via the <a href="https://github.com/godaddy/asherah-cobhan/blob/main/libasherah.go">asherah-cobhan’s Go wrapper</a> and compiled to a native shared library with <a href="https://pkg.go.dev/cmd/cgo">Cgo</a>. Currently supported platforms for Asherah Ruby are Linux and Darwin operating systems for x64 and ARM64 CPU architectures.</p>
<p>To configure the Asherah library in a Ruby on Rails application, we must first install the <a href="https://rubygems.org/gems/asherah">Asherah</a> gem. After installing the gem, we need to create the following migration for the <code class="language-plaintext highlighter-rouge">encryption_key</code> table to store the system and intermediate keys. Asherah supports MySQL and DynamoDB <a href="https://github.com/godaddy/asherah/blob/master/docs/Metastore.md">metastores</a>, and can be extended to support additional adapters. For our test, we will use MySQL.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">CreateEncryptionKey</span> <span class="o"><</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Migration</span><span class="p">[</span><span class="mf">7.0</span><span class="p">]</span>
<span class="k">def</span> <span class="nf">up</span>
<span class="n">execute</span><span class="p">(</span><span class="s2">"
CREATE TABLE encryption_key (
id VARCHAR(255) NOT NULL,
created TIMESTAMP NOT NULL DEFAULT CURRENT_TIMESTAMP,
key_record TEXT NOT NULL,
PRIMARY KEY (id, created),
INDEX (created)
);
"</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">down</span>
<span class="n">drop_table</span> <span class="ss">:encryption_key</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>We have to create an initializer to configure Asherah. To do so, we set the <code class="language-plaintext highlighter-rouge">service_name</code> and <code class="language-plaintext highlighter-rouge">product_id</code> used for key naming. We configure the <code class="language-plaintext highlighter-rouge">metastore</code> and its <code class="language-plaintext highlighter-rouge">connection_string</code> for key storage. We need a <code class="language-plaintext highlighter-rouge">connection_string</code> separate from the default Active Record connection because Asherah Go manages its own connection for writing and reading the encrypted keys. Then we enable <code class="language-plaintext highlighter-rouge">enable_session_caching</code> for performance and specify the <code class="language-plaintext highlighter-rouge">kms</code> details. We use a static key in the development and test environments, and the AWS KMS service in production. Here is the Asherah configuration:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Asherah</span><span class="p">.</span><span class="nf">configure</span> <span class="k">do</span> <span class="o">|</span><span class="n">config</span><span class="o">|</span>
<span class="n">config</span><span class="p">.</span><span class="nf">service_name</span> <span class="o">=</span> <span class="s1">'marketing'</span>
<span class="n">config</span><span class="p">.</span><span class="nf">product_id</span> <span class="o">=</span> <span class="s1">'email'</span>
<span class="n">config</span><span class="p">.</span><span class="nf">metastore</span> <span class="o">=</span> <span class="s1">'rdbms'</span>
<span class="n">config</span><span class="p">.</span><span class="nf">enable_session_caching</span> <span class="o">=</span> <span class="kp">true</span> <span class="c1"># default: false</span>
<span class="n">c</span> <span class="o">=</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Base</span><span class="p">.</span><span class="nf">connection_db_config</span><span class="p">.</span><span class="nf">configuration_hash</span>
<span class="n">config</span><span class="p">.</span><span class="nf">connection_string</span> <span class="o">=</span> <span class="s2">"</span><span class="si">#{</span><span class="n">c</span><span class="p">[</span><span class="ss">:username</span><span class="p">]</span><span class="si">}</span><span class="s2">:</span><span class="si">#{</span><span class="n">c</span><span class="p">[</span><span class="ss">:password</span><span class="p">]</span><span class="si">}</span><span class="s2">@tcp(</span><span class="si">#{</span><span class="n">c</span><span class="p">[</span><span class="ss">:host</span><span class="p">]</span><span class="si">}</span><span class="s2">:</span><span class="si">#{</span><span class="n">c</span><span class="p">[</span><span class="ss">:port</span><span class="p">]</span><span class="si">}</span><span class="s2">)/</span><span class="si">#{</span><span class="n">c</span><span class="p">[</span><span class="ss">:database</span><span class="p">]</span><span class="si">}</span><span class="s2">"</span>
<span class="k">if</span> <span class="no">ENV</span><span class="p">[</span><span class="s1">'ASHERAH_KMS_ENABLED'</span><span class="p">]</span> <span class="o">==</span> <span class="s1">'true'</span>
<span class="n">config</span><span class="p">.</span><span class="nf">kms</span> <span class="o">=</span> <span class="s1">'aws'</span>
<span class="n">config</span><span class="p">.</span><span class="nf">preferred_region</span> <span class="o">=</span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s1">'AWS_REGION'</span><span class="p">)</span>
<span class="n">config</span><span class="p">.</span><span class="nf">region_map</span> <span class="o">=</span> <span class="p">{</span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s1">'AWS_REGION'</span><span class="p">)</span> <span class="o">=></span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s1">'KMS_KEY_ARN'</span><span class="p">)</span> <span class="p">}</span>
<span class="k">elsif</span> <span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="p">.</span><span class="nf">development?</span> <span class="o">||</span> <span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="p">.</span><span class="nf">test?</span>
<span class="n">config</span><span class="p">.</span><span class="nf">kms</span> <span class="o">=</span> <span class="s1">'static'</span> <span class="c1"># The static key used for encryption is `thisIsAStaticMasterKeyForTesting` (defined in Asherah Go)</span>
<span class="k">else</span>
<span class="k">raise</span> <span class="s2">"Asherah client not configured for: </span><span class="si">#{</span><span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Once we have all that set, we can call the <code class="language-plaintext highlighter-rouge">encrypt</code> and <code class="language-plaintext highlighter-rouge">decrypt</code> operations with Asherah:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">partition_id</span> <span class="o">=</span> <span class="s1">'user_1'</span>
<span class="n">data</span> <span class="o">=</span> <span class="s1">'user@example.com'</span>
<span class="n">encrypted_data</span> <span class="o">=</span> <span class="no">Asherah</span><span class="p">.</span><span class="nf">encrypt</span><span class="p">(</span><span class="n">partition_id</span><span class="p">,</span> <span class="n">data</span><span class="p">)</span>
<span class="n">decrypted_data</span> <span class="o">=</span> <span class="no">Asherah</span><span class="p">.</span><span class="nf">decrypt</span><span class="p">(</span><span class="n">partition_id</span><span class="p">,</span> <span class="n">encrypted_data</span><span class="p">)</span>
</code></pre></div></div>
<h2 id="how-to-integrate-asherah-in-ruby-on-rails-models">How to integrate Asherah in Ruby on Rails models</h2>
<p>In Ruby on Rails models, we frequently use open schema columns of type <code class="language-plaintext highlighter-rouge">text</code> and leverage <a href="https://api.rubyonrails.org/classes/ActiveRecord/Store.html">ActiveRecord::Store</a> with JSON serialization. That way, we store data without having to run migrations for each new column we add. We’ll start by creating the table <code class="language-plaintext highlighter-rouge">users</code> with text column <code class="language-plaintext highlighter-rouge">params</code> to store personally identifiable information like <code class="language-plaintext highlighter-rouge">name</code> and <code class="language-plaintext highlighter-rouge">email</code>. Let’s create the migration:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">CreateUsers</span> <span class="o"><</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Migration</span><span class="p">[</span><span class="mf">7.0</span><span class="p">]</span>
<span class="k">def</span> <span class="nf">change</span>
<span class="n">create_table</span> <span class="ss">:users</span> <span class="k">do</span> <span class="o">|</span><span class="n">t</span><span class="o">|</span>
<span class="n">t</span><span class="p">.</span><span class="nf">text</span> <span class="ss">:params</span>
<span class="n">t</span><span class="p">.</span><span class="nf">timestamps</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Each model that implements application layer encryption needs to include the <code class="language-plaintext highlighter-rouge">DataEncryption</code> module we’ll define below. This module defines the <code class="language-plaintext highlighter-rouge">data_encryption</code> method used to specify the encrypted attributes (<code class="language-plaintext highlighter-rouge">name</code> and <code class="language-plaintext highlighter-rouge">email</code>) and how we reference them from the model. For the <code class="language-plaintext highlighter-rouge">partition_id</code>, we use the <code class="language-plaintext highlighter-rouge">global</code> value, but if we had a parent account model, we could partition by the <code class="language-plaintext highlighter-rouge">account_id</code>. Next, we’ll define the <code class="language-plaintext highlighter-rouge">User</code> model:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">User</span> <span class="o"><</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Base</span>
<span class="kp">include</span> <span class="no">DataEncryption</span>
<span class="n">store</span> <span class="ss">:params</span><span class="p">,</span> <span class="ss">accessors: </span><span class="p">[</span><span class="ss">:enc_data</span><span class="p">],</span> <span class="ss">coder: </span><span class="no">JSON</span>
<span class="n">data_encryption</span> <span class="ss">:raw_data</span><span class="p">,</span> <span class="ss">:enc_data</span><span class="p">,</span> <span class="ss">store_name: :params</span><span class="p">,</span> <span class="ss">accessors: </span><span class="p">[</span><span class="ss">:name</span><span class="p">,</span> <span class="ss">:email</span><span class="p">]</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">partition_id</span>
<span class="s1">'global'</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>The <code class="language-plaintext highlighter-rouge">DataEncryption</code> module defines <code class="language-plaintext highlighter-rouge">before_save</code> and <code class="language-plaintext highlighter-rouge">after_find</code> callbacks to ensure proper encryption and decryption of data when models are saved or retrieved from the database. The models that include it must define the <code class="language-plaintext highlighter-rouge">partition_id</code> for the encryption session. The <code class="language-plaintext highlighter-rouge">data_encryption</code> method expects the following arguments:</p>
<ul>
<li><code class="language-plaintext highlighter-rouge">raw_data</code> - a virtual attribute that holds the raw data</li>
<li><code class="language-plaintext highlighter-rouge">enc_data</code> - an attribute to store the encrypted data</li>
<li><code class="language-plaintext highlighter-rouge">store_name</code> - the name of the store where <code class="language-plaintext highlighter-rouge">enc_data</code> will be stored</li>
</ul>
<p>Next, we will define the <code class="language-plaintext highlighter-rouge">DataEncryption</code> module:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">module</span> <span class="nn">DataEncryption</span>
<span class="kp">extend</span> <span class="no">ActiveSupport</span><span class="o">::</span><span class="no">Concern</span>
<span class="no">DataEncrypt</span> <span class="o">=</span> <span class="no">Struct</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="ss">:raw_attr_name</span><span class="p">,</span> <span class="ss">:enc_attr_name</span><span class="p">,</span> <span class="ss">:store_name</span><span class="p">)</span>
<span class="n">included</span> <span class="k">do</span>
<span class="n">class_attribute</span> <span class="ss">:data_encrypt</span><span class="p">,</span> <span class="ss">default: </span><span class="kp">nil</span>
<span class="n">before_save</span> <span class="ss">:encrypt_data_callback</span>
<span class="n">after_find</span> <span class="ss">:decrypt_data_callback</span>
<span class="k">end</span>
<span class="n">class_methods</span> <span class="k">do</span>
<span class="k">def</span> <span class="nf">data_encryption</span><span class="p">(</span><span class="n">raw_attr_name</span><span class="p">,</span> <span class="n">enc_attr_name</span><span class="p">,</span> <span class="ss">store_name: </span><span class="p">,</span> <span class="ss">accessors: </span><span class="p">[])</span>
<span class="nb">self</span><span class="p">.</span><span class="nf">data_encrypt</span> <span class="o">=</span> <span class="no">DataEncrypt</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">raw_attr_name</span><span class="p">,</span> <span class="n">enc_attr_name</span><span class="p">,</span> <span class="n">store_name</span><span class="p">)</span>
<span class="n">attribute</span> <span class="n">raw_attr_name</span><span class="p">,</span> <span class="ss">default: </span><span class="o">-></span> <span class="p">{</span> <span class="no">HashWithIndifferentAccess</span><span class="p">.</span><span class="nf">new</span> <span class="p">}</span>
<span class="n">accessors</span><span class="p">.</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="n">accessor</span><span class="o">|</span>
<span class="n">define_method</span><span class="p">(</span><span class="n">accessor</span><span class="p">)</span> <span class="k">do</span>
<span class="n">public_send</span><span class="p">(</span><span class="n">raw_attr_name</span><span class="p">)[</span><span class="n">accessor</span><span class="p">]</span>
<span class="k">end</span>
<span class="n">define_method</span><span class="p">(</span><span class="s2">"</span><span class="si">#{</span><span class="n">accessor</span><span class="si">}</span><span class="s2">="</span><span class="p">)</span> <span class="k">do</span> <span class="o">|</span><span class="n">value</span><span class="o">|</span>
<span class="n">public_send</span><span class="p">(</span><span class="n">raw_attr_name</span><span class="p">)[</span><span class="n">accessor</span><span class="p">]</span> <span class="o">=</span> <span class="n">value</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">encrypt_data_callback</span>
<span class="n">data</span> <span class="o">=</span> <span class="n">public_send</span><span class="p">(</span><span class="n">data_encrypt</span><span class="p">.</span><span class="nf">raw_attr_name</span><span class="p">)</span>
<span class="k">if</span> <span class="n">data</span><span class="p">.</span><span class="nf">present?</span> <span class="o">||</span> <span class="n">public_send</span><span class="p">(</span><span class="n">data_encrypt</span><span class="p">.</span><span class="nf">enc_attr_name</span><span class="p">).</span><span class="nf">present?</span>
<span class="n">public_send</span><span class="p">(</span><span class="s2">"</span><span class="si">#{</span><span class="n">data_encrypt</span><span class="p">.</span><span class="nf">enc_attr_name</span><span class="si">}</span><span class="s2">="</span><span class="p">,</span> <span class="n">encrypt_data</span><span class="p">(</span><span class="n">data</span><span class="p">))</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">decrypt_data_callback</span>
<span class="n">enc_data</span> <span class="o">=</span> <span class="n">public_send</span><span class="p">(</span><span class="n">data_encrypt</span><span class="p">.</span><span class="nf">enc_attr_name</span><span class="p">)</span>
<span class="k">if</span> <span class="n">enc_data</span><span class="p">.</span><span class="nf">present?</span>
<span class="n">data</span> <span class="o">=</span> <span class="n">decrypt_data</span><span class="p">(</span><span class="n">enc_data</span><span class="p">)</span>
<span class="n">data</span> <span class="o">=</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Store</span><span class="o">::</span><span class="no">IndifferentCoder</span><span class="p">.</span><span class="nf">as_indifferent_hash</span><span class="p">(</span><span class="n">data</span><span class="p">)</span>
<span class="n">public_send</span><span class="p">(</span><span class="s2">"</span><span class="si">#{</span><span class="n">data_encrypt</span><span class="p">.</span><span class="nf">raw_attr_name</span><span class="si">}</span><span class="s2">="</span><span class="p">,</span> <span class="n">data</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">encrypt_data</span><span class="p">(</span><span class="n">data</span><span class="p">)</span>
<span class="no">Asherah</span><span class="p">.</span><span class="nf">encrypt</span><span class="p">(</span><span class="n">partition_id</span><span class="p">,</span> <span class="no">JSON</span><span class="p">.</span><span class="nf">dump</span><span class="p">(</span><span class="n">data</span><span class="p">))</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">decrypt_data</span><span class="p">(</span><span class="n">enc_data</span><span class="p">)</span>
<span class="no">JSON</span><span class="p">.</span><span class="nf">parse</span><span class="p">(</span><span class="no">Asherah</span><span class="p">.</span><span class="nf">decrypt</span><span class="p">(</span><span class="n">partition_id</span><span class="p">,</span> <span class="n">enc_data</span><span class="p">))</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<h2 id="how-to-search-encrypted-pii-data">How to search encrypted PII data</h2>
<p>Our PII data is encrypted before it is stored in the database, so we can’t query it directly: the ciphertext is neither searchable nor usefully indexable. One way to implement a search for encrypted PII data is to use a cryptographic technique called a blind index. Blind indexes are created by applying a one-way cryptographic hash function to the data, generating a unique fixed-length string that represents the data without revealing the actual content. To further enhance the security of the hashed data, we use a pepper, a secret key added to the input of the hashing function to create a peppered hash. Next, we’ll define the hashing function:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">Hasher</span>
  <span class="k">def</span> <span class="nc">self</span><span class="o">.</span><span class="nf">hash</span><span class="p">(</span><span class="n">value</span><span class="p">)</span>
    <span class="no">Digest</span><span class="o">::</span><span class="no">SHA256</span><span class="p">.</span><span class="nf">hexdigest</span><span class="p">(</span><span class="n">value</span><span class="p">.</span><span class="nf">downcase</span> <span class="o">+</span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s1">'HASHING_PEPPER'</span><span class="p">))</span>
  <span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
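<p>To illustrate the property the blind index relies on, here is a small self-contained sketch (the pepper is hard-coded for the example; the real code above reads it from the <code class="language-plaintext highlighter-rouge">HASHING_PEPPER</code> ENV var):</p>

```ruby
require 'digest'

# Stand-in for the Hasher above, with the pepper supplied inline for the example
pepper = 'example-pepper'
blind_index = ->(value) { Digest::SHA256.hexdigest(value.downcase + pepper) }

# The hash is deterministic and case-insensitive, so equal emails always map
# to the same fixed-length index value -- this is what makes exact-match
# lookups on the hashed column possible:
puts blind_index.call('Alice@Example.com') == blind_index.call('alice@example.com')  # => true
```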
<p>To implement a blind index, we will add a column named <code class="language-plaintext highlighter-rouge">hashed_email</code> with an index to the table <code class="language-plaintext highlighter-rouge">users</code>. That way, we’ll be able to search for an exact match of the hashed email (though we still can’t do a full-text search or use LIKE queries). Next, we’ll add the migration:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">AddHashedEmailToUsers</span> <span class="o"><</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Migration</span><span class="p">[</span><span class="mf">7.0</span><span class="p">]</span>
  <span class="k">def</span> <span class="nf">change</span>
    <span class="n">add_column</span> <span class="ss">:users</span><span class="p">,</span> <span class="ss">:hashed_email</span><span class="p">,</span> <span class="ss">:string</span>
    <span class="n">add_index</span> <span class="ss">:users</span><span class="p">,</span> <span class="ss">:hashed_email</span>
  <span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>We can then add a <code class="language-plaintext highlighter-rouge">before_validation</code> callback to our model to hash the data for the PII columns and define helper class methods like <code class="language-plaintext highlighter-rouge">find_by_email</code>. Finally, we’ll extend the <code class="language-plaintext highlighter-rouge">User</code> model with the following code:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">User</span> <span class="o"><</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Base</span>
  <span class="n">before_validation</span> <span class="ss">:hash_pii_columns</span>

  <span class="k">def</span> <span class="nc">self</span><span class="o">.</span><span class="nf">find_by_email</span><span class="p">(</span><span class="n">email</span><span class="p">)</span>
    <span class="n">where</span><span class="p">(</span><span class="ss">hashed_email: </span><span class="no">Hasher</span><span class="p">.</span><span class="nf">hash</span><span class="p">(</span><span class="n">email</span><span class="p">)).</span><span class="nf">take</span>
  <span class="k">end</span>

  <span class="kp">private</span>

  <span class="k">def</span> <span class="nf">hash_pii_columns</span>
    <span class="nb">self</span><span class="p">.</span><span class="nf">hashed_email</span> <span class="o">=</span> <span class="no">Hasher</span><span class="p">.</span><span class="nf">hash</span><span class="p">(</span><span class="n">email</span><span class="p">)</span> <span class="k">if</span> <span class="n">email</span><span class="p">.</span><span class="nf">present?</span>
  <span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<h2 id="important-considerations-for-production-deployments-of-asherah-ruby">Important considerations for production deployments of Asherah-Ruby</h2>
<p>The following are some things to consider before deploying Asherah-Ruby to production:</p>
<ul>
<li>Base64 encoding of the encrypted data adds a minimum overhead of about 33% to payload size.</li>
<li>Warm up Asherah with a dummy encrypt call to decrypt the master key with KMS and cache it in memory before handling any requests:
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Rails</span><span class="p">.</span><span class="nf">configuration</span><span class="p">.</span><span class="nf">after_initialize</span> <span class="k">do</span>
  <span class="no">Asherah</span><span class="p">.</span><span class="nf">encrypt</span><span class="p">(</span><span class="s1">'global'</span><span class="p">,</span> <span class="s1">'warmup'</span><span class="p">)</span>
<span class="k">end</span>
</code></pre></div> </div>
</li>
<li>Use glibc-based Linux distributions because the Go standard library has an incompatibility that causes C-shared builds to <a href="https://github.com/golang/go/issues/13492">fail with musl libc</a>.</li>
<li>You might need to pass ENV variables from Ruby to Go, as with the <code class="language-plaintext highlighter-rouge">AWS_CONTAINER_CREDENTIALS_RELATIVE_URI</code> ENV var when running in AWS Fargate containers. Go’s <code class="language-plaintext highlighter-rouge">os.Getenv()</code> does not see variables set by <code class="language-plaintext highlighter-rouge">C.setenv()</code>, as reported in this <a href="https://github.com/golang/go/issues/44108">issue</a> and documented in the <a href="https://github.com/golang/go/wiki/cgo#environmental-variables">wiki</a>.
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">AWS_ECS_ENV_VAR_NAME</span> <span class="o">=</span> <span class="s1">'AWS_CONTAINER_CREDENTIALS_RELATIVE_URI'</span>
<span class="no">Asherah</span><span class="p">.</span><span class="nf">set_env</span><span class="p">(</span><span class="no">AWS_ECS_ENV_VAR_NAME</span> <span class="o">=></span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="no">AWS_ECS_ENV_VAR_NAME</span><span class="p">))</span> <span class="k">if</span> <span class="no">ENV</span><span class="p">[</span><span class="no">AWS_ECS_ENV_VAR_NAME</span><span class="p">].</span><span class="nf">present?</span>
</code></pre></div> </div>
</li>
</ul>
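<p>The 33% figure in the first point above follows directly from how Base64 works: every 3 bytes of ciphertext become 4 output characters. A quick check in plain Ruby:</p>

```ruby
require 'base64'

ciphertext = "\x00".b * 300                 # pretend 300 bytes of encrypted data
encoded    = Base64.strict_encode64(ciphertext)

# 3 input bytes -> 4 output characters, so the encoded payload grows by ~33%
overhead = (encoded.bytesize - ciphertext.bytesize).to_f / ciphertext.bytesize
puts format('%.0f%%', overhead * 100)  # => 33%
```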
<h2 id="conclusion">Conclusion</h2>
<p><a href="https://github.com/godaddy/asherah">Asherah</a>’s cross-language support, secure memory management, and the granularity of its hierarchical key encryption model are some of the key features that help us minimize attack exposure and increase the security of our customer data. <a href="https://github.com/godaddy/asherah/blob/master/docs/Internals.md#ttl-and-expiredrevoked-keys">Revoking keys</a> due to a suspected compromise is also built into the key rotation model. We have been using Asherah successfully in production for a few years now. For Ruby projects specifically, we’ve iterated through a few different distributions of it: an Asherah Go sidecar, a pure Ruby implementation of Asherah, and finally <a href="https://github.com/godaddy/asherah-ruby">Asherah-Ruby</a>, which uses Asherah Go under the hood. Ruby on Rails 7 introduced built-in <a href="https://guides.rubyonrails.org/active_record_encryption.html">Active Record Encryption</a> for encrypting data at the application layer, and it’s great to see more solutions in this space, each with its own features and advantages.</p>This blog post was originally published on the GoDaddy Engineering Blog.Optimizing Email Batch API with bulk inserts2022-09-12T07:00:00+00:002022-09-12T07:00:00+00:00https://dalibornasevic.com/posts/rails-bulk-insert-mysql<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2022/09/12/rails-bulk-insert-mysql/">GoDaddy Engineering Blog</a>.</em></p>
<p><em>*) Cover Photo Attribution: Photo by <a href="https://unsplash.com/@glenncarstenspeters?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Glenn Carstens-Peters</a> on <a href="https://unsplash.com/photos/tagHjCxTHEw?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Unsplash</a></em></p>This blog post was originally published on the GoDaddy Engineering Blog.Optimizing Email Batch API with bulk inserts2022-09-12T07:00:00+00:002022-09-12T07:00:00+00:00https://dalibornasevic.com/posts/rails-bulk-insert-mysql<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2022/09/12/rails-bulk-insert-mysql/">GoDaddy Engineering Blog</a>.</em></p>
<p style="text-align: center">
<img src="/images/rails-bulk-insert-mysql/cover.jpg" alt="Train in motion" />
</p>
<p>Rails 6 introduced the <a href="https://api.rubyonrails.org/classes/ActiveRecord/Persistence/ClassMethods.html#method-i-insert_all">insert_all</a> ActiveRecord API for inserting multiple records into the database with a single SQL INSERT statement. It has an option to select the <code class="language-plaintext highlighter-rouge">returning</code> columns, but it is available only for PostgreSQL using its <code class="language-plaintext highlighter-rouge">RETURNING</code> SQL clause and not MySQL. This blog post explores how we optimized our Email Batch API by using Rails bulk inserts with MySQL and the details of calculating the auto-incrementing IDs for records.</p>
<h2 id="improving-our-email-api">Improving Our Email API</h2>
<p>Our Email API has a multi-tenant architecture providing a database for each customer. It accepts millions of emails daily and provides a Batch API for enqueuing up to 50 messages per single batch request. The Batch API inserts these records one by one, enqueues background workers to build and deliver the emails, and returns the message IDs to the client for an eventual status check later.</p>
<p>Our change aims to improve the Batch API performance by inserting messages in bulk while preserving the original API design and returning the message IDs to the client. Our API runs on-premise using MySQL and in AWS using Aurora MySQL, and the change must be compatible with both.</p>
<h2 id="mysql-information-functions">MySQL Information Functions</h2>
<p>Although MySQL does not support a <code class="language-plaintext highlighter-rouge">RETURNING</code> clause, it provides <code class="language-plaintext highlighter-rouge">LAST_INSERT_ID()</code> and <code class="language-plaintext highlighter-rouge">ROW_COUNT()</code> functions that can help us calculate the auto-incrementing ID values from the connection session. The <a href="https://dev.mysql.com/doc/refman/5.6/en/information-functions.html#function_last-insert-id">LAST_INSERT_ID()</a> function returns the first automatically generated value successfully inserted for an <code class="language-plaintext highlighter-rouge">AUTO_INCREMENT</code> column in a table. And the <a href="https://dev.mysql.com/doc/refman/5.6/en/information-functions.html#function_row-count">ROW_COUNT()</a> function returns the number of rows affected by the previous SQL statement.</p>
<p>So, it seems simple enough to calculate the auto-incrementing IDs based on these two values.</p>
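<p>As a sketch of the calculation (with hypothetical values in place of a real connection session):</p>

```ruby
# Values the connection session would report after a 3-row bulk insert;
# the numbers here are illustrative, not from a real MySQL session
last_insert_id = 101  # LAST_INSERT_ID(): first AUTO_INCREMENT value generated
row_count      = 3    # ROW_COUNT(): rows affected by the INSERT statement

# Assuming gapless, consecutive IDs, the full range follows directly:
ids = (last_insert_id...(last_insert_id + row_count)).to_a
puts ids.inspect  # => [101, 102, 103]
```

<p>The rest of this post covers the conditions (insert type, lock mode, replication format) under which that gapless assumption actually holds.</p>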
<h2 id="auto_increment-handling-in-innodb">AUTO_INCREMENT Handling in InnoDB</h2>
<p>Before we go any further, we need to review <a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-auto-increment-handling.html">AUTO_INCREMENT handling in InnoDB</a>, because the type of inserts, the lock mode, and the replication format determine whether the IDs will be consecutive and whether they will be the same on the replicas as on the source. We need to ensure the generated IDs have no gaps so that we can reliably calculate their values with these functions.</p>
<p>The type of multiple-row inserts we do are:</p>
<div class="language-sql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">INSERT</span> <span class="k">INTO</span> <span class="nv">`messages`</span> <span class="p">(</span><span class="nv">`template_id`</span><span class="p">,</span> <span class="nv">`params`</span><span class="p">,</span> <span class="nv">`created_at`</span><span class="p">,</span> <span class="nv">`processed`</span><span class="p">)</span>
<span class="k">VALUES</span> <span class="p">(</span><span class="k">NULL</span><span class="p">,</span> <span class="s1">'content'</span><span class="p">,</span> <span class="s1">'2022-02-04 15:12:24'</span><span class="p">,</span> <span class="k">FALSE</span><span class="p">),</span>
       <span class="p">(</span><span class="k">NULL</span><span class="p">,</span> <span class="s1">'content'</span><span class="p">,</span> <span class="s1">'2022-02-04 15:12:24'</span><span class="p">,</span> <span class="k">FALSE</span><span class="p">)</span>
</code></pre></div></div>
<p>These inserts fall into the category of <a href="https://dev.mysql.com/doc/refman/8.0/en/innodb-auto-increment-handling.html#:~:text=mode%E2%80%9D%20inserts.-,%E2%80%9CSimple%20inserts%E2%80%9D,-Statements%20for%20which">simple inserts</a>:</p>
<blockquote>
<p>Statements for which the number of rows to be inserted can be determined in advance (when the statement is initially processed). This includes single-row and multiple-row INSERT and REPLACE statements that do not have a nested subquery, but not INSERT … ON DUPLICATE KEY UPDATE.</p>
</blockquote>
<p>There are three types of lock modes for MySQL: traditional (0), consecutive (1), and interleaved (2). If the only statements we execute are “simple inserts,” then there are no gaps in the numbers generated for any lock mode. We use the “consecutive” lock mode with MySQL 5.7:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">irb</span><span class="p">(</span><span class="n">main</span><span class="p">):</span><span class="mo">001</span><span class="p">:</span><span class="mi">0</span><span class="o">></span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Base</span><span class="p">.</span><span class="nf">connection</span><span class="p">.</span><span class="nf">execute</span><span class="p">(</span><span class="s2">"SELECT @@innodb_autoinc_lock_mode;"</span><span class="p">).</span><span class="nf">to_a</span>
<span class="o">=></span> <span class="p">[[</span><span class="mi">1</span><span class="p">]]</span>
</code></pre></div></div>
<p>There are three types of binary log formats: <code class="language-plaintext highlighter-rouge">STATEMENT</code>, <code class="language-plaintext highlighter-rouge">ROW</code>, and <code class="language-plaintext highlighter-rouge">MIXED</code>. When using statement-based replication and interleaved lock mode combination, there are no guarantees for auto-increment values to be the same on the replicas as on the source. But, when using row-based or mixed-format replication and any auto-increment lock mode, auto-increment values will be the same on the replicas as on the source. We run our binary log in <code class="language-plaintext highlighter-rouge">MIXED</code> format.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="mf">2.7</span><span class="o">.</span><span class="mi">4</span> <span class="p">:</span><span class="mo">001</span> <span class="o">></span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">Base</span><span class="p">.</span><span class="nf">connection</span><span class="p">.</span><span class="nf">execute</span><span class="p">(</span><span class="s2">"SHOW VARIABLES LIKE 'binlog_format';"</span><span class="p">).</span><span class="nf">to_a</span>
<span class="o">=></span> <span class="p">[[</span><span class="s2">"binlog_format"</span><span class="p">,</span> <span class="s2">"MIXED"</span><span class="p">]]</span>
</code></pre></div></div>
<p>Here’s the <code class="language-plaintext highlighter-rouge">messages</code> table schema we do bulk inserts against:</p>
<div class="language-sql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">CREATE</span> <span class="k">TABLE</span> <span class="nv">`messages`</span> <span class="p">(</span>
  <span class="nv">`id`</span> <span class="nb">int</span><span class="p">(</span><span class="mi">11</span><span class="p">)</span> <span class="k">NOT</span> <span class="k">NULL</span> <span class="n">AUTO_INCREMENT</span><span class="p">,</span>
  <span class="nv">`template_id`</span> <span class="nb">int</span><span class="p">(</span><span class="mi">11</span><span class="p">)</span> <span class="k">DEFAULT</span> <span class="k">NULL</span><span class="p">,</span>
  <span class="nv">`params`</span> <span class="nb">mediumtext</span> <span class="nb">CHARACTER</span> <span class="k">SET</span> <span class="n">utf8mb4</span><span class="p">,</span>
  <span class="nv">`created_at`</span> <span class="nb">timestamp</span> <span class="k">NOT</span> <span class="k">NULL</span><span class="p">,</span>
  <span class="nv">`processed`</span> <span class="nb">tinyint</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span> <span class="k">DEFAULT</span> <span class="s1">'0'</span><span class="p">,</span>
  <span class="k">PRIMARY</span> <span class="k">KEY</span> <span class="p">(</span><span class="nv">`id`</span><span class="p">),</span>
  <span class="k">KEY</span> <span class="nv">`index_messages_on_template_id`</span> <span class="p">(</span><span class="nv">`template_id`</span><span class="p">)</span>
<span class="p">)</span> <span class="n">ENGINE</span><span class="o">=</span><span class="n">InnoDB</span> <span class="n">AUTO_INCREMENT</span><span class="o">=</span><span class="mi">1</span> <span class="k">DEFAULT</span> <span class="n">CHARSET</span><span class="o">=</span><span class="n">utf8</span> <span class="n">ROW_FORMAT</span><span class="o">=</span><span class="n">COMPRESSED</span> <span class="n">KEY_BLOCK_SIZE</span><span class="o">=</span><span class="mi">8</span>
</code></pre></div></div>
<h2 id="thread-safety-of-last_insert_id">Thread Safety of LAST_INSERT_ID()</h2>
<p>Given that <a href="https://dev.mysql.com/doc/refman/8.0/en/information-functions.html#function_last-insert-id">LAST_INSERT_ID()</a> isolation is <strong>per-connection</strong> and Rails’s <a href="https://api.rubyonrails.org/classes/ActiveRecord/ConnectionAdapters/ConnectionPool.html">ConnectionPool</a> is thread-safe, using <code class="language-plaintext highlighter-rouge">LAST_INSERT_ID()</code> is safe in our case.</p>
<blockquote>
<p>The ID that was generated is maintained in the server on a <strong><em>per-connection basis</em></strong>. This means that the value returned by the function to a given client is the first AUTO_INCREMENT value generated for the most recent statement affecting an AUTO_INCREMENT column <strong><em>by that client</em></strong>. This value cannot be affected by other clients, even if they generate AUTO_INCREMENT values of their own. This behavior ensures that each client can retrieve its own ID without concern for the activity of other clients, and without the need for locks or transactions.</p>
</blockquote>
<h2 id="bulk-insert-and-calculate-auto-incrementing-ids">Bulk Insert and Calculate Auto-Incrementing IDs</h2>
<p>To convert the individual Rails model saves to a single bulk insert, we collect the attributes for all models before the final bulk insert. The following code with inline comments shows how we collect the models’ attributes.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">attributes</span> <span class="o">=</span> <span class="n">messages_params</span><span class="p">.</span><span class="nf">map</span> <span class="k">do</span> <span class="o">|</span><span class="n">message_params</span><span class="o">|</span>
  <span class="c1"># Initialize Message object</span>
  <span class="n">message</span> <span class="o">=</span> <span class="no">Message</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="ss">message_params: </span><span class="n">message_params</span><span class="p">)</span>

  <span class="c1"># Set timestamps</span>
  <span class="no">Message</span><span class="p">.</span><span class="nf">all_timestamp_attributes_in_model</span><span class="p">.</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="nb">name</span><span class="o">|</span>
    <span class="n">message</span><span class="p">.</span><span class="nf">_write_attribute</span><span class="p">(</span><span class="nb">name</span><span class="p">,</span> <span class="no">Message</span><span class="p">.</span><span class="nf">current_time_from_proper_timezone</span><span class="p">)</span>
  <span class="k">end</span>

  <span class="c1"># Run the necessary model callbacks</span>
  <span class="p">[</span><span class="ss">:validation</span><span class="p">,</span> <span class="ss">:save</span><span class="p">,</span> <span class="ss">:create</span><span class="p">].</span><span class="nf">each</span> <span class="p">{</span> <span class="o">|</span><span class="n">kind</span><span class="o">|</span> <span class="n">message</span><span class="p">.</span><span class="nf">run_callbacks</span><span class="p">(</span><span class="n">kind</span><span class="p">)</span> <span class="p">}</span>

  <span class="c1"># Collect message attributes for bulk insert</span>
  <span class="n">attribute_names</span> <span class="o">=</span> <span class="no">Message</span><span class="p">.</span><span class="nf">column_names</span> <span class="o">-</span> <span class="p">[</span><span class="no">Message</span><span class="p">.</span><span class="nf">primary_key</span><span class="p">]</span>
  <span class="n">attribute_names</span><span class="p">.</span><span class="nf">each_with_object</span><span class="p">({})</span> <span class="k">do</span> <span class="o">|</span><span class="nb">name</span><span class="p">,</span> <span class="n">object</span><span class="o">|</span>
    <span class="n">object</span><span class="p">[</span><span class="nb">name</span><span class="p">]</span> <span class="o">=</span> <span class="n">message</span><span class="p">.</span><span class="nf">_read_attribute</span><span class="p">(</span><span class="nb">name</span><span class="p">)</span>
  <span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Once we’ve built the array of attributes, we can call <code class="language-plaintext highlighter-rouge">insert_all!</code>.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Message</span><span class="p">.</span><span class="nf">insert_all!</span><span class="p">(</span><span class="n">attributes</span><span class="p">)</span>
</code></pre></div></div>
<p>We use <code class="language-plaintext highlighter-rouge">insert_all!</code> instead of <code class="language-plaintext highlighter-rouge">insert_all</code> for the bulk insert so that if an issue occurs, the whole insert fails and no rows are inserted. For instance, <code class="language-plaintext highlighter-rouge">insert_all!</code> raises an <code class="language-plaintext highlighter-rouge">ActiveRecord::RecordNotUnique</code> error if any row violates a unique index on the table.</p>
<p>After inserting the records, we can calculate the auto-incrementing IDs by retrieving the <code class="language-plaintext highlighter-rouge">LAST_INSERT_ID()</code> value from the <code class="language-plaintext highlighter-rouge">Mysql2::Client</code> object using the <code class="language-plaintext highlighter-rouge">last_id</code> method:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">mysql_client</span> <span class="o">=</span> <span class="no">Message</span><span class="p">.</span><span class="nf">connection</span><span class="p">.</span><span class="nf">instance_variable_get</span><span class="p">(</span><span class="ss">:@connection</span><span class="p">)</span>
<span class="n">last_id</span> <span class="o">=</span> <span class="n">mysql_client</span><span class="p">.</span><span class="nf">last_id</span>
</code></pre></div></div>
<p>When inserting multiple rows using a single INSERT statement, the <code class="language-plaintext highlighter-rouge">last_id</code> returns only the value generated for the first inserted row.</p>
<p>To get the <code class="language-plaintext highlighter-rouge">ROW_COUNT()</code> function value, we can call the <code class="language-plaintext highlighter-rouge">affected_rows</code> method on the <code class="language-plaintext highlighter-rouge">Mysql2::Client</code>, but since the IDs are consecutive numbers, we can simply add the loop <code class="language-plaintext highlighter-rouge">index</code> to the <code class="language-plaintext highlighter-rouge">last_id</code> and set the message ID:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">messages</span><span class="p">.</span><span class="nf">each_with_index</span> <span class="k">do</span> <span class="o">|</span><span class="n">message</span><span class="p">,</span> <span class="n">index</span><span class="o">|</span>
  <span class="n">message</span><span class="p">.</span><span class="nf">id</span> <span class="o">=</span> <span class="n">last_id</span> <span class="o">+</span> <span class="n">index</span>
<span class="k">end</span>
</code></pre></div></div>
<h2 id="performance-improvements">Performance Improvements</h2>
<p>By deploying this change to our Email API running with MySQL 5.7, we saw a decrease of about 35% in the average transaction duration of Batch API requests. This percentage reflects the duration of the entire Batch API request, not just the MySQL insert time.</p>
<p><img src="/images/rails-bulk-insert-mysql/bulk_insert_mysql.png" alt="Bulk inserts MySQL" /></p>
<p>And for the Email API in AWS running with Aurora MySQL 5.7, the change decreased the average transaction duration time of the Batch API request by about 65%.</p>
<p><img src="/images/rails-bulk-insert-mysql/bulk_insert_aurora.png" alt="Bulk inserts AWS Aurora MySQL" /></p>
<h2 id="summary">Summary</h2>
<p>MySQL does not support a <code class="language-plaintext highlighter-rouge">RETURNING</code> clause for getting the auto-incrementing IDs for bulk inserts, but it provides the <code class="language-plaintext highlighter-rouge">LAST_INSERT_ID()</code> information function that helps us calculate them. By introducing bulk inserts, we significantly improved the transaction duration times of our Email Batch API requests. The change had a more significant effect on AWS Aurora MySQL, presumably due to its storage engine optimizations. A simpler application model with minimal callbacks and validation logic makes introducing such a change more feasible.</p>
<p><em>*) Cover Photo Attribution: Photo by Marek Piwnicki: https://www.pexels.com/photo/train-in-motion-8991549/</em></p>This blog post was originally published on the GoDaddy Engineering Blog.Running Puma in AWS2022-01-10T19:00:00+00:002022-01-10T19:00:00+00:00https://dalibornasevic.com/posts/running-puma-in-aws<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2022/01/10/running-puma-in-aws/">GoDaddy Engineering Blog</a>.</em></p>
<p style="text-align: center">
<img src="/images/puma-aws/puma-logo.png" alt="Puma Logo" />
</p>
<p>In the past couple of years, we have been on our <a href="https://www.godaddy.com/engineering/2021/05/07/godaddys-journey-to-the-cloud/">journey to the cloud</a>, migrating our web services to AWS. In this blog post, we share what we learned about deploying the Puma web server to AWS while migrating our email delivery service, written in Ruby.</p>
<h2 id="what-is-puma">What is Puma?</h2>
<p><a href="https://puma.io/">Puma</a> is the <a href="https://www.ruby-toolbox.com/categories/web_servers">most popular</a> Ruby web server used in production per the <a href="https://rails-hosting.com/2020/#which-rails-servers-are-you-using-in-production">Ruby on Rails Community Survey Results</a>. It is a fast and reliable web server that we use for deploying containerized Ruby applications at GoDaddy.</p>
<h2 id="end-to-end-ssl">End-to-end SSL</h2>
<p>The web components of our email delivery service run on Kubernetes. The Kubernetes service is behind an ALB Ingress Controller managed by <a href="https://kubernetes-sigs.github.io/aws-load-balancer-controller/latest/">AWS Load Balancer Controller</a>. Every web request has end-to-end encryption in transit: the Application Load Balancer (ALB) terminates TLS and re-initiates it toward the targets, and the Puma server terminates TLS again in the Kubernetes pod. The Kubernetes pod is a Docker container running a Ruby on Rails application with Puma.</p>
<h2 id="loading-certificates-from-memory">Loading certificates from memory</h2>
<p>When a container starts, the application initialization process retrieves the SSL certificates from AWS Secrets Manager and configures them with Puma on the fly. We <a href="https://github.com/puma/puma/pull/2728">contributed a change</a> to Puma’s MiniSSL C extension to allow setting <code class="language-plaintext highlighter-rouge">cert_pem</code> and <code class="language-plaintext highlighter-rouge">key_pem</code> strings without persisting them on disk for security reasons. This new functionality is available through the <code class="language-plaintext highlighter-rouge">ssl_bind</code> Puma DSL and will be available in the next Puma version (> 5.5.2).</p>
<p>With the following sample we fetch and configure the certificate for our API component:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1"># config/puma.rb</span>
<span class="n">config</span> <span class="o">=</span> <span class="no">AwsDeploy</span><span class="o">::</span><span class="no">Config</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s2">"RAILS_ENV"</span><span class="p">))</span>
<span class="n">certificate_downloader</span> <span class="o">=</span> <span class="no">AwsCertificateDownloader</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">config</span><span class="p">)</span>
<span class="n">api_port</span> <span class="o">=</span> <span class="no">ENV</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="s2">"PORT_API"</span><span class="p">)</span>
<span class="n">api_cert</span> <span class="o">=</span> <span class="n">certificate_downloader</span><span class="p">.</span><span class="nf">download</span><span class="p">(</span><span class="s2">"/Cert/</span><span class="si">#{</span><span class="n">config</span><span class="p">.</span><span class="nf">api_host_name</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span>
<span class="n">ssl_bind</span> <span class="s1">'0.0.0.0'</span><span class="p">,</span> <span class="n">api_port</span><span class="p">,</span> <span class="p">{</span>
  <span class="ss">cert_pem: </span><span class="n">api_cert</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="ss">:cert</span><span class="p">),</span>
  <span class="ss">key_pem: </span><span class="n">api_cert</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="ss">:key</span><span class="p">),</span>
  <span class="ss">no_tlsv1: </span><span class="kp">true</span><span class="p">,</span>
  <span class="ss">no_tlsv1_1: </span><span class="kp">true</span><span class="p">,</span>
<span class="p">}</span>
</code></pre></div></div>
<p>We run two other application components on different ports and hosts in the same Puma process using a similar config to the above.</p>
<h2 id="warmup-for-slow-clients">Warmup for slow clients</h2>
<p>We use our application-layer encryption SDK, <a href="https://github.com/godaddy/asherah">Asherah</a>, to encrypt all data with Personally Identifiable Information (PII) in the database. Each data row is encrypted with a data row key, which is in turn encrypted with an intermediate key, then a system key, and finally a master key stored in AWS Key Management Service (KMS).</p>
<p>Asherah client initialization is an expensive operation that involves HTTP requests to the AWS KMS service and database calls to retrieve the system and intermediate keys. To avoid availability issues during process restarts (deploys, daily node rotation), we warm up slow-initializing clients inside Puma’s <code class="language-plaintext highlighter-rouge">on_worker_boot</code> block.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">on_worker_boot</span> <span class="k">do</span>
<span class="no">AsherahClient</span><span class="p">.</span><span class="nf">encrypt</span><span class="p">(</span><span class="s1">'warmup'</span><span class="p">,</span> <span class="no">EncryptionPartition</span><span class="o">::</span><span class="no">GLOBAL</span><span class="p">)</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Without a warmed-up Asherah client, a spike of requests during a deployment causes availability problems, as seen on the ALB monitoring graphs below. Warming up the Asherah client before targets are put in service resolves that issue.</p>
<p><img src="/images/puma-aws/alb.png" alt="ALB Monitoring" /></p>
<h2 id="alb-monitoring">ALB monitoring</h2>
<p>The AWS Load Balancer Monitoring page gives us a good overview of incoming requests and response statuses.</p>
<p>We need to distinguish between status codes returned from the targets:</p>
<ul>
<li><code class="language-plaintext highlighter-rouge">HTTP 2XXs</code> (HTTPCode_Target_2XX_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 3XXs</code> (HTTPCode_Target_3XX_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 4XXs</code> (HTTPCode_Target_4XX_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 5XXs</code> (HTTPCode_Target_5XX_Count)</li>
</ul>
<p>and status codes generated by the load balancers:</p>
<ul>
<li><code class="language-plaintext highlighter-rouge">ELB 4XXs</code> (HTTPCode_ELB_4XX_Count)</li>
<li><code class="language-plaintext highlighter-rouge">ELB 5XXs</code> (HTTPCode_ELB_5XX_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 500s</code> (HTTPCode_ELB_500_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 502s</code> (HTTPCode_ELB_502_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 503s</code> (HTTPCode_ELB_503_Count)</li>
<li><code class="language-plaintext highlighter-rouge">HTTP 504s</code> (HTTPCode_ELB_504_Count)</li>
</ul>
<p>Errors generated by the targets will appear in the Exception Monitoring and/or Application Performance Monitoring (APM) systems and are easier to find and resolve than <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/load-balancer-troubleshooting.html#load-balancer-http-error-codes">errors generated by the AWS ELB</a> (Elastic Load Balancer).</p>
<p>In our experience, and specific to our infrastructure setup, <code class="language-plaintext highlighter-rouge">HTTP 500s</code> errors are blocks coming from the AWS Web Application Firewall (WAF), <code class="language-plaintext highlighter-rouge">HTTP 502s</code> errors are caused by TCP connection or SSL handshake issues, <code class="language-plaintext highlighter-rouge">HTTP 503s</code> errors happen when there are no targets, and <code class="language-plaintext highlighter-rouge">HTTP 504s</code> errors are due to capacity issues, i.e., when there are not enough targets.</p>
<h2 id="keep-alive-timeout">Keep-Alive timeout</h2>
<p>In our setup, ALB uses Keep-Alive connections with Puma and we noticed a small but consistent rate of <code class="language-plaintext highlighter-rouge">HTTP 502s</code> errors during quiet hours. That was happening because Puma’s <a href="https://github.com/puma/puma/blob/master/lib/puma/const.rb">default persistent timeout</a> is 20 seconds (<code class="language-plaintext highlighter-rouge">PERSISTENT_TIMEOUT = 20</code>) and the ALB <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/application-load-balancers.html#connection-idle-timeout">connection idle timeout</a> is 60 seconds. During such quiet intervals, Puma can close the connection before the ALB does, and the ALB then serves a 502 Bad Gateway error to the client.</p>
<p><img src="/images/puma-aws/alb-keep-alive.png" alt="ALB Monitoring" /></p>
<p>By configuring <code class="language-plaintext highlighter-rouge">persistent_timeout</code> for Puma to a value greater than the ALB connection idle timeout (60 seconds) plus the ALB connect timeout (10 seconds), we resolved that issue:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">persistent_timeout</span><span class="p">(</span><span class="mi">75</span><span class="p">)</span>
</code></pre></div></div>
<h2 id="handling-blasts-of-requests">Handling blasts of requests</h2>
<p>When we send campaigns with a significant volume of recipients hosted by a single ISP, sometimes we get back a spike of web requests from the ISP that checks the links with their abuse and spam protection system. Some of the ISPs use a wide range of IPs and the <a href="https://docs.aws.amazon.com/waf/latest/developerguide/waf-rule-statement-type-rate-based.html">AWS WAF IP rate limiting per 5-minute time span</a> often does not catch the complete blast of requests and we get 504 Gateway timeout:</p>
<p><img src="/images/puma-aws/alb-blast.png" alt="Blast of Requests" /></p>
<blockquote>
<p>The load balancer failed to establish a connection to the target before the connection timeout expired (10 seconds).</p>
</blockquote>
<p>These 504 errors are the result of open timeouts from the ALB to the targets, i.e., requests from the ALB wait for 10 seconds and are unable to connect to the target socket. The cause is slow requests that saturate the queue: during the blast of requests the socket backlog fills up, and the operating system stops accepting new connections. Puma allows configuring the backlog value that determines the size of the queue for unaccepted connections.</p>
<p>We made a <a href="https://github.com/puma/puma/pull/2780">change to Puma</a> to allow setting the <code class="language-plaintext highlighter-rouge">backlog</code> value with the <code class="language-plaintext highlighter-rouge">ssl_bind</code> DSL that we use. It’s interesting that, although Puma sets the backlog size to 1024 by default, its actual value depends on the OS value for max socket connections; i.e., it is capped by the <code class="language-plaintext highlighter-rouge">net.core.somaxconn</code> sysctl value. We can check the system value with <code class="language-plaintext highlighter-rouge">sysctl net.core.somaxconn</code> or <code class="language-plaintext highlighter-rouge">cat /proc/sys/net/core/somaxconn</code>. On older Linux kernels (before linux-5.4) the default was set to 128 and on newer, it is 4096 (<a href="https://github.com/torvalds/linux/blob/ca2ef2d9f2aad7a28d346522bb4c473a0aa05249/Documentation/networking/ip-sysctl.rst#tcp-variables">reference</a>).</p>
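<p>As a sketch of that capping behavior (the helper name is ours, not Puma’s), the effective backlog is simply the requested value limited by the kernel’s <code class="language-plaintext highlighter-rouge">net.core.somaxconn</code> setting:</p>

```ruby
# Hypothetical helper illustrating the backlog cap described above.
# The effective backlog is the requested value, limited by the kernel's
# net.core.somaxconn setting (128 was the default before linux-5.4).
def effective_backlog(requested, somaxconn_path: "/proc/sys/net/core/somaxconn")
  cap = File.exist?(somaxconn_path) ? File.read(somaxconn_path).to_i : 128
  [requested, cap].min
end

effective_backlog(1024, somaxconn_path: "/nonexistent")  # => 128 (falls back to the old default)
```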
<p>To set that with Puma’s <code class="language-plaintext highlighter-rouge">ssl_bind</code> DSL, we just provide the appropriate <code class="language-plaintext highlighter-rouge">backlog</code> value with:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">ssl_bind</span> <span class="s1">'0.0.0.0'</span><span class="p">,</span> <span class="n">tracking_port</span><span class="p">,</span> <span class="p">{</span>
<span class="c1"># ...</span>
<span class="ss">backlog: </span><span class="mi">4096</span><span class="p">,</span>
<span class="p">}</span>
</code></pre></div></div>
<p>In our particular case, it makes sense to increase the <code class="language-plaintext highlighter-rouge">backlog</code> value to avoid dropping requests while the blast lasts, at the cost of slightly increased latency for that short duration. That resolution came after we first evaluated capacity increase options and optimized the endpoints by moving expensive operations to background jobs. These responses are in the range of 1-10 milliseconds, and tools like <a href="https://github.com/SamSaffron/lru_redux">lru_redux</a> for in-process memory caching are extremely helpful.</p>
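<p>The in-process caching idea can be sketched in a few lines of plain Ruby; this toy LRU cache is for illustration only and is not the <a href="https://github.com/SamSaffron/lru_redux">lru_redux</a> API:</p>

```ruby
# Toy in-process LRU cache, for illustration only (a stand-in for a gem
# like lru_redux). Ruby hashes preserve insertion order, so re-inserting
# a key on access moves it to the "most recently used" end.
class TinyLru
  def initialize(max_size)
    @max_size = max_size
    @data = {}
  end

  # Return a cached value, computing and storing it on a miss.
  def getset(key)
    if @data.key?(key)
      @data[key] = @data.delete(key)                        # refresh recency
    else
      @data[key] = yield
      @data.delete(@data.first[0]) if @data.size > @max_size # evict oldest
    end
    @data[key]
  end
end

cache = TinyLru.new(2)
cache.getset(:a) { 1 }
cache.getset(:b) { 2 }
cache.getset(:c) { 3 }             # evicts :a
cache.getset(:a) { "recomputed" }  # => "recomputed"
```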
<p>Another thing to check is whether the liveness probe is the same as the readiness probe, as that can <a href="https://srcco.de/posts/kubernetes-liveness-probes-are-dangerous.html">worsen such high-load situations</a> by restarting the pods. If it is, we can increase the <code class="language-plaintext highlighter-rouge">failureThreshold</code> for the liveness probe to a bigger value (10, for example).</p>
<p>Consider also relaxing the readiness probe in this situation by increasing its timeout. That helped us reduce errors like “SSL_read: shutdown while in init” that we were seeing for Redis connections. They seem to happen when Kubernetes takes the pod out of service due to failing readiness probes during that blast of requests, and then the ongoing requests to other Puma threads in the same process get canceled, which results in 502 errors in addition to the 504 errors.</p>
<h2 id="graceful-shutdown-and-pod-termination">Graceful shutdown and pod termination</h2>
<p>When terminating a pod, Kubernetes first sends a <code class="language-plaintext highlighter-rouge">SIGTERM</code> and, if the pod does not stop within the <code class="language-plaintext highlighter-rouge">terminationGracePeriodSeconds</code>, Kubernetes sends <code class="language-plaintext highlighter-rouge">SIGKILL</code> to forcefully stop it. When Kubernetes terminates a pod, the command to remove the endpoint from the service and the <code class="language-plaintext highlighter-rouge">SIGTERM</code> signal execute in parallel. That can cause some requests to get dropped because the pod is terminating, which results in 502/504 errors.</p>
<p>An easy way to work around that limitation and achieve a <a href="https://learnk8s.io/graceful-shutdown">graceful shutdown</a> is to add a sleep interval before the Puma process stops. To achieve that, we use a <code class="language-plaintext highlighter-rouge">preStop</code> hook and, in our testing, we landed on a sleep interval of 40 seconds, which is enough time for Kubernetes’ <code class="language-plaintext highlighter-rouge">Endpoints Controller</code> to react asynchronously and for <code class="language-plaintext highlighter-rouge">kube-proxy</code> to update <code class="language-plaintext highlighter-rouge">iptables</code> rules. We also increase the <code class="language-plaintext highlighter-rouge">terminationGracePeriodSeconds</code> to 70 seconds, which applies to the total time (preStop hook plus container stop), leaving 30 seconds for Puma to process queued requests before it receives <code class="language-plaintext highlighter-rouge">SIGKILL</code>.</p>
<div class="language-yaml highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="na">terminationGracePeriodSeconds</span><span class="pi">:</span> <span class="m">70</span>
<span class="na">containers</span><span class="pi">:</span>
<span class="pi">-</span> <span class="na">name</span><span class="pi">:</span> <span class="pi">{{</span> <span class="nv">include "application.apps.name" .</span> <span class="pi">}}</span>
<span class="na">image</span><span class="pi">:</span> <span class="pi">{{</span> <span class="nv">include "container.image" .</span> <span class="pi">}}</span>
<span class="na">args</span><span class="pi">:</span> <span class="pi">[</span><span class="s2">"</span><span class="s">bundle</span><span class="nv"> </span><span class="s">exec</span><span class="nv"> </span><span class="s">puma"</span><span class="pi">]</span>
<span class="na">lifecycle</span><span class="pi">:</span>
<span class="na">preStop</span><span class="pi">:</span>
<span class="na">exec</span><span class="pi">:</span>
<span class="na">command</span><span class="pi">:</span> <span class="pi">[</span><span class="s2">"</span><span class="s">sh"</span><span class="pi">,</span> <span class="s2">"</span><span class="s">-c"</span><span class="pi">,</span> <span class="s2">"</span><span class="s">sleep</span><span class="nv"> </span><span class="s">40"</span><span class="pi">]</span>
</code></pre></div></div>
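<p>The timing budget above — a 40-second preStop sleep inside a 70-second grace period — can be sanity-checked with simple arithmetic (illustrative only):</p>

```ruby
# Illustrative arithmetic for the shutdown timing budget described above.
pre_stop_sleep  = 40  # seconds of sleep in the preStop hook
puma_drain_time = 30  # seconds left for Puma to finish queued requests
grace_period    = 70  # terminationGracePeriodSeconds

# The preStop sleep and the container stop share the same grace period,
# so their sum must not exceed it, or the pod gets SIGKILLed early.
budget_ok = (pre_stop_sleep + puma_drain_time) <= grace_period
```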
<h2 id="puma-stats-and-auto-scaling">Puma stats and auto-scaling</h2>
<p>Queue Time is an important metric to monitor and should feed into the auto-scaling configuration. However, AWS ALB does not provide the <a href="https://forums.aws.amazon.com/message.jspa?messageID=396283">X-Request-Start</a> header, so we cannot calculate the queue time dynamically. We can enable, download, and parse load balancer <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/load-balancer-access-logs.html">access logs</a> to calculate the queue time after the fact:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">queue_time</span> <span class="o">=</span> <span class="n">time</span> <span class="o">-</span> <span class="n">request_creation_time</span> <span class="o">-</span> <span class="n">request_processing_time</span> <span class="o">-</span> <span class="n">response_processing_time</span> <span class="o">-</span> <span class="n">target_processing_time</span>
</code></pre></div></div>
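<p>For example, given the timestamps and durations parsed out of a single access log entry (the field names follow the ALB access log format; the helper itself is illustrative), the calculation could look like:</p>

```ruby
require "time"

# Illustrative helper: compute queue time from values parsed out of one
# ALB access log entry, following the formula above. The *_processing_time
# values are in seconds, as in the access logs.
def queue_time(time:, request_creation_time:, request_processing_time:,
               target_processing_time:, response_processing_time:)
  (Time.parse(time) - Time.parse(request_creation_time)) -
    request_processing_time - target_processing_time - response_processing_time
end

queue_time(
  time: "2021-12-27T15:19:10.500000Z",
  request_creation_time: "2021-12-27T15:19:10.000000Z",
  request_processing_time: 0.001,
  target_processing_time: 0.150,
  response_processing_time: 0.001
)
# ~0.348 seconds spent waiting in the queue
```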
<p>We need a dynamic value to use for auto-scaling, and we can calculate the Puma business metric using the following formula:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">puma_business</span> <span class="o">=</span> <span class="p">(</span><span class="mi">1</span> <span class="o">-</span> <span class="n">sum</span><span class="p">(</span><span class="n">pool_capacity</span><span class="p">)</span> <span class="o">/</span> <span class="n">sum</span><span class="p">(</span><span class="n">max_threads</span><span class="p">))</span> <span class="o">*</span> <span class="mi">100</span>
</code></pre></div></div>
<p>These Puma values used in the calculation are available from <a href="https://puma.io/puma/file.stats.html">Puma.stats</a>:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span>
<span class="s2">"started_at"</span><span class="p">:</span> <span class="s2">"2021-12-27T15:19:09Z"</span><span class="p">,</span>
<span class="s2">"backlog"</span><span class="p">:</span> <span class="mi">0</span><span class="p">,</span>
<span class="s2">"running"</span><span class="p">:</span> <span class="mi">3</span><span class="p">,</span>
<span class="s2">"pool_capacity"</span><span class="p">:</span> <span class="mi">4</span><span class="p">,</span>
<span class="s2">"max_threads"</span><span class="p">:</span> <span class="mi">5</span><span class="p">,</span>
<span class="s2">"requests_count"</span><span class="p">:</span> <span class="mi">6</span>
<span class="p">}</span>
</code></pre></div></div>
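<p>As an illustrative sketch (the helper is ours, not part of Puma), aggregating those per-worker stats into the metric above could look like:</p>

```ruby
# Illustrative sketch: compute the metric above from Puma.stats-style
# data collected from each worker (cluster mode reports per-worker
# stats; here they are passed in as an array of hashes).
def puma_business(worker_stats)
  pool_capacity = worker_stats.sum { |s| s["pool_capacity"] }
  max_threads   = worker_stats.sum { |s| s["max_threads"] }
  (1 - pool_capacity.to_f / max_threads) * 100
end

puma_business([
  { "pool_capacity" => 4, "max_threads" => 5 },
  { "pool_capacity" => 1, "max_threads" => 5 },
])
# => 50.0
```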
<h2 id="load-balancing">Load balancing</h2>
<p>Regarding load balancing, we need to consider whether to run Puma in single or cluster mode. The advantage of Puma cluster mode is that it can better deal with slow, CPU-bound responses because the queue is shared between more than one worker. Puma will route requests to worker processes that have the capacity, yielding <a href="https://www.speedshop.co/2015/07/29/scaling-ruby-apps-to-1000-rpm.html">better queue time</a>.</p>
<p>AWS ALB supports the <a href="https://aws.amazon.com/about-aws/whats-new/2019/11/application-load-balancer-now-supports-least-outstanding-requests-algorithm-for-load-balancing-requests/">Least Outstanding Requests</a> algorithm for load balancing requests in addition to the default Round Robin algorithm. The Least Outstanding Requests algorithm is not ideal when there is a problematic pod that quickly returns error responses: all upcoming requests get routed to it unless we have quick health checks that react to the failing target.</p>
<h2 id="conclusion">Conclusion</h2>
<p>Deploying Puma and tuning its performance to adequately provision resources involves lots of details to consider and analyze. Warming up slow clients, tuning keep-alive timeouts, graceful shutdowns, and optimizing the backlog queue size are essential to ensure the service can respond to high loads with minimal latency and without interruption. Loading SSL certificates directly from Secrets Manager, end-to-end SSL encryption in transit, and implementing application-layer encryption are required to secure customer data in the cloud. In addition to ALB monitoring, monitoring Puma metrics would be great to have. We’ll be exploring using <a href="https://prometheus.io/">Prometheus</a> for monitoring Puma metrics and configuring auto-scaling based on the Puma business metric. In the absence of such monitoring, analyzing access logs could bring useful insights and ideas on what to tweak next.</p>This blog post was originally published on the GoDaddy Engineering Blog.Distributed cron for Rails apps with Sidekiq Scheduler2018-10-15T21:00:00+00:002018-10-15T21:00:00+00:00https://dalibornasevic.com/posts/distributed-cron-for-rails-apps-with-sidekiq-scheduler<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2018/10/15/distributed-cron-for-rails-apps-with-sidekiq-scheduler/">GoDaddy Engineering Blog</a>.</em></p>
<p style="text-align: center">
<img src="/images/sidekiq_scheduler.png" alt="Sidekiq Scheduler" />
</p>
<p>We are heavy users of <a href="https://github.com/mperham/sidekiq">Sidekiq</a>. Sidekiq is a Ruby background jobs processing library that uses Redis for storage and is widely used in Ruby on Rails applications. It has a nice ecosystem that allows extending its functionality with plugins.</p>
<p>One such plugin that helped us run distributed cron, reduce maintenance costs and simplify our deployments is <a href="https://github.com/moove-it/sidekiq-scheduler">Sidekiq Scheduler</a>. We will discuss the motivation for migrating from OS based cron to distributed cron using Sidekiq Scheduler and the benefits we get from it.</p>
<h2 id="our-deployment-setup">Our deployment setup</h2>
<p>We maintain some legacy Ruby on Rails applications along with new Ruby on Rails microservices. We build our new microservices with the <a href="/2018/06/28/amazon-eks/">public cloud</a> in mind and deploy them on <a href="https://kubernetes.io/">Kubernetes</a>. We deploy our legacy applications with <a href="https://github.com/capistrano/capistrano">Capistrano</a> while we work on migrating them to the public cloud. We landed on a strategy for deploying cron jobs that works well for us in both scenarios.</p>
<p>With our standard Capistrano deploys, we deploy an application to web servers that handle web requests and to worker servers that process background jobs.</p>
<p>The web servers deploy is consistent and all running processes are <a href="https://www.phusionpassenger.com/">Phusion Passenger</a> instances. The workers deploy is more complex. Besides deploying the Sidekiq processes, it deploys cron jobs to a specific worker server and depending on the application it might deploy other stand-alone runner processes to specific worker servers.</p>
<h2 id="what-are-the-main-problems-with-this-setup">What are the main problems with this setup?</h2>
<p>There are two main problems with this setup that we want to resolve:</p>
<ol>
<li>
<p>Single point of failure</p>
<p>The crons and the runner processes are each deployed to a specific server. In case of an issue like a network or out-of-memory incident, we risk a partial failure in how the service operates.</p>
</li>
<li>
<p>Running tasks twice at the same time</p>
<p>If a cron job needs to run frequently and has a long processing time, there is nothing to prevent an overlap with the next cron schedule. With experimental canary deploys, human error is possible too, which could result in deploying the crons or the runner process to more than one server.</p>
</li>
</ol>
<h2 id="distributed-cron-with-sidekiq-scheduler">Distributed cron with Sidekiq Scheduler</h2>
<p>Let’s first start with a brief introduction to how Sidekiq Scheduler works and then we will discuss its benefits over OS based cron jobs and look at some of the alternatives.</p>
<p><a href="https://github.com/moove-it/sidekiq-scheduler">Sidekiq Scheduler</a> is a lightweight job scheduling extension for Sidekiq. It uses <a href="https://github.com/jmettraux/rufus-scheduler">Rufus Scheduler</a> under the hood, that is itself an in-memory scheduler.</p>
<p>Sidekiq Scheduler extends Sidekiq by starting a Rufus Scheduler thread in the same process, loading and maintaining the schedules for it. By starting Sidekiq Scheduler in all Sidekiq processes distributed across all hosts, we get a distributed cron solution that resolves the single point of failure issue.</p>
<p>Running Sidekiq Scheduler on multiple hosts could have some <a href="https://github.com/moove-it/sidekiq-scheduler#notes-about-running-on-multiple-hosts">issues</a>. Although we exclusively use the <code class="language-plaintext highlighter-rouge">cron</code> type of schedules, we still pair the cron jobs in Sidekiq Scheduler with a Sidekiq plugin for unique jobs. That covers the uniqueness goal and also guarantees that no duplicate cron job runs at the same time until the previous run finishes successfully.</p>
<p>Each Sidekiq process running Sidekiq Scheduler will first try to register the cron job to get a lock and only then enqueue it. The increased load to Redis when every single process tries to get a lock is acceptable for us because Redis capacity allows for that.</p>
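<p>A toy sketch of that register-then-enqueue behavior (a plain in-memory hash stands in for Redis, and none of these names are the actual plugin API):</p>

```ruby
# Toy illustration of "register the cron job to get a lock, then enqueue":
# an in-memory hash stands in for Redis (think SET key value NX PX <ttl>),
# and this is not the real unique-jobs plugin API. With many scheduler
# threads racing, only the first to claim the lock enqueues the job.
class ToyUniqueEnqueuer
  def initialize
    @locks = {}   # stand-in for Redis lock keys
    @queue = []
  end

  attr_reader :queue

  def enqueue_unique(job_class, lock_key)
    return false if @locks.key?(lock_key)  # another process holds the lock
    @locks[lock_key] = Time.now
    @queue << job_class
    true
  end
end

enqueuer = ToyUniqueEnqueuer.new
enqueuer.enqueue_unique("ActiveMailingsWorker", "cron:active_mailings")  # => true
enqueuer.enqueue_unique("ActiveMailingsWorker", "cron:active_mailings")  # => false (already locked)
enqueuer.queue  # => ["ActiveMailingsWorker"]
```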
<h2 id="configuring-and-using-sidekiq-scheduler">Configuring and using Sidekiq Scheduler</h2>
<p>We have a custom config for Sidekiq Scheduler that allows for more control over sharing configs between environments. In an initializer, we require <code class="language-plaintext highlighter-rouge">sidekiq-scheduler</code> and its UI component and configure the Sidekiq server:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1"># config/initializers/sidekiq.rb</span>
<span class="nb">require</span> <span class="s1">'sidekiq'</span>
<span class="nb">require</span> <span class="s1">'sidekiq/web'</span>
<span class="nb">require</span> <span class="s1">'sidekiq-scheduler'</span>
<span class="nb">require</span> <span class="s1">'sidekiq-scheduler/web'</span>
<span class="no">Sidekiq</span><span class="p">.</span><span class="nf">configure_server</span> <span class="k">do</span> <span class="o">|</span><span class="n">config</span><span class="o">|</span>
<span class="n">config</span><span class="p">.</span><span class="nf">on</span><span class="p">(</span><span class="ss">:startup</span><span class="p">)</span> <span class="k">do</span>
<span class="no">SidekiqScheduler</span><span class="o">::</span><span class="no">Scheduler</span><span class="p">.</span><span class="nf">instance</span><span class="p">.</span><span class="nf">rufus_scheduler_options</span> <span class="o">=</span> <span class="p">{</span> <span class="ss">max_work_threads: </span><span class="mi">1</span> <span class="p">}</span>
<span class="no">Sidekiq</span><span class="p">.</span><span class="nf">schedule</span> <span class="o">=</span> <span class="no">ConfigParser</span><span class="p">.</span><span class="nf">parse</span><span class="p">(</span><span class="no">File</span><span class="p">.</span><span class="nf">join</span><span class="p">(</span><span class="no">Rails</span><span class="p">.</span><span class="nf">root</span><span class="p">,</span> <span class="s2">"config/sidekiq_scheduler.yml"</span><span class="p">),</span> <span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="p">)</span>
<span class="no">SidekiqScheduler</span><span class="o">::</span><span class="no">Scheduler</span><span class="p">.</span><span class="nf">instance</span><span class="p">.</span><span class="nf">reload_schedule!</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Rufus Scheduler starts <a href="https://github.com/moove-it/sidekiq-scheduler#notes-about-connection-pooling">28 threads</a> by default. Because its job is only to enqueue Sidekiq jobs and Sidekiq workers will do the actual execution, we can decrease the <code class="language-plaintext highlighter-rouge">max_work_threads</code> to 1.</p>
<p><code class="language-plaintext highlighter-rouge">ConfigParser.parse</code> is a small utility function that converts the YAML config to a hash:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">require</span> <span class="s1">'yaml'</span>
<span class="nb">require</span> <span class="s1">'erb'</span>
<span class="k">class</span> <span class="nc">ConfigParser</span>
<span class="k">def</span> <span class="nc">self</span><span class="o">.</span><span class="nf">parse</span><span class="p">(</span><span class="n">file</span><span class="p">,</span> <span class="n">environment</span><span class="p">)</span>
<span class="no">YAML</span><span class="p">.</span><span class="nf">load</span><span class="p">(</span><span class="no">ERB</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="no">IO</span><span class="p">.</span><span class="nf">read</span><span class="p">(</span><span class="n">file</span><span class="p">)).</span><span class="nf">result</span><span class="p">)[</span><span class="n">environment</span><span class="p">]</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
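<p>A self-contained usage sketch of that parser (the class is repeated so the example runs on its own; the YAML content is illustrative):</p>

```ruby
require "yaml"
require "erb"
require "tempfile"

# ConfigParser as shown above, repeated here for a runnable example.
class ConfigParser
  def self.parse(file, environment)
    YAML.load(ERB.new(IO.read(file)).result)[environment]
  end
end

# Write an illustrative schedule config and parse one environment from it.
Tempfile.create(["sidekiq_scheduler", ".yml"]) do |f|
  f.write(<<~YAML)
    production:
      active_mailings:
        class: ActiveMailingsWorker
        cron: '*/10 * * * * * America/Phoenix'
  YAML
  f.flush
  schedule = ConfigParser.parse(f.path, "production")
  schedule["active_mailings"]["class"]  # => "ActiveMailingsWorker"
end
```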
<p>Sidekiq Scheduler config looks like this:</p>
<div class="language-yaml highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1"># config/sidekiq_scheduler.yml</span>
<span class="na">default</span><span class="pi">:</span> <span class="nl">&default</span>
<span class="na">active_mailings</span><span class="pi">:</span>
<span class="na">class</span><span class="pi">:</span> <span class="s">ActiveMailingsWorker</span>
<span class="na">cron</span><span class="pi">:</span> <span class="s1">'</span><span class="s">*/10</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">America/Phoenix'</span>
<span class="na">scheduled_mailings</span><span class="pi">:</span>
<span class="na">class</span><span class="pi">:</span> <span class="s">ScheduledMailingsWorker</span>
<span class="na">cron</span><span class="pi">:</span> <span class="s1">'</span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">*</span><span class="nv"> </span><span class="s">America/Phoenix'</span>
<span class="na">development</span><span class="pi">:</span>
<span class="s"><<</span><span class="pi">:</span> <span class="nv">*default</span>
<span class="na">staging</span><span class="pi">:</span>
<span class="s"><<</span><span class="pi">:</span> <span class="nv">*default</span>
<span class="na">production</span><span class="pi">:</span>
<span class="s"><<</span><span class="pi">:</span> <span class="nv">*default</span>
</code></pre></div></div>
<p>Rufus Scheduler allows for seconds precision with an optional cron expression format consisting of a six fields time specifier where the first one is for the seconds. Per that config example, we specify a run of <code class="language-plaintext highlighter-rouge">ActiveMailingsWorker</code> every 10 seconds and a run of <code class="language-plaintext highlighter-rouge">ScheduledMailingsWorker</code> every minute.</p>
<p>By default, when no timezone is set with the cron string, it uses the Rails’ configured timezone in <code class="language-plaintext highlighter-rouge">config/application.rb</code>. We have an option to change it if we need to.</p>
<p>The scheduled tasks are standard Sidekiq workers:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">ActiveMailingsWorker</span>
<span class="kp">include</span> <span class="no">Sidekiq</span><span class="o">::</span><span class="no">Worker</span>
<span class="n">sidekiq_options</span> <span class="ss">queue: :cron</span><span class="p">,</span> <span class="ss">unique_for: </span><span class="mi">30</span><span class="p">.</span><span class="nf">minutes</span>
<span class="k">def</span> <span class="nf">perform</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<h2 id="benefits-of-using-sidekiq-scheduler-vs-os-based-cron-jobs">Benefits of using Sidekiq Scheduler vs OS based cron jobs</h2>
<p>There are some other benefits of using Sidekiq Scheduler vs OS based cron jobs that are worth discussing:</p>
<ol>
<li>
<p>No process bootup wait time</p>
<p>Each time OS based cron jobs run, it takes time for the process to boot up before it executes. Depending on the app size, it could take from seconds to minutes. That means the cron execution is always delayed. With Sidekiq Scheduler, the scheduler is an already-running thread inside the Sidekiq process, so there are no boot-up delays.</p>
</li>
<li>
<p>Seconds precision</p>
<p>The shortest interval at which an OS based cron job can run is one minute. Because Rufus Scheduler runs in-memory, it can schedule jobs every second.</p>
</li>
<li>
<p>Error monitoring</p>
<p>When OS based cron jobs fail, we can log errors to log files and remember to check them later. With Sidekiq Scheduler, the cron jobs are normal Sidekiq jobs and the standard Sidekiq UI and application error monitoring mechanisms apply.</p>
</li>
<li>
<p>Consistency</p>
<p>We can write <code class="language-plaintext highlighter-rouge">rake</code> tasks, custom scripts or rails runners and configure the OS based cron jobs to call them. While there are ways to test all these types of tasks, it’s more consistent when we define cron jobs as normal Sidekiq workers.</p>
</li>
<li>
<p>Run it everywhere</p>
<p>Cron jobs run as part of Sidekiq workers, and that makes it easy to deploy them in different environments, from production and staging to running the cron jobs locally.</p>
</li>
</ol>
<h2 id="converting-runner-processes-to-sidekiq-scheduler">Converting runner processes to Sidekiq Scheduler</h2>
<p>Our runner processes are responsible for operations like booting up scheduled mailings, throttling operations or sending mailing batches. These tasks need to run more frequently than once a minute, which is the minimum frequency for OS based cron jobs.</p>
<p>Rufus Scheduler allows for seconds-level frequency, so we can convert these runner processes into normal Sidekiq jobs scheduled and enqueued by Sidekiq Scheduler. With that, the workers deploy becomes as consistent and simple as the app deploy, and all running instances are Sidekiq workers.</p>
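<p>For example, a schedule with seconds-level frequency can be set up from an initializer. The sketch below is illustrative only: the file path and worker names are ours, not from this post, and sidekiq-scheduler equally accepts this configuration from <code class="language-plaintext highlighter-rouge">sidekiq.yml</code>:</p>

```ruby
# config/initializers/sidekiq_scheduler.rb (hypothetical path and worker names)
Sidekiq.configure_server do |config|
  config.on(:startup) do
    # Rufus Scheduler accepts 'every' durations like '10s' and
    # 6-field crontab expressions with a seconds column.
    Sidekiq.schedule = {
      'throttle_sendings' => {
        'every' => '10s',
        'class' => 'ThrottleSendingsWorker'
      },
      'boot_scheduled_mailings' => {
        'cron'  => '*/30 * * * * *', # every 30 seconds
        'class' => 'BootScheduledMailingsWorker'
      }
    }
    SidekiqScheduler::Scheduler.instance.reload_schedule!
  end
end
```

The workers referenced in the schedule are plain Sidekiq workers, so the unique-jobs plugin and error monitoring discussed above apply to them unchanged.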
<h2 id="look-at-some-alternatives">Look at some alternatives</h2>
<ul>
<li>
<p>An alternative solution is the Sidekiq Enterprise <a href="https://github.com/mperham/sidekiq/wiki/Ent-Periodic-Jobs">Periodic Jobs</a> feature. It uses the standard crontab format, which does not support seconds-level frequency, though its <a href="https://github.com/mperham/sidekiq/wiki/Ent-Leader-Election">Leader Election</a> feature can help implement a custom seconds-level schedule.</p>
</li>
<li>
<p><a href="https://github.com/ondrejbartas/sidekiq-cron">Sidekiq Cron</a> is another valid alternative. It uses Sidekiq’s internal <code class="language-plaintext highlighter-rouge">Sidekiq::Poller</code> and has fewer dependencies, but it also does not support seconds-level frequency.</p>
</li>
<li>
<p><a href="https://kubernetes.io/docs/concepts/workloads/controllers/cron-jobs/">Kubernetes Cron Jobs</a> are another alternative when deploying to Kubernetes, but their documented <a href="https://kubernetes.io/docs/concepts/workloads/controllers/cron-jobs/#cron-job-limitations">limitations</a>, long bootup process, and lack of seconds-level frequency make them less than ideal.</p>
</li>
</ul>
<h2 id="final-thoughts">Final thoughts</h2>
<p>We have been running Sidekiq Scheduler in production for a few months and it’s working reliably. We use the <code class="language-plaintext highlighter-rouge">cron</code> type of schedules exclusively and we use a Sidekiq plugin for unique jobs that guards us against the <a href="https://github.com/moove-it/sidekiq-scheduler#notes-about-running-on-multiple-hosts">potential of duplicate jobs</a>.</p>This blog post was originally published on the GoDaddy Engineering Blog.Implementing a custom Redis and in-memory bloom filter2018-09-11T20:00:00+00:002018-09-11T20:00:00+00:00https://dalibornasevic.com/posts/redis-ruby-bloom-filter<p><em>This blog post was originally published on the <a href="https://www.godaddy.com/engineering/2018/09/11/redis-ruby-bloom-filter/">GoDaddy Engineering Blog</a>.</em></p>
<p style="text-align: center">
<img src="/images/bloom_filter.png" alt="Bloom Filter" />
</p>
<p>In our email marketing and delivery products (<a href="https://www.godaddy.com/online-marketing/email-marketing">GoDaddy Email Marketing</a> and <a href="https://madmimi.com">Mad Mimi</a>) we deal with lots of data and work with some interesting data structures like bloom filters. We made an optimization that involved replacing an old bloom filter built in-memory and stored on Amazon S3 with a combination of a Redis bloom filter and an in-memory bloom filter. In this blog post we’ll go through the reasoning for this change as well as the details of the bloom filter implementation we landed on. Let’s first start with a brief introduction to bloom filters.</p>
<h3 id="what-is-a-bloom-filter">What is a bloom filter?</h3>
<p><a href="https://en.wikipedia.org/wiki/Bloom_filter">A Bloom filter</a> is a space-efficient probabilistic data structure, designed to test whether an element is a member of a set. Because of its probabilistic nature, it can guess if an element is in a set with a certain precision or tell for sure if an element is not in a set. That is an important detail to design around as we’ll see later. If you’re curious about the math involved, check out this <a href="https://www.igvita.com/2008/12/27/scalable-datasets-bloom-filters-in-ruby/">blog post</a> for more details.</p>
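<p>To make this asymmetry concrete, here is a toy sketch (illustrative only, not the implementation used later in this post): a lookup that finds any unset bit is a guaranteed “no”, while a lookup where all bits are set is only a probable “yes”.</p>

```ruby
require 'zlib'

# Toy bloom filter: k hash functions set/check k bits in a fixed-size bit
# array. If any probed bit is 0 the element was definitely never added; if
# all are 1 the element was *probably* added (false positives are possible).
class ToyBloomFilter
  def initialize(size: 64, hash_count: 3)
    @size = size
    @hash_count = hash_count
    @bits = Array.new(size, 0)
  end

  def indexes_for(key)
    @hash_count.times.map { |i| Zlib.crc32("#{key}:#{i}") % @size }
  end

  def add(key)
    indexes_for(key).each { |i| @bits[i] = 1 }
  end

  def include?(key)
    indexes_for(key).all? { |i| @bits[i] == 1 }
  end
end

filter = ToyBloomFilter.new
filter.include?('user1@example.com') # => false: an empty filter rejects everything
filter.add('user1@example.com')
filter.include?('user1@example.com') # => true: members are always found
```

Note that nothing is ever removed: a standard bloom filter supports only inserts and membership checks, which is exactly what the "history of delivered emails" use case below needs.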
<h3 id="what-is-the-real-problem-we-are-solving">What is the real problem we are solving?</h3>
<p>In our email delivery products, each plan places a limit on the number of unique contacts our customers can send emails to in a billing cycle. An interesting abuse scenario happens when a customer uploads a list of email addresses, sends a campaign to that list, deletes the list, and then imports another list with different email addresses and sends another campaign. We call this scenario “deleting and replacing” and to prevent it we need to keep a history of contacts that have received emails in a billing cycle.</p>
<h3 id="the-naive-solution">The naive solution</h3>
<p>The naive solution would be to check against the history of delivered emails in a billing cycle. While that might work for smaller data sets, it causes a performance problem when dealing with billions of contacts. That is where the opportunity for using the bloom filter data structure emerges.</p>
<h3 id="initial-bloom-filter-implementation">Initial bloom filter implementation</h3>
<p>Initially, we used the C-implementation from <a href="https://github.com/igrigorik/bloomfilter-rb">bloomfilter-rb</a> by building a bloom filter in-memory and uploading it to Amazon S3.</p>
<p>There were issues with this approach, the two most important ones being:</p>
<ul>
<li>concurrency: sending multiple campaigns at the same time overrides the filter</li>
<li>slow updates / restricted to bulk updates: fetching files from S3 is slow, and updating the filter for one-off sends is too expensive to be practical</li>
</ul>
<p>With the re-design, we need a solution that will solve these problems.</p>
<h3 id="bloom-filter-implementation">Bloom filter implementation</h3>
<p>Our bloom filter will have as a dependency our <code class="language-plaintext highlighter-rouge">User</code> model. Let’s say the <code class="language-plaintext highlighter-rouge">User</code> model has three attributes: <code class="language-plaintext highlighter-rouge">id</code>, <code class="language-plaintext highlighter-rouge">max_contacts</code> and <code class="language-plaintext highlighter-rouge">billing_cycle_started_at</code>:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">User</span> <span class="o">=</span> <span class="no">Struct</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="ss">:id</span><span class="p">,</span> <span class="ss">:max_contacts</span><span class="p">,</span> <span class="ss">:billing_cycle_started_at</span><span class="p">)</span>
<span class="n">user</span> <span class="o">=</span> <span class="no">User</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="mi">1</span><span class="p">,</span> <span class="mi">500</span><span class="p">,</span> <span class="no">Time</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="mi">2018</span><span class="p">,</span> <span class="mi">8</span><span class="p">,</span> <span class="mo">01</span><span class="p">,</span> <span class="mi">10</span><span class="p">,</span> <span class="mi">0</span><span class="p">,</span> <span class="mi">0</span><span class="p">,</span> <span class="mi">0</span><span class="p">))</span>
</code></pre></div></div>
<p>Here is our bloom filter implementation:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">require</span> <span class="s1">'zlib'</span>
<span class="k">class</span> <span class="nc">BloomFilter</span>
<span class="c1"># http://www.igvita.com/2008/12/27/scalable-datasets-bloom-filters-in-ruby/</span>
<span class="c1"># 10 bits for 1% error approximation</span>
<span class="c1"># ~5 bits per 10 fold reduction in error approximation</span>
<span class="no">BITS_PER_ERROR_RATE</span> <span class="o">=</span> <span class="p">{</span>
<span class="mi">1</span> <span class="o">=></span> <span class="mi">10</span><span class="p">,</span>
<span class="mf">0.1</span> <span class="o">=></span> <span class="mi">15</span><span class="p">,</span>
<span class="mf">0.01</span> <span class="o">=></span> <span class="mi">20</span>
<span class="p">}</span>
<span class="no">HASH_FUNCTIONS_COEFFICIENT</span> <span class="o">=</span> <span class="mf">0.7</span> <span class="c1"># Math.log(2)</span>
<span class="nb">attr_reader</span> <span class="ss">:error_rate</span>
<span class="k">def</span> <span class="nf">initialize</span><span class="p">(</span><span class="n">user</span><span class="p">,</span> <span class="ss">error_rate: </span><span class="p">)</span>
<span class="vi">@user</span> <span class="o">=</span> <span class="n">user</span>
<span class="vi">@error_rate</span> <span class="o">=</span> <span class="n">error_rate</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">indexes_for</span><span class="p">(</span><span class="n">key</span><span class="p">)</span>
<span class="n">hash_functions</span><span class="p">.</span><span class="nf">times</span><span class="p">.</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">i</span><span class="o">|</span> <span class="no">Zlib</span><span class="p">.</span><span class="nf">crc32</span><span class="p">(</span><span class="s2">"</span><span class="si">#{</span><span class="n">key</span><span class="p">.</span><span class="nf">to_s</span><span class="p">.</span><span class="nf">strip</span><span class="p">.</span><span class="nf">downcase</span><span class="si">}</span><span class="s2">:</span><span class="si">#{</span><span class="n">i</span><span class="o">+</span><span class="n">seed</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span> <span class="o">%</span> <span class="n">size</span> <span class="p">}</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">hash_functions</span>
<span class="vi">@hash_functions</span> <span class="o">||=</span> <span class="p">(</span><span class="n">bits</span> <span class="o">*</span> <span class="no">HASH_FUNCTIONS_COEFFICIENT</span><span class="p">).</span><span class="nf">ceil</span><span class="p">.</span><span class="nf">to_i</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">seed</span>
<span class="vi">@seed</span> <span class="o">||=</span> <span class="n">since</span><span class="p">.</span><span class="nf">to_i</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">since</span>
<span class="vi">@since</span> <span class="o">||=</span> <span class="vi">@user</span><span class="p">.</span><span class="nf">billing_cycle_started_at</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">size</span>
<span class="vi">@size</span> <span class="o">||=</span> <span class="n">bits</span> <span class="o">*</span> <span class="vi">@user</span><span class="p">.</span><span class="nf">max_contacts</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">bits</span>
<span class="vi">@bits</span> <span class="o">||=</span> <span class="no">BITS_PER_ERROR_RATE</span><span class="p">.</span><span class="nf">fetch</span><span class="p">(</span><span class="n">error_rate</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">fingerprint</span>
<span class="vi">@fingerprint</span> <span class="o">||=</span> <span class="p">[</span><span class="vi">@user</span><span class="p">.</span><span class="nf">id</span><span class="p">,</span> <span class="vi">@user</span><span class="p">.</span><span class="nf">max_contacts</span><span class="p">,</span> <span class="n">seed</span><span class="p">].</span><span class="nf">join</span><span class="p">(</span><span class="s1">'.'</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>The most important part of the bloom filter is the method that generates the indexes for a given key, <code class="language-plaintext highlighter-rouge">indexes_for(key)</code>.</p>
<p>Here’s an example usage:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">bloom_filter</span> <span class="o">=</span> <span class="no">BloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">user</span><span class="p">,</span> <span class="ss">error_rate: </span><span class="mi">1</span><span class="p">)</span>
<span class="n">bloom_filter</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="s1">'user1@example.com'</span><span class="p">)</span>
<span class="c1"># [2872, 110, 3108, 2498, 4409, 751, 2861]</span>
<span class="n">bloom_filter</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="s1">'user2@example.com'</span><span class="p">)</span>
<span class="c1"># [3992, 2262, 1788, 1970, 3185, 4135, 4957]</span>
</code></pre></div></div>
<p>As a hashing function we use <a href="https://en.wikipedia.org/wiki/Cyclic_redundancy_check">CRC32</a> with a custom seed per user that is the <code class="language-plaintext highlighter-rouge">billing_cycle_started_at</code> and the number of hashing functions based on the error rate (in this example we use an error rate of 1%).</p>
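<p>The index derivation is deterministic for a given seed, so the same normalized email always maps to the same bits within a billing cycle, while a new billing cycle (a new seed) effectively starts a fresh filter. Here is a standalone sketch of that property (the seed and size values are illustrative, not from this post):</p>

```ruby
require 'zlib'

# Standalone version of the indexes_for derivation shown above: normalize the
# key, then derive one CRC32 index per hash function, salted with the seed.
def crc32_indexes(key, seed:, size:, hash_count: 7)
  hash_count.times.map do |i|
    Zlib.crc32("#{key.to_s.strip.downcase}:#{i + seed}") % size
  end
end

cycle1 = 1_533_110_400 # e.g. billing_cycle_started_at.to_i for one cycle
cycle2 = 1_535_788_800 # a later billing cycle => a different seed

a = crc32_indexes('user1@example.com',  seed: cycle1, size: 5000)
b = crc32_indexes('User1@Example.com ', seed: cycle1, size: 5000)
c = crc32_indexes('user1@example.com',  seed: cycle2, size: 5000)
# a == b: strip + downcase normalization makes lookups whitespace- and
#         case-insensitive
# a != c: reseeding gives an independent set of indexes per billing cycle
```

This is why the seed must stay fixed for the whole billing cycle: any change to it would silently invalidate all the bits already set in the filter.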
<p>For the bloom filter to return consistent hashing indexes during a user’s billing cycle, the input parameters it depends on (<code class="language-plaintext highlighter-rouge">error_rate</code>, <code class="language-plaintext highlighter-rouge">@user.billing_cycle_started_at</code> and <code class="language-plaintext highlighter-rouge">@user.max_contacts</code>) should not change for the billing cycle until it gets reset. That is the <code class="language-plaintext highlighter-rouge">fingerprint</code> that, as we’ll see later, we’ll use as a redis key for the Redis bloom filter.</p>
<h3 id="redis-bloom-filter">Redis bloom filter</h3>
<p>Redis supports <code class="language-plaintext highlighter-rouge">getbit</code> and <code class="language-plaintext highlighter-rouge">setbit</code> operations for the <a href="https://redis.io/commands#string">String</a> type that make the individual updates simple. There is a special data type for bloom filters called <a href="https://redislabs.com/blog/rebloom-bloom-filter-datatype-redis/">rebloom</a> if you want to explore it, but here we’ll just use a standard Redis type.</p>
<p>Here is our Redis bloom filter implementation:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">require</span> <span class="s1">'redis'</span>
<span class="k">class</span> <span class="nc">RedisBloomFilter</span>
<span class="no">MAX_TTL</span> <span class="o">=</span> <span class="mi">31</span> <span class="o">*</span> <span class="mi">24</span> <span class="o">*</span> <span class="mi">60</span> <span class="o">*</span> <span class="mi">60</span> <span class="c1"># 31 days (max days in a month) in seconds</span>
<span class="k">def</span> <span class="nf">initialize</span><span class="p">(</span><span class="n">user</span><span class="p">)</span>
<span class="vi">@user</span> <span class="o">=</span> <span class="n">user</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">insert</span><span class="p">(</span><span class="n">keys</span><span class="p">)</span>
<span class="n">existing_indexes</span> <span class="o">=</span> <span class="n">redis</span><span class="p">.</span><span class="nf">pipelined</span> <span class="k">do</span>
<span class="n">keys</span><span class="p">.</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="n">key</span><span class="o">|</span>
<span class="n">bloom</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="n">key</span><span class="p">).</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">index</span><span class="o">|</span> <span class="n">redis</span><span class="p">.</span><span class="nf">setbit</span><span class="p">(</span><span class="n">filter_key</span><span class="p">,</span> <span class="n">index</span><span class="p">,</span> <span class="mi">1</span><span class="p">)</span> <span class="p">}</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">new_keys_count</span> <span class="o">=</span> <span class="n">keys</span><span class="p">.</span><span class="nf">length</span><span class="p">.</span><span class="nf">times</span><span class="p">.</span><span class="nf">count</span> <span class="p">{</span> <span class="o">|</span><span class="n">i</span><span class="o">|</span>
<span class="n">existing_indexes</span><span class="p">[</span><span class="n">i</span> <span class="o">*</span> <span class="n">bloom</span><span class="p">.</span><span class="nf">hash_functions</span><span class="p">,</span> <span class="n">bloom</span><span class="p">.</span><span class="nf">hash_functions</span><span class="p">].</span><span class="nf">include?</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="p">}</span>
<span class="n">total</span> <span class="o">=</span> <span class="n">redis</span><span class="p">.</span><span class="nf">incrby</span><span class="p">(</span><span class="n">counter_key</span><span class="p">,</span> <span class="n">new_keys_count</span><span class="p">)</span>
<span class="k">if</span> <span class="n">total</span> <span class="o">==</span> <span class="n">new_keys_count</span>
<span class="n">redis</span><span class="p">.</span><span class="nf">expire</span><span class="p">(</span><span class="n">filter_key</span><span class="p">,</span> <span class="no">MAX_TTL</span><span class="p">.</span><span class="nf">to_i</span><span class="p">)</span>
<span class="n">redis</span><span class="p">.</span><span class="nf">expire</span><span class="p">(</span><span class="n">counter_key</span><span class="p">,</span> <span class="no">MAX_TTL</span><span class="p">.</span><span class="nf">to_i</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">count</span>
<span class="n">redis</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="n">counter_key</span><span class="p">).</span><span class="nf">to_i</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">include?</span><span class="p">(</span><span class="n">key</span><span class="p">)</span>
<span class="n">values</span> <span class="o">=</span> <span class="n">redis</span><span class="p">.</span><span class="nf">pipelined</span> <span class="k">do</span>
<span class="n">bloom</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="n">key</span><span class="p">).</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">index</span><span class="o">|</span> <span class="n">redis</span><span class="p">.</span><span class="nf">getbit</span><span class="p">(</span><span class="n">filter_key</span><span class="p">,</span> <span class="n">index</span><span class="p">)</span> <span class="p">}</span>
<span class="k">end</span>
<span class="o">!</span><span class="n">values</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">field</span>
<span class="n">redis</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="n">filter_key</span><span class="p">)</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">redis</span>
<span class="vi">@redis</span> <span class="o">||=</span> <span class="no">Redis</span><span class="p">.</span><span class="nf">new</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">bloom</span>
<span class="vi">@bloom</span> <span class="o">||=</span> <span class="no">BloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="vi">@user</span><span class="p">,</span> <span class="ss">error_rate: </span><span class="mi">1</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">filter_key</span>
<span class="vi">@filter_key</span> <span class="o">||=</span> <span class="s2">"bloom:filter:</span><span class="si">#{</span><span class="n">key_suffix</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">counter_key</span>
<span class="vi">@counter_key</span> <span class="o">||=</span> <span class="s2">"bloom:counter:</span><span class="si">#{</span><span class="n">key_suffix</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">key_suffix</span>
<span class="vi">@key_suffix</span> <span class="o">||=</span> <span class="n">bloom</span><span class="p">.</span><span class="nf">fingerprint</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>The <code class="language-plaintext highlighter-rouge">RedisBloomFilter</code> uses the <code class="language-plaintext highlighter-rouge">BloomFilter</code> implementation to produce the indexes that it manipulates in Redis. It also keeps a counter of how many unique elements have been added to the filter, incrementing the count whenever it detects a unique insert. With a 1% error rate for the bloom filter, the count can be up to 1% lower than the actual count, which is fine in our case because we allow a generous grace overage on customer plans. It uses Redis <code class="language-plaintext highlighter-rouge">pipelined</code> to send operations in batches, which avoids round-trip latency and improves performance by about 5-6 times. It also sets TTLs on the keys to expire them after a month, and it exposes the field for the in-memory filter.</p>
<p>Here’s an example usage:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">redis_bloom_filter</span> <span class="o">=</span> <span class="no">RedisBloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">user</span><span class="p">)</span>
<span class="n">redis_bloom_filter</span><span class="p">.</span><span class="nf">insert</span><span class="p">([</span><span class="s1">'user1@example.com'</span><span class="p">,</span> <span class="s1">'user2@example.com'</span><span class="p">])</span>
<span class="n">redis_bloom_filter</span><span class="p">.</span><span class="nf">count</span>
<span class="c1"># => 2</span>
<span class="n">redis_bloom_filter</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="s1">'user1@example.com'</span><span class="p">)</span>
<span class="c1"># => true</span>
<span class="n">redis_bloom_filter</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="s1">'user2@example.com'</span><span class="p">)</span>
<span class="c1"># => true</span>
<span class="n">redis_bloom_filter</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="s1">'user3@example.com'</span><span class="p">)</span>
<span class="c1"># => false</span>
</code></pre></div></div>
<h3 id="in-memory-bloom-filter">In-memory Bloom filter</h3>
<p>With the Redis implementation we solved half of the problem. We have a way to concurrently and quickly add elements to the bloom filter in Redis, but we still need a way to check if a bloom filter could accept a given set of elements without actually inserting the elements in the filter. This is useful when we want to prevent a list import before importing the list or stop a campaign from sending before starting it.</p>
<p>To achieve that, we need an in-memory filter that we can initialize with the state of the Redis bloom filter, and <a href="https://github.com/peterc/bitarray">bitarray</a> can help us with that. We have an important <a href="https://github.com/peterc/bitarray/pull/9">PR</a> that changes the storage representation, i.e., the bit order in bitarray, to match the way Redis stores bits internally, and adds a way to initialize a bitarray with a given field. To test it, you can fetch the <code class="language-plaintext highlighter-rouge">BitArray</code> that includes that patch from <a href="https://gist.github.com/dalibor/70b9f118b545880ece6381513e0123d2">here</a>.</p>
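<p>For reference, Redis treats bit 0 as the most significant bit of the first byte of the string, which is the ordering the bitarray patch has to match. The following pure-Ruby sketch (the helper names are ours, not part of the patch or of Redis) mimics the <code class="language-plaintext highlighter-rouge">SETBIT</code>/<code class="language-plaintext highlighter-rouge">GETBIT</code> semantics:</p>

```ruby
# Mimic Redis SETBIT: bit 0 is the most significant bit of byte 0, growing
# the string with zero bytes as needed. Returns the (possibly new) string.
def setbit(str, index, value)
  byte, offset = index.divmod(8)
  str = str.ljust(byte + 1, "\0") if str.bytesize <= byte
  current = str.getbyte(byte)
  mask = 1 << (7 - offset) # MSB-first, matching Redis
  str.setbyte(byte, value == 1 ? current | mask : current & ~mask)
  str
end

# Mimic Redis GETBIT: bits past the end of the string read as 0.
def getbit(str, index)
  byte, offset = index.divmod(8)
  current = str.getbyte(byte) || 0
  (current >> (7 - offset)) & 1
end

field = setbit(+"", 0, 1) # sets the MSB of byte 0 => "\x80"
```

Because the in-memory bit order matches this layout, the raw field returned by a Redis <code class="language-plaintext highlighter-rouge">GET</code> can seed the in-memory filter directly, with zero-byte padding up to the filter size.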
<p>Here is the implementation of the in-memory bloom filter:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">TemporaryBloomFilter</span>
<span class="k">def</span> <span class="nf">initialize</span><span class="p">(</span><span class="n">user</span><span class="p">)</span>
<span class="vi">@user</span> <span class="o">=</span> <span class="n">user</span>
<span class="vi">@bloom</span> <span class="o">=</span> <span class="no">BloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="vi">@user</span><span class="p">,</span> <span class="ss">error_rate: </span><span class="mi">1</span><span class="p">)</span>
<span class="vi">@redis_filter</span> <span class="o">=</span> <span class="no">RedisBloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="vi">@user</span><span class="p">)</span>
<span class="vi">@count</span> <span class="o">=</span> <span class="vi">@redis_filter</span><span class="p">.</span><span class="nf">count</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">count</span>
<span class="vi">@count</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">insert</span><span class="p">(</span><span class="n">keys</span><span class="p">)</span>
<span class="n">keys</span><span class="p">.</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="n">key</span><span class="o">|</span>
<span class="n">previous_indexes</span> <span class="o">=</span> <span class="vi">@bloom</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="n">key</span><span class="p">).</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">index</span><span class="o">|</span>
<span class="n">value</span> <span class="o">=</span> <span class="n">bit_array</span><span class="p">[</span><span class="n">index</span><span class="p">]</span>
<span class="n">bit_array</span><span class="p">[</span><span class="n">index</span><span class="p">]</span> <span class="o">=</span> <span class="mi">1</span>
<span class="n">value</span>
<span class="p">}</span>
<span class="vi">@count</span> <span class="o">+=</span> <span class="mi">1</span> <span class="k">if</span> <span class="n">previous_indexes</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">include?</span><span class="p">(</span><span class="n">key</span><span class="p">)</span>
<span class="o">!</span><span class="vi">@bloom</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="n">key</span><span class="p">).</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">index</span><span class="o">|</span> <span class="n">bit_array</span><span class="p">[</span><span class="n">index</span><span class="p">]</span> <span class="p">}.</span><span class="nf">include?</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">over_limit?</span>
<span class="n">plan_over_limit_count</span> <span class="o">></span> <span class="mi">0</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">plan_over_limit_count</span>
<span class="vi">@count</span> <span class="o">-</span> <span class="vi">@user</span><span class="p">.</span><span class="nf">plan_contacts</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">bit_array</span>
<span class="vi">@bit_array</span> <span class="o">||=</span> <span class="n">prepare_bit_array</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">prepare_bit_array</span>
<span class="n">field</span> <span class="o">=</span> <span class="vi">@redis_filter</span><span class="p">.</span><span class="nf">field</span><span class="p">.</span><span class="nf">to_s</span>
<span class="n">current_field_length</span> <span class="o">=</span> <span class="n">field</span><span class="p">.</span><span class="nf">length</span>
<span class="n">max_field_length</span> <span class="o">=</span> <span class="p">(</span><span class="vi">@bloom</span><span class="p">.</span><span class="nf">size</span> <span class="o">/</span> <span class="mi">8</span> <span class="o">+</span> <span class="mi">1</span><span class="p">)</span>
<span class="k">if</span> <span class="n">current_field_length</span> <span class="o"><</span> <span class="n">max_field_length</span>
<span class="n">field</span> <span class="o">+=</span> <span class="s2">"</span><span class="se">\0</span><span class="s2">"</span> <span class="o">*</span> <span class="p">(</span><span class="n">max_field_length</span> <span class="o">-</span> <span class="n">current_field_length</span><span class="p">)</span>
<span class="k">end</span>
<span class="no">BitArray</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="vi">@bloom</span><span class="p">.</span><span class="nf">size</span><span class="p">,</span> <span class="n">field</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>And an example usage:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">temporary_bloom_filter</span> <span class="o">=</span> <span class="no">TemporaryBloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">user</span><span class="p">)</span>
<span class="n">temporary_bloom_filter</span><span class="p">.</span><span class="nf">insert</span><span class="p">([</span><span class="s1">'user3@example.com'</span><span class="p">,</span> <span class="s1">'user4@example.com'</span><span class="p">,</span> <span class="s1">'user5@example.com'</span><span class="p">])</span>
<span class="n">temporary_bloom_filter</span><span class="p">.</span><span class="nf">count</span>
<span class="c1"># => 5</span>
<span class="n">temporary_bloom_filter</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="s1">'user5@example.com'</span><span class="p">)</span>
<span class="c1"># => true</span>
<span class="n">temporary_bloom_filter</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="s1">'user6@example.com'</span><span class="p">)</span>
<span class="c1"># => false</span>
</code></pre></div></div>
<h3 id="performance">Performance</h3>
<p>Ruby’s in-memory implementation is a few times slower than the C implementation in <a href="https://github.com/igrigorik/bloomfilter-rb">bloomfilter-rb</a>, but it is still fast enough: it can process 1 million items in 5-10 seconds, including calculating the hash functions and doing the BitArray inserts.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">total_items</span> <span class="o">=</span> <span class="mi">1_000_000</span>
<span class="n">t1</span> <span class="o">=</span> <span class="no">Time</span><span class="p">.</span><span class="nf">now</span>
<span class="n">bf</span> <span class="o">=</span> <span class="no">BloomFilter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">user</span><span class="p">,</span> <span class="ss">error_rate: </span><span class="mi">1</span><span class="p">)</span>
<span class="n">ba</span> <span class="o">=</span> <span class="no">BitArray</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">total_items</span><span class="p">)</span>
<span class="n">total_items</span><span class="p">.</span><span class="nf">times</span> <span class="k">do</span> <span class="o">|</span><span class="n">i</span><span class="o">|</span>
<span class="n">bf</span><span class="p">.</span><span class="nf">indexes_for</span><span class="p">(</span><span class="s2">"user</span><span class="si">#{</span><span class="n">i</span><span class="si">}</span><span class="s2">@example.com"</span><span class="p">).</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="n">j</span><span class="o">|</span>
<span class="n">ba</span><span class="p">[</span><span class="n">j</span><span class="p">]</span> <span class="o">=</span> <span class="kp">true</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">t2</span> <span class="o">=</span> <span class="no">Time</span><span class="p">.</span><span class="nf">now</span>
<span class="nb">puts</span> <span class="n">t2</span><span class="o">-</span><span class="n">t1</span>
<span class="c1"># => 7.485282645</span>
</code></pre></div></div>
<p>Redis performance is pretty solid as well. It can handle around 70-80k operations per second, and when using <code class="language-plaintext highlighter-rouge">pipelined</code> mode for our batches of 350, we get 5-6 times more operations:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="err">$</span> <span class="n">redis</span><span class="o">-</span><span class="n">benchmark</span> <span class="o">-</span><span class="n">q</span> <span class="o">-</span><span class="n">n</span> <span class="mi">100000</span> <span class="o">-</span><span class="no">P</span> <span class="mi">350</span>
<span class="no">PING_INLINE</span><span class="p">:</span> <span class="mf">373134.31</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">PING_BULK</span><span class="p">:</span> <span class="mf">421940.94</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">SET</span><span class="p">:</span> <span class="mf">369003.69</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">GET</span><span class="p">:</span> <span class="mf">396825.38</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">INCR</span><span class="p">:</span> <span class="mf">344827.59</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LPUSH</span><span class="p">:</span> <span class="mf">362318.84</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LPOP</span><span class="p">:</span> <span class="mf">389105.06</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">SADD</span><span class="p">:</span> <span class="mf">353356.91</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">SPOP</span><span class="p">:</span> <span class="mf">361010.81</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LPUSH</span> <span class="p">(</span><span class="n">needed</span> <span class="n">to</span> <span class="n">benchmark</span> <span class="no">LRANGE</span><span class="p">):</span> <span class="mf">370370.34</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LRANGE_100</span> <span class="p">(</span><span class="n">first</span> <span class="mi">100</span> <span class="n">elements</span><span class="p">):</span> <span class="mf">61050.06</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LRANGE_300</span> <span class="p">(</span><span class="n">first</span> <span class="mi">300</span> <span class="n">elements</span><span class="p">):</span> <span class="mf">17494.75</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LRANGE_500</span> <span class="p">(</span><span class="n">first</span> <span class="mi">450</span> <span class="n">elements</span><span class="p">):</span> <span class="mf">11043.62</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">LRANGE_600</span> <span class="p">(</span><span class="n">first</span> <span class="mi">600</span> <span class="n">elements</span><span class="p">):</span> <span class="mf">7965.59</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
<span class="no">MSET</span> <span class="p">(</span><span class="mi">10</span> <span class="n">keys</span><span class="p">):</span> <span class="mf">202839.75</span> <span class="n">requests</span> <span class="n">per</span> <span class="n">second</span>
</code></pre></div></div>
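<p>To make the batching concrete, here is a sketch of how pipelining inserts in batches of 350 might look. The client classes, key name, and recorded <code>setbit</code> commands below are stand-ins so the example runs without a Redis server; with the real <code>redis-rb</code> gem you would call <code>pipelined</code> with the same block shape.</p>

```ruby
# Stand-in objects that record commands instead of talking to Redis,
# so the batching pattern can be shown without a server.
class FakePipeline
  attr_reader :commands

  def initialize
    @commands = []
  end

  def setbit(key, offset, value)
    @commands << [:setbit, key, offset, value]
  end
end

class FakeRedis
  attr_reader :batches

  def initialize
    @batches = []
  end

  # Mirrors the shape of redis-rb's pipelined block interface.
  def pipelined
    pipeline = FakePipeline.new
    yield pipeline
    @batches << pipeline.commands
  end
end

BATCH_SIZE = 350

redis = FakeRedis.new
bit_indexes = (0...1000).to_a

# One network round trip per batch instead of one per command.
bit_indexes.each_slice(BATCH_SIZE) do |batch|
  redis.pipelined do |pipeline|
    batch.each { |i| pipeline.setbit('bloom:bits', i, 1) }
  end
end

puts redis.batches.size # => 3 (batches of 350, 350 and 300 indexes)
```

Each pipelined batch is sent to the server in a single round trip, which is where the 5-6x throughput gain over one-command-per-request comes from.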
<h3 id="conclusion">Conclusion</h3>
<p>This custom implementation of a bloom filter turned out pretty solid and robust in our production environment. We have a Kibana dashboard monitoring the bloom filter updates over time, giving us much better insights than our previous implementation.</p>
Refactoring Rails configs for deploy to Kubernetes
2018-02-27T07:30:00+00:00
https://dalibornasevic.com/posts/refactoring-rails-configs-for-deploy-to-kubernetes
<p>Recently, I worked on a project to containerize one of our Rails apps. The goal was to add per pull request verification deploys to Kubernetes as part of the CICD pipeline. During that work I faced the need to re-design how we manage the configs in the application, and I will share some thoughts about the approach. But before we jump into that, let’s explain the concept of per pull request verification deploys.</p>
<h3 id="per-pull-request-verification-deploys">Per Pull Request verification deploys</h3>
<p>We use Jenkins for Continuous Integration and Continuous Delivery (CICD). Whenever we merge a pull request to the master branch (CI), the pipeline deploys the changes to the target environments (CD). It starts by deploying to staging environments and finishes with a deploy to the production environment.</p>
<p>We introduced another verification step to this pipeline. On each successful pull request build, it deploys the changes to a short-lived location. This temporary deploy is used for QA, manual testing and verification against more realistic data and environment before deploying to production. Once verified, the pull request can be merged to master, which triggers the automated production deploy.</p>
<p>We deploy the app using Capistrano to OpenStack and bare metal servers. For the short-lived verification deploys we decided to explore deploying to a Kubernetes cluster. So, my main goal for the configs refactor was to have a solution that works well for both deploy scenarios.</p>
<h3 id="config-refactor-design-goals">Config refactor design goals</h3>
<ol>
<li>Configs that work in different scenarios:
<ul>
<li>local app</li>
<li>local app using docker containers</li>
<li>local app using docker-compose</li>
<li>Capistrano deploy to OpenStack and bare metal servers</li>
<li>Kubernetes deploy to minikube and real clusters</li>
</ul>
</li>
<li>Flexibility in how configs are defined</li>
</ol>
<p>Some configs like <code class="language-plaintext highlighter-rouge">database.yml</code> and <code class="language-plaintext highlighter-rouge">redis.yml</code> are in YAML format and other are using environment variables. I wanted to keep the flexibility of using YAML configs for the more complex configurations instead of forcing environment variables for everything.</p>
<ol start="3">
<li>Keep everything but the secrets config in source control</li>
</ol>
<p>Managing many config files, especially when deploying and running the app in different ways, increases maintenance complexity. The config files that are not stored in source control need to be made visible to the app during deploy. The goal here is to have at most a single file that’s not in source control. For Capistrano deploys it’s a single shared file with secrets that gets linked during deploy. For Kubernetes deploys it’s a single Secret resource that’s updated on change.</p>
<p>By keeping as much of the configs in source control as possible, we do regular reviews on any config changes before merging to master. This is especially important for much more complex configs like the one we have for <a href="/posts/69-managing-activerecord-connections-with-octoshark">Octoshark</a>, where we connect to around 50 MySQL instances.</p>
<h3 id="using-environment-variables-with-dotenv">Using environment variables with dotenv</h3>
<p>One of the tenets of the <a href="https://12factor.net/">Twelve-Factor app</a> methodology is storing <a href="https://12factor.net/config">configs in the environment</a>. Docker, docker-compose and Kubernetes have built-in ways of passing environment variables to the containers.</p>
<p>The <a href="https://github.com/bkeepers/dotenv">dotenv</a> gem can help us replicate that by loading environment variables from config files. Once we include <code class="language-plaintext highlighter-rouge">dotenv</code> in the Gemfile, all we need to add is the following line to <code class="language-plaintext highlighter-rouge">config/application.rb</code>:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Dotenv</span><span class="p">.</span><span class="nf">overload</span><span class="p">(</span><span class="s2">".env"</span><span class="p">,</span> <span class="s2">".env.</span><span class="si">#{</span><span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="si">}</span><span class="s2">"</span><span class="p">,</span> <span class="s2">".env.</span><span class="si">#{</span><span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="si">}</span><span class="s2">.secrets"</span><span class="p">)</span>
</code></pre></div></div>
<p>Here we use the overload feature of <code class="language-plaintext highlighter-rouge">Dotenv</code>. For production environment for example, it will first load the <code class="language-plaintext highlighter-rouge">.env</code> file, then <code class="language-plaintext highlighter-rouge">.env.production</code> and finally the <code class="language-plaintext highlighter-rouge">.env.production.secrets</code> file.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">.</span><span class="nf">env</span> <span class="c1"># keeps the shared variables across all environments</span>
<span class="p">.</span><span class="nf">env</span><span class="p">.</span><span class="nf">production</span> <span class="c1"># keeps the environment specific variables</span>
<span class="p">.</span><span class="nf">env</span><span class="p">.</span><span class="nf">production</span><span class="p">.</span><span class="nf">secret</span> <span class="c1"># keeps the environment specific secrets</span>
</code></pre></div></div>
<p>The <code class="language-plaintext highlighter-rouge">.env.production.secrets</code> file is the one that’s ignored in source control and it is used to keep the secrets as well as other configuration values that change between environments.</p>
<p>In the context of containers, we have the flexibility to override any of these environment variables which makes this config strategy work in both scenarios.</p>
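<p>To make the precedence concrete, here is a minimal sketch of the overload behaviour (a simplified <code>KEY=VALUE</code> parser, not the gem itself): each file is merged over the previous ones, so a key defined in the secrets file wins over the same key in <code>.env</code> or <code>.env.production</code>.</p>

```ruby
require 'tmpdir'

# Simplified illustration of Dotenv.overload's precedence:
# files later in the list override keys from earlier files.
def load_env_files(files)
  files.each_with_object({}) do |file, merged|
    next unless File.exist?(file)
    File.readlines(file).each do |line|
      key, value = line.strip.split('=', 2)
      merged[key] = value if key && value
    end
  end
end

result = nil
Dir.mktmpdir do |dir|
  File.write(File.join(dir, '.env'), "REDIS_HOST=localhost\nLOG_LEVEL=info\n")
  File.write(File.join(dir, '.env.production'), "LOG_LEVEL=warn\n")
  File.write(File.join(dir, '.env.production.secrets'), "MYAPP_PASSWORD=s3cret\n")

  files = ['.env', '.env.production', '.env.production.secrets']
  result = load_env_files(files.map { |f| File.join(dir, f) })
end

puts result['LOG_LEVEL']      # => warn (.env.production overrides .env)
puts result['MYAPP_PASSWORD'] # => s3cret (from the secrets file)
```

The variable names here are illustrative; the real gem also handles quoting, comments and variable expansion that this sketch skips.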
<h3 id="using-yaml-configs-with-environment-variables">Using YAML configs with environment variables</h3>
<p>We can use YAML configs like <code class="language-plaintext highlighter-rouge">database.yml</code> with environment variables. We just change it to read the secrets and values that change between environments from environment variables. Where it makes sense we can also use a fallback, i.e. default values. Here’s an example <code class="language-plaintext highlighter-rouge">database.yml</code> config:</p>
<div class="language-yaml highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="na">development</span><span class="pi">:</span>
<span class="na">adapter</span><span class="pi">:</span> <span class="s">mysql2</span>
<span class="na">encoding</span><span class="pi">:</span> <span class="s">utf8</span>
<span class="na">reconnect</span><span class="pi">:</span> <span class="no">false</span>
<span class="na">pool</span><span class="pi">:</span> <span class="m">5</span>
<span class="na">database</span><span class="pi">:</span> <span class="s"><%= ENV['MYAPP_DATABASE'] || 'myapp_development' %></span>
<span class="na">username</span><span class="pi">:</span> <span class="s"><%= ENV['MYAPP_USERNAME'] %></span>
<span class="na">password</span><span class="pi">:</span> <span class="s"><%= ENV['MYAPP_PASSWORD'] %></span>
<span class="na">host</span><span class="pi">:</span> <span class="s"><%= ENV['MYAPP_HOST'] || 'localhost' %></span>
<span class="na">port</span><span class="pi">:</span> <span class="s"><%= ENV['MYAPP_PORT'] || 3306 %></span>
</code></pre></div></div>
<p>Rails parses ERB tags by default when interpreting <code class="language-plaintext highlighter-rouge">database.yml</code> config. But for the custom configs we might have in the app, like the Redis one below for example, we need to replicate that behaviour.</p>
<p>Here is a very simple <code class="language-plaintext highlighter-rouge">ConfigParser</code> class that does that:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">require</span> <span class="s1">'yaml'</span>
<span class="nb">require</span> <span class="s1">'erb'</span>
<span class="k">class</span> <span class="nc">ConfigParser</span>
<span class="k">def</span> <span class="nc">self</span><span class="o">.</span><span class="nf">parse</span><span class="p">(</span><span class="n">file</span><span class="p">,</span> <span class="n">environment</span><span class="p">)</span>
<span class="no">YAML</span><span class="p">.</span><span class="nf">load</span><span class="p">(</span><span class="no">ERB</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="no">IO</span><span class="p">.</span><span class="nf">read</span><span class="p">(</span><span class="n">file</span><span class="p">)).</span><span class="nf">result</span><span class="p">)[</span><span class="n">environment</span><span class="p">]</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Then for a Redis config:</p>
<div class="language-yaml highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="s">:development:</span>
<span class="s">:host: <%= ENV['REDIS_HOST'] || 'localhost' %></span>
<span class="s">:port: <%= ENV['REDIS_PORT'] || 6379 %></span>
<span class="s">:password: <%= ENV['REDIS_PASSWORD'] %></span>
<span class="s">:db: <%= ENV['REDIS_DB'] || 10 %></span>
<span class="s">:reconnect_attempts: </span><span class="m">3</span>
<span class="s">:timeout: </span><span class="m">2</span>
</code></pre></div></div>
<p>We can use the <code class="language-plaintext highlighter-rouge">ConfigParser</code> like:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">redis_config</span> <span class="o">=</span> <span class="no">ConfigParser</span><span class="p">.</span><span class="nf">parse</span><span class="p">(</span><span class="s1">'config/redis.yml'</span><span class="p">,</span> <span class="no">Rails</span><span class="p">.</span><span class="nf">env</span><span class="p">.</span><span class="nf">to_sym</span><span class="p">)</span>
<span class="n">redis_conn</span> <span class="o">=</span> <span class="no">Redis</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">redis_conf</span><span class="p">)</span>
</code></pre></div></div>
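<p>For a self-contained illustration of the ERB-plus-YAML flow, the following sketch uses a temp file and hypothetical variable names, and string keys as in <code>database.yml</code>: the ERB tags are resolved from environment variables before the YAML is parsed, with the fallbacks applying for unset variables.</p>

```ruby
require 'yaml'
require 'erb'
require 'tempfile'

# The ConfigParser from above, repeated so this example runs standalone.
class ConfigParser
  def self.parse(file, environment)
    YAML.load(ERB.new(IO.read(file)).result)[environment]
  end
end

# Simulate variables provided by the environment (names are illustrative).
ENV['MYAPP_HOST'] = 'db.internal'
ENV.delete('MYAPP_PORT') # left unset, so the YAML fallback applies

file = Tempfile.new(['database', '.yml'])
file.write(<<~YAML)
  development:
    host: <%= ENV['MYAPP_HOST'] || 'localhost' %>
    port: <%= ENV['MYAPP_PORT'] || 3306 %>
YAML
file.close

config = ConfigParser.parse(file.path, 'development')
puts config['host'] # => db.internal (taken from ENV)
puts config['port'] # => 3306 (the fallback value)
```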
<h3 id="rails-52-encrypted-credentials">Rails 5.2 Encrypted Credentials</h3>
<p>With Rails 5.2 being just around the corner, and specifically the <a href="https://www.engineyard.com/blog/rails-encrypted-credentials-on-rails-5.2">Encrypted Credentials</a> feature, we have the option to keep all the secrets encrypted in source control.</p>
<p>We can put all the secrets from the different environments <code class="language-plaintext highlighter-rouge">.env.development.secrets</code>, <code class="language-plaintext highlighter-rouge">.env.test.secrets</code> and <code class="language-plaintext highlighter-rouge">.env.production.secrets</code> in <code class="language-plaintext highlighter-rouge">config/credentials.yml.enc</code> and then the only value that the deploy target will need as a dependency is the <code class="language-plaintext highlighter-rouge">config/master.key</code> encryption key.</p>
<p>This approach of storing production secrets in the codebase, although encrypted, might be a sensitive matter for some organizations.</p>
<h3 id="final-thoughts">Final thoughts</h3>
<p>I’ve considered using different Rails environments as an alternative approach. That increases complexity and does not meet some of the configs design goals. There are also Rails env checks in the codebase that behave as feature flags. So, the overloading approach with environment variables and the flexibility of using YAML for more complex configs works pretty well in all these scenarios.</p>
A Walkthrough for Handling and Testing Exceptions
2017-10-22T09:00:00+00:00
https://dalibornasevic.com/posts/handling-and-testing-exceptions
<p>In a previous blog post I wrote about the problem of <a href="/posts/52-don-t-overuse-exceptions">overusing exceptions</a>, and in this one we’ll look at some exception handling and testing practices.</p>
<p>To start with, let’s define a <code class="language-plaintext highlighter-rouge">LinkCounter</code> class. <code class="language-plaintext highlighter-rouge">LinkCounter</code> counts how many links are on a web page. It is initialized with a URL, uses the <a href="https://github.com/lostisland/faraday">Faraday</a> HTTP client to fetch the page content, and uses <a href="https://github.com/sparklemotion/nokogiri">Nokogiri</a> to parse the HTML content.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">require</span> <span class="s1">'faraday'</span>
<span class="nb">require</span> <span class="s1">'nokogiri'</span>
<span class="k">class</span> <span class="nc">LinkCounter</span>
<span class="k">def</span> <span class="nf">initialize</span><span class="p">(</span><span class="n">url</span><span class="p">)</span>
<span class="vi">@url</span> <span class="o">=</span> <span class="n">url</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">count</span>
<span class="n">doc</span><span class="p">.</span><span class="nf">css</span><span class="p">(</span><span class="s1">'a'</span><span class="p">).</span><span class="nf">count</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">doc</span>
<span class="no">Nokogiri</span><span class="o">::</span><span class="no">HTML</span><span class="p">.</span><span class="nf">parse</span><span class="p">(</span><span class="n">content</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">content</span>
<span class="n">connection</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="vi">@url</span><span class="p">).</span><span class="nf">body</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">connection</span>
<span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Then, we can use it like this:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">puts</span> <span class="no">LinkCounter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="s1">'https://example.com'</span><span class="p">).</span><span class="nf">count</span> <span class="c1"># 1</span>
</code></pre></div></div>
<p>Pretty simple so far.</p>
<h3 id="what-could-possibly-go-wrong">What could possibly go wrong?</h3>
<p>To improve the robustness of our <code class="language-plaintext highlighter-rouge">LinkCounter</code> we need to think about what could fail. We identify Faraday’s <code class="language-plaintext highlighter-rouge">connection.get</code> call, which performs the <code class="language-plaintext highlighter-rouge">GET</code> HTTP request, as the one with the highest probability of failure because it depends on the reliability of the network.</p>
<blockquote>
<p>Always rescue very specific exceptions. Never rescue <code class="language-plaintext highlighter-rouge">Exception</code> and avoid rescuing <code class="language-plaintext highlighter-rouge">StandardError</code> too because it can hide unexpected errors like <code class="language-plaintext highlighter-rouge">NameError</code> and <code class="language-plaintext highlighter-rouge">NoMethodError</code>. See ruby’s <a href="http://blog.nicksieger.com/articles/2006/09/06/rubys-exception-hierarchy/">exception hierarchy</a>.</p>
</blockquote>
<p>In order to rescue the very specific exceptions, we need to figure out all the exceptions that Faraday can raise. Good libraries usually have a separate file defining all their errors, as is the case with <a href="https://github.com/lostisland/faraday/blob/master/lib/faraday/error.rb">Faraday errors</a> or, as another example, <a href="https://github.com/redis/redis-rb/blob/master/lib/redis/errors.rb">Redis errors</a>.</p>
<p>Looking at the Faraday error definitions we can see it has the following hierarchy:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">StandardError</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">Error</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">MissingDependency</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">ClientError</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">ConnectionFailed</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">ResourceNotFound</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">ParsingError</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span>
<span class="no">Faraday</span><span class="o">::</span><span class="no">SSLError</span>
</code></pre></div></div>
<h3 id="exploring-faraday-errors">Exploring Faraday errors</h3>
<p>We need to explore and understand under what conditions each of the Faraday errors can happen.</p>
<p>So, if we define a very small open timeout, we’ll see a <code class="language-plaintext highlighter-rouge">Faraday::ConnectionFailed</code> error.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="ss">request: </span><span class="p">{</span> <span class="ss">open_timeout: </span><span class="mf">0.1</span> <span class="p">}).</span><span class="nf">get</span><span class="p">(</span><span class="s1">'https://example.com'</span><span class="p">)</span>
<span class="c1"># Faraday::ConnectionFailed: execution expired</span>
</code></pre></div></div>
<p>If we define a small read timeout, we’ll get <code class="language-plaintext highlighter-rouge">Faraday::TimeoutError</code>.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="ss">request: </span><span class="p">{</span> <span class="ss">open_timeout: </span><span class="mi">1</span><span class="p">,</span> <span class="ss">timeout: </span><span class="mf">0.1</span> <span class="p">}).</span>
<span class="nf">get</span><span class="p">(</span><span class="s1">'https://example.com'</span><span class="p">)</span>
<span class="c1"># Faraday::TimeoutError: Net::ReadTimeout</span>
</code></pre></div></div>
<p>Note here that if we set only the <code class="language-plaintext highlighter-rouge">timeout</code> value, the <code class="language-plaintext highlighter-rouge">open_timeout</code> will use the same value, so we wouldn’t be able to reproduce the <code class="language-plaintext highlighter-rouge">Faraday::TimeoutError</code>; we’d get the <code class="language-plaintext highlighter-rouge">Faraday::ConnectionFailed</code> error again.</p>
<p>For docs on timeouts in other popular Ruby gems, you can check out this popular <a href="https://github.com/ankane/the-ultimate-guide-to-ruby-timeouts">github repo</a>.</p>
<p>If we try <code class="language-plaintext highlighter-rouge">GET</code> request to a nonexistent host we get <code class="language-plaintext highlighter-rouge">Faraday::ConnectionFailed</code>.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'https://example.nonexistent.com'</span><span class="p">)</span>
<span class="c1"># Faraday::ConnectionFailed: Failed to open TCP connection to example.nonexistent.com:443 (getaddrinfo: Name or service not known)</span>
</code></pre></div></div>
<p>Note that in this case we also have a nice exception message <code class="language-plaintext highlighter-rouge">getaddrinfo: Name or service not known</code> that distinguishes this error from the error that happens when a connection cannot be opened for an existing host.</p>
<p>If we request a website without SSL support, we get <code class="language-plaintext highlighter-rouge">Faraday::SSLError</code>.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'https://ruby.mk'</span><span class="p">)</span>
<span class="c1"># Faraday::SSLError: SSL_connect returned=1 errno=0 state=SSLv3 read server certificate B: certificate verify failed</span>
</code></pre></div></div>
<p>Finally, if we configure Faraday to raise exceptions on 40x and 50x responses, we’ll see it raises <code class="language-plaintext highlighter-rouge">Faraday::ResourceNotFound</code> error for 404 response:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span> <span class="k">do</span> <span class="o">|</span><span class="n">faraday</span><span class="o">|</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">use</span> <span class="no">Faraday</span><span class="o">::</span><span class="no">Response</span><span class="o">::</span><span class="no">RaiseError</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">adapter</span> <span class="no">Faraday</span><span class="p">.</span><span class="nf">default_adapter</span>
<span class="k">end</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'https://httpstat.us/404'</span><span class="p">)</span>
<span class="c1"># Faraday::ResourceNotFound: the server responded with status 404</span>
</code></pre></div></div>
<p>And, we’ll get <code class="language-plaintext highlighter-rouge">Faraday::ClientError</code> for 500 response:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span> <span class="k">do</span> <span class="o">|</span><span class="n">faraday</span><span class="o">|</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">use</span> <span class="no">Faraday</span><span class="o">::</span><span class="no">Response</span><span class="o">::</span><span class="no">RaiseError</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">adapter</span> <span class="no">Faraday</span><span class="p">.</span><span class="nf">default_adapter</span>
<span class="k">end</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'https://httpstat.us/500'</span><span class="p">)</span>
<span class="c1"># Faraday::ClientError: the server responded with status 500</span>
</code></pre></div></div>
<p>Note that in the last two examples I use this handy <a href="https://httpstat.us/">httpstat.us</a> service that returns the requested status code.</p>
<h3 id="handling-exceptions">Handling exceptions</h3>
<p>Based on our previous exploration, we conclude that we will retry <code class="language-plaintext highlighter-rouge">Faraday::TimeoutError</code> and <code class="language-plaintext highlighter-rouge">Faraday::ConnectionFailed</code> errors, except when the host does not exist, i.e. when the exception message is <code class="language-plaintext highlighter-rouge">getaddrinfo: Name or service not known</code>.</p>
<p>Let’s define a general purpose <code class="language-plaintext highlighter-rouge">Retryable</code> module for that.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">module</span> <span class="nn">Retryable</span>
<span class="no">SLEEP_INTERVAL</span> <span class="o">=</span> <span class="mf">0.4</span>
<span class="k">def</span> <span class="nf">with_retries</span><span class="p">(</span><span class="ss">retries: </span><span class="mi">3</span><span class="p">,</span> <span class="ss">retry_skip_reason: </span><span class="kp">nil</span><span class="p">,</span> <span class="ss">rescue_class: </span><span class="p">)</span>
<span class="n">tries</span> <span class="o">=</span> <span class="mi">0</span>
<span class="k">begin</span>
<span class="k">yield</span>
<span class="k">rescue</span> <span class="o">*</span><span class="n">rescue_class</span> <span class="o">=></span> <span class="n">e</span>
<span class="n">tries</span> <span class="o">+=</span> <span class="mi">1</span>
<span class="k">if</span> <span class="n">tries</span> <span class="o"><=</span> <span class="n">retries</span> <span class="o">&&</span> <span class="p">(</span><span class="n">retry_skip_reason</span><span class="p">.</span><span class="nf">nil?</span> <span class="o">||</span> <span class="o">!</span><span class="n">e</span><span class="p">.</span><span class="nf">message</span><span class="p">.</span><span class="nf">include?</span><span class="p">(</span><span class="n">retry_skip_reason</span><span class="p">))</span>
<span class="nb">sleep</span> <span class="n">sleep_interval</span><span class="p">(</span><span class="n">tries</span><span class="p">)</span>
<span class="k">retry</span>
<span class="k">else</span>
<span class="k">raise</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">sleep_interval</span><span class="p">(</span><span class="n">tries</span><span class="p">)</span>
<span class="p">(</span><span class="no">SLEEP_INTERVAL</span> <span class="o">+</span> <span class="nb">rand</span><span class="p">(</span><span class="mf">0.0</span><span class="o">..</span><span class="mf">1.0</span><span class="p">))</span> <span class="o">*</span> <span class="n">tries</span> <span class="o">**</span> <span class="mi">2</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>This module provides a <code class="language-plaintext highlighter-rouge">with_retries</code> method that by default retries an error up to 3 times with an exponential, randomized sleep interval. It also accepts a <code class="language-plaintext highlighter-rouge">retry_skip_reason</code> option to skip the retry when the exception message matches the skip reason.</p>
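<p>As a quick sanity check of the backoff math, here is a standalone sketch of the <code class="language-plaintext highlighter-rouge">sleep_interval</code> formula above: for try <em>n</em>, the wait is <code class="language-plaintext highlighter-rouge">(0.4 + rand(0.0..1.0)) * n ** 2</code> seconds.</p>

```ruby
# Standalone sketch of the backoff formula used by Retryable#sleep_interval.
SLEEP_INTERVAL = 0.4

def sleep_interval(tries)
  # quadratic growth with a random jitter of up to 1 extra second per "slot"
  (SLEEP_INTERVAL + rand(0.0..1.0)) * tries ** 2
end

(1..3).each do |tries|
  interval = sleep_interval(tries)
  # try 1 waits roughly 0.4-1.4s, try 2 roughly 1.6-5.6s, try 3 roughly 3.6-12.6s
  puts format('try %d: %.2fs', tries, interval)
end
```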
<p>We can now use the <code class="language-plaintext highlighter-rouge">Retryable</code> module with <code class="language-plaintext highlighter-rouge">LinkCounter</code> as follows:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">LinkCounter</span>
<span class="kp">include</span> <span class="no">Retryable</span>
<span class="c1"># the rest of the code</span>
<span class="k">def</span> <span class="nf">content</span>
<span class="n">with_retries</span><span class="p">(</span>
<span class="ss">rescue_class: </span><span class="p">[</span><span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span><span class="p">,</span> <span class="no">Faraday</span><span class="o">::</span><span class="no">ConnectionFailed</span><span class="p">],</span>
<span class="ss">retry_skip_reason: </span><span class="s1">'getaddrinfo: Name or service not known'</span>
<span class="p">)</span> <span class="k">do</span>
<span class="n">connection</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="vi">@url</span><span class="p">).</span><span class="nf">body</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">connection</span>
<span class="vi">@connection</span> <span class="o">||=</span> <span class="no">Faraday</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span>
<span class="ss">request: </span><span class="p">{</span> <span class="ss">open_timeout: </span><span class="mi">10</span><span class="p">,</span> <span class="ss">timeout: </span><span class="mi">30</span> <span class="p">}</span>
<span class="p">)</span> <span class="k">do</span> <span class="o">|</span><span class="n">faraday</span><span class="o">|</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">use</span> <span class="no">Faraday</span><span class="o">::</span><span class="no">Response</span><span class="o">::</span><span class="no">RaiseError</span>
<span class="n">faraday</span><span class="p">.</span><span class="nf">adapter</span> <span class="no">Faraday</span><span class="p">.</span><span class="nf">default_adapter</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>The other exceptions that Faraday could raise are not temporary and we don’t want to retry them. We could either rescue and ignore them, or let them propagate and be tracked by the exception tracking system we have in place. Which option to choose depends on the use case and on whether these errors halt our running system.</p>
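<p>The “rescue and report” option can be sketched like this (<code class="language-plaintext highlighter-rouge">PermanentError</code> and <code class="language-plaintext highlighter-rouge">ErrorTracker</code> are hypothetical stand-ins for a non-retryable error and your tracking client):</p>

```ruby
# Hypothetical stand-ins: a non-temporary error and a tracking client.
class PermanentError < StandardError; end

module ErrorTracker
  def self.notify(error)
    reported << error # forward to your exception tracking system here
  end

  def self.reported
    @reported ||= []
  end
end

def fetch_content
  raise PermanentError, 'the server responded with status 400'
rescue PermanentError => e
  ErrorTracker.notify(e) # track it once, but do not retry
  nil                    # fall back to a safe default value
end
```

<p>Calling <code class="language-plaintext highlighter-rouge">fetch_content</code> records the error and returns <code class="language-plaintext highlighter-rouge">nil</code> instead of raising.</p>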
<h3 id="testing-exception-retries">Testing exception retries</h3>
<blockquote>
<p>Always provide a test / spec that documents why each exception is being handled. This is very important for future readers of the code to understand the failure context better.</p>
</blockquote>
<p>We’ll use RSpec to test the exception retries. If we focus on the <code class="language-plaintext highlighter-rouge">Faraday::TimeoutError</code>, the scenarios that we want to test are that 1) an error is retried and 2) retry is not infinite.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">describe</span> <span class="no">LinkCounter</span> <span class="k">do</span>
<span class="n">let</span><span class="p">(</span><span class="ss">:url</span><span class="p">)</span> <span class="p">{</span> <span class="s1">'http://example.com'</span> <span class="p">}</span>
<span class="n">it</span> <span class="s2">"retries read timeout errors"</span> <span class="k">do</span>
<span class="n">link_counter</span> <span class="o">=</span> <span class="no">LinkCounter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">url</span><span class="p">)</span>
<span class="n">connection</span> <span class="o">=</span> <span class="n">link_counter</span><span class="p">.</span><span class="nf">send</span><span class="p">(</span><span class="ss">:connection</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">once</span><span class="p">.</span><span class="nf">and_raise</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">once</span><span class="p">.</span><span class="nf">and_return</span><span class="p">(</span><span class="n">double</span><span class="p">(</span><span class="ss">body: </span><span class="s1">'<a href="#">link</a>'</span><span class="p">))</span>
<span class="n">allow_any_instance_of</span><span class="p">(</span><span class="no">Retryable</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:sleep_interval</span><span class="p">).</span><span class="nf">and_return</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">link_counter</span><span class="p">.</span><span class="nf">count</span><span class="p">).</span><span class="nf">to</span> <span class="n">eq</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">it</span> <span class="s2">"re-raises read timeout error after exhausting error retries"</span> <span class="k">do</span>
<span class="n">link_counter</span> <span class="o">=</span> <span class="no">LinkCounter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">url</span><span class="p">)</span>
<span class="n">connection</span> <span class="o">=</span> <span class="n">link_counter</span><span class="p">.</span><span class="nf">send</span><span class="p">(</span><span class="ss">:connection</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">exactly</span><span class="p">(</span><span class="mi">4</span><span class="p">).</span><span class="nf">times</span><span class="p">.</span><span class="nf">and_raise</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span><span class="p">)</span>
<span class="n">allow_any_instance_of</span><span class="p">(</span><span class="no">Retryable</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:sleep_interval</span><span class="p">).</span><span class="nf">and_return</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="n">expect</span> <span class="p">{</span>
<span class="n">expect</span><span class="p">(</span><span class="n">link_counter</span><span class="p">.</span><span class="nf">count</span><span class="p">)</span>
<span class="p">}.</span><span class="nf">to</span> <span class="n">raise_error</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>In the above example we use <a href="https://github.com/rspec/rspec-mocks">rspec-mocks</a> to set expectations for consecutive calls. In the first spec, the first <code class="language-plaintext highlighter-rouge">GET</code> request raises a timeout error and the second call returns a body with content that has one link. In the second spec, we expect 4 <code class="language-plaintext highlighter-rouge">GET</code> requests (1 + 3 retries), all of them raising a timeout error, so the exception is ultimately re-raised.</p>
<p>If you are using <a href="https://github.com/freerange/mocha">mocha</a>, you can set expectations for consecutive invocations like this:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">connection</span><span class="p">.</span><span class="nf">expects</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span>
<span class="nf">raises</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">TimeoutError</span><span class="p">).</span>
<span class="nf">then</span><span class="p">.</span><span class="nf">returns</span><span class="p">(</span><span class="n">stub</span><span class="p">(</span><span class="ss">body: </span><span class="s1">'<a href="#">link</a>'</span><span class="p">))</span>
</code></pre></div></div>
<p>Let’s now cover the other two cases: 3) retrying open timeout errors and 4) not retrying unknown host errors.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">describe</span> <span class="no">LinkCounter</span> <span class="k">do</span>
<span class="c1"># the rest of the specs</span>
<span class="n">it</span> <span class="s2">"retries open timeout errors"</span> <span class="k">do</span>
<span class="n">link_counter</span> <span class="o">=</span> <span class="no">LinkCounter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">url</span><span class="p">)</span>
<span class="n">connection</span> <span class="o">=</span> <span class="n">link_counter</span><span class="p">.</span><span class="nf">send</span><span class="p">(</span><span class="ss">:connection</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">once</span><span class="p">.</span><span class="nf">and_raise</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">ConnectionFailed</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="s1">'execution expired'</span><span class="p">))</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">once</span><span class="p">.</span><span class="nf">and_return</span><span class="p">(</span><span class="n">double</span><span class="p">(</span><span class="ss">body: </span><span class="s1">'<a href="#">link</a>'</span><span class="p">))</span>
<span class="n">allow_any_instance_of</span><span class="p">(</span><span class="no">Retryable</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:sleep_interval</span><span class="p">).</span><span class="nf">and_return</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">link_counter</span><span class="p">.</span><span class="nf">count</span><span class="p">).</span><span class="nf">to</span> <span class="n">eq</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">it</span> <span class="s2">"does not retry unknown host errors"</span> <span class="k">do</span>
<span class="n">link_counter</span> <span class="o">=</span> <span class="no">LinkCounter</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">url</span><span class="p">)</span>
<span class="n">connection</span> <span class="o">=</span> <span class="n">link_counter</span><span class="p">.</span><span class="nf">send</span><span class="p">(</span><span class="ss">:connection</span><span class="p">)</span>
<span class="n">expect</span><span class="p">(</span><span class="n">connection</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:get</span><span class="p">).</span><span class="nf">once</span><span class="p">.</span><span class="nf">and_raise</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">ConnectionFailed</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="s2">"Failed to open TCP connection to example.nonexistent.com:80 (getaddrinfo: Name or service not known)"</span><span class="p">))</span>
<span class="n">allow_any_instance_of</span><span class="p">(</span><span class="no">Retryable</span><span class="p">).</span><span class="nf">to</span> <span class="n">receive</span><span class="p">(</span><span class="ss">:sleep_interval</span><span class="p">).</span><span class="nf">and_return</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span>
<span class="n">expect</span> <span class="p">{</span>
<span class="n">expect</span><span class="p">(</span><span class="n">link_counter</span><span class="p">.</span><span class="nf">count</span><span class="p">)</span>
<span class="p">}.</span><span class="nf">to</span> <span class="n">raise_error</span><span class="p">(</span><span class="no">Faraday</span><span class="o">::</span><span class="no">ConnectionFailed</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<h3 id="final-notes">Final notes</h3>
<p>In this walkthrough I intentionally did not use TDD, to keep the focus on these other important details. Also, we are often surprised by exceptions we cannot predict in development: they appear in production and we handle them after the fact. The important thing is to always document the very specific exception with a spec, including the conditions in which it happens, so that others can understand, improve and refactor the code in the future.</p>In a previous blog post I wrote about the problem of overusing exceptions, and in this one we’ll look at some exception handling and testing practices.Debugging Rails Views in Production2017-06-11T10:00:00+00:002017-06-11T10:00:00+00:00https://dalibornasevic.com/posts/debugging-rails-views-in-production<p>Today I’m going to share a quick technique for debugging Rails views in production. When there is a nasty bug or performance issue, the easiest way to find the cause is to reproduce it in the environment where it’s happening with the real data and in the real context.</p>
<p>The technique involves monkey-patching production code in the Rails console: adding print statements, and defining or redefining methods that, when called, give us some insight into what’s going on. By investigating and isolating segment by segment, usually with read-only operations to prevent undesirable data side effects, we’ll eventually figure out the cause.</p>
<p>It’s easy to use this approach with small and isolated classes and methods that can be initialized and called without much setup, but we can use the same approach with the standard request-response cycle to debug views with <a href="http://api.rubyonrails.org/classes/Rails/ConsoleMethods.html">ConsoleMethods</a> from Rails.</p>
<h3 id="find-the-slow-partial">Find the slow partial</h3>
<p>Say we have a controller action and we want to investigate where exactly it’s getting slow in the views. Imagine that this is happening only for a single user in production, and the average transaction metric is not revealing any useful info.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">HomeController</span> <span class="o"><</span> <span class="no">ApplicationController</span>
<span class="n">before_filter</span> <span class="ss">:authenticate_account!</span>
<span class="k">def</span> <span class="nf">index</span>
<span class="c1"># some stuff</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>We can now use the <a href="http://api.rubyonrails.org/classes/Rails/ConsoleMethods.html#method-i-app">app</a> instance available in console to make a <code class="language-plaintext highlighter-rouge">GET</code> request to <code class="language-plaintext highlighter-rouge">/</code> path in the app.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> <span class="n">app</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'/'</span><span class="p">)</span>
<span class="no">Started</span> <span class="no">GET</span> <span class="s2">"/"</span> <span class="k">for</span> <span class="mf">127.0</span><span class="o">.</span><span class="mf">0.1</span> <span class="n">at</span> <span class="mi">2017</span><span class="o">-</span><span class="mo">06</span><span class="o">-</span><span class="mi">11</span> <span class="mi">08</span><span class="p">:</span><span class="mi">45</span><span class="p">:</span><span class="mi">15</span> <span class="o">+</span><span class="mo">0200</span>
<span class="no">Processing</span> <span class="n">by</span> <span class="no">HomeController</span><span class="c1">#index as HTML</span>
<span class="no">Completed</span> <span class="mi">401</span> <span class="no">Unauthorized</span> <span class="k">in</span> <span class="mi">10</span><span class="n">ms</span> <span class="p">(</span><span class="no">ActiveRecord</span><span class="p">:</span> <span class="mf">0.0</span><span class="n">ms</span><span class="p">)</span>
<span class="o">=></span> <span class="mi">302</span>
</code></pre></div></div>
<p>Oh, of course. We cannot get to the view rendering yet because of the before filter; we’ll need to authenticate first. We can either log in with another request, or just stub authentication for the duration of this console session, which also avoids logging our credentials in the production console history.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">class</span> <span class="nc">HomeController</span>
<span class="n">skip_before_filter</span> <span class="ss">:authenticate_account!</span>
<span class="k">def</span> <span class="nf">current_account</span>
<span class="no">Account</span><span class="p">.</span><span class="nf">find</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
</code></pre></div></div>
<p>Then, by making the <code class="language-plaintext highlighter-rouge">GET</code> request to <code class="language-plaintext highlighter-rouge">/</code> path we’ll get the details from the views rendering:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> app.get<span class="o">(</span><span class="s1">'/'</span><span class="o">)</span>
Started GET <span class="s2">"/"</span> <span class="k">for </span>127.0.0.1 at 2017-06-11 00:59:44 +0200
Processing by HomeController#index as HTML
Rendered home/_view1.html.erb <span class="o">(</span>0.0ms<span class="o">)</span>
Rendered home/_view2.html.erb <span class="o">(</span>10000.2ms<span class="o">)</span>
Rendered home/index.html.erb within layouts/application <span class="o">(</span>10001.3ms<span class="o">)</span>
Rendered shared/_topnav.html.erb <span class="o">(</span>0.2ms<span class="o">)</span>
Rendered shared/_flash_messages.html.erb <span class="o">(</span>0.1ms<span class="o">)</span>
Rendered shared/_header.html.erb <span class="o">(</span>0.1ms<span class="o">)</span>
Rendered shared/_footer.html.erb <span class="o">(</span>0.0ms<span class="o">)</span>
Completed 200 OK <span class="k">in </span>10010ms <span class="o">(</span>Views: 10009.8ms | ActiveRecord: 0.0ms<span class="o">)</span>
<span class="o">=></span> 200
</code></pre></div></div>
<p>From the rendering info, we can see that most of the time, that is around 10 seconds, is spent rendering <code class="language-plaintext highlighter-rouge">home/_view2.html.erb</code> partial. We have identified that something slow is happening there but we don’t know what exactly it is.</p>
<h3 id="get-the-stacktrace">Get the stacktrace</h3>
<p>While the request is processing the slow part we can just press <code class="language-plaintext highlighter-rouge">CTRL+C</code> to stop it and get a stacktrace:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> app.get<span class="o">(</span><span class="s1">'/'</span><span class="o">)</span>
Started GET <span class="s2">"/"</span> <span class="k">for </span>127.0.0.1 at 2017-06-11 01:05:30 +0200
Processing by HomeController#index as HTML
Rendered home/_view1.html.erb <span class="o">(</span>0.0ms<span class="o">)</span>
^C Rendered home/_view2.html.erb <span class="o">(</span>1309.6ms<span class="o">)</span>
Rendered home/index.html.erb within layouts/application <span class="o">(</span>1310.6ms<span class="o">)</span>
Completed 500 Internal Server Error <span class="k">in </span>1312ms <span class="o">(</span>ActiveRecord: 0.0ms<span class="o">)</span>
IRB::Abort <span class="o">(</span>abort <span class="k">then </span>interrupt!<span class="o">)</span>:
app/views/home/_view2.html.erb:1:in <span class="sb">`</span><span class="nb">sleep</span><span class="s1">'
app/views/home/_view2.html.erb:1:in `_app_views_home__view__html_erb__3830501997270886489_69842991281620'</span>
app/views/home/index.html.erb:3:in <span class="sb">`</span>_app_views_home_index_html_erb___2357466009542976056_69842998601520<span class="s1">'
Rendered /home/dalibor/.rbenv/versions/2.4.1/lib/ruby/gems/2.4.0/gems/actionpack-4.2.8/lib/action_dispatch/middleware/templates/rescues/_source.erb (5.6ms)
Rendered /home/dalibor/.rbenv/versions/2.4.1/lib/ruby/gems/2.4.0/gems/actionpack-4.2.8/lib/action_dispatch/middleware/templates/rescues/_trace.html.erb (2.2ms)
Rendered /home/dalibor/.rbenv/versions/2.4.1/lib/ruby/gems/2.4.0/gems/actionpack-4.2.8/lib/action_dispatch/middleware/templates/rescues/_request_and_response.html.erb (0.7ms)
Rendered /home/dalibor/.rbenv/versions/2.4.1/lib/ruby/gems/2.4.0/gems/actionpack-4.2.8/lib/action_dispatch/middleware/templates/rescues/diagnostics.html.erb within rescues/layout (18.0ms)
=> 500
</span></code></pre></div></div>
<p>From the stacktrace we can see that the slow call is the call to <code class="language-plaintext highlighter-rouge">sleep</code> method in <code class="language-plaintext highlighter-rouge">view2</code> partial. So, once we know the “what”, we can go and start figuring out the “why”.</p>
<p>Alternatively to this, we can use <code class="language-plaintext highlighter-rouge">TracePoint</code> as explained in <a href="/posts/51-tracing-ruby-code">tracing ruby</a> blog post to get a stacktrace and then play roulette to sample individual calls to figure out what’s slow.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">trace</span> <span class="p">{</span> <span class="n">app</span><span class="p">.</span><span class="nf">get</span><span class="p">(</span><span class="s1">'/'</span><span class="p">)</span> <span class="p">}</span>
</code></pre></div></div>
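<p>The <code class="language-plaintext highlighter-rouge">trace</code> helper comes from that blog post; the underlying idea can be sketched with a plain <code class="language-plaintext highlighter-rouge">TracePoint</code> (the class and method names below are illustrative):</p>

```ruby
# Minimal sketch: record every Ruby method call made inside a block,
# so slow spots can be sampled afterwards.
def trace_calls
  calls = []
  tp = TracePoint.new(:call) do |t|
    calls << "#{t.defined_class}##{t.method_id}"
  end
  tp.enable { yield } # trace only for the duration of the block
  calls
end

class SlowView
  def render
    helper
  end

  def helper
    :done
  end
end

calls = trace_calls { SlowView.new.render }
# calls now contains entries like "SlowView#render" and "SlowView#helper"
```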
<p>The <a href="http://api.rubyonrails.org/classes/Rails/ConsoleMethods.html">ConsoleMethods</a> module has a few handy methods that you can check out.</p>
<p>For example, we can get the response body.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> <span class="n">app</span><span class="p">.</span><span class="nf">response</span><span class="p">.</span><span class="nf">body</span><span class="p">.</span><span class="nf">first</span><span class="p">(</span><span class="mi">15</span><span class="p">)</span>
<span class="o">=></span> <span class="s2">"<!DOCTYPE html>"</span>
</code></pre></div></div>
<p>We can call routes and helper methods, etc.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> <span class="n">helper</span><span class="p">.</span><span class="nf">link_to</span><span class="p">(</span><span class="n">app</span><span class="p">.</span><span class="nf">root_path</span><span class="p">,</span> <span class="s1">'Home'</span><span class="p">)</span>
<span class="o">=></span> <span class="s2">"<a href=</span><span class="se">\"</span><span class="s2">Home</span><span class="se">\"</span><span class="s2">>/</a>"</span>
</code></pre></div></div>Today I’m going to share a quick technique for debugging Rails views in production. When there is a nasty bug or performance issue, the easiest way to find the cause is to reproduce it in the environment where it’s happening with the real data and in the real context.Faster CI builds using an in-memory database2017-02-21T08:10:00+00:002017-02-21T08:10:00+00:00https://dalibornasevic.com/posts/faster-ci-builds-using-an-in-memory-database<p>What if you could get some speed improvement for your database intensive tests for free?</p>
<p>In this blog post we’ll use an in-memory file storage called <a href="https://en.wikipedia.org/wiki/Tmpfs">tmpfs</a> that is available on most Unix-like operating systems. To test this out I am using Ubuntu 14.04 and MySQL 5.5.54, but the approach applies to any database that writes data to disk. Databases are very sensitive to <a href="https://en.wikipedia.org/wiki/IOPS">IOPS</a> since their job is reading and writing data, and the speed gain comes from writes to RAM being much faster than writes to disk.</p>
<p>You should expect a more significant speed improvement if your test suite is more database-intensive, for example when using a database cleaning strategy with non-transactional fixtures, truncation, etc. The gain depends on the speed difference between writing to RAM and writing to disk on your machine.</p>
<p>In my test, I got ~ <strong>32%</strong> speed improvement with build time decrease from <strong>32.96</strong> to <strong>22.37</strong> seconds:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c"># Before</span>
Finished <span class="k">in </span>32.96 seconds <span class="o">(</span>files took 3.38 seconds to load<span class="o">)</span>
1316 examples, 0 failures, 1 pending
<span class="c"># After</span>
Finished <span class="k">in </span>22.37 seconds <span class="o">(</span>files took 3.29 seconds to load<span class="o">)</span>
1316 examples, 0 failures, 1 pending
</code></pre></div></div>
<p>To test this out yourself and see what speed improvement you get, this is what to do.</p>
<h3 id="create-a-ram-disk">Create a RAM disk</h3>
<p>Create a new directory <code class="language-plaintext highlighter-rouge">/mnt/testdisk</code> and then use the <code class="language-plaintext highlighter-rouge">mount</code> command to create a disk using <code class="language-plaintext highlighter-rouge">tmpfs</code> file storage with size of 300 megabytes.</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo mkdir</span> /mnt/testdisk
<span class="nb">sudo </span>mount <span class="nt">-t</span> tmpfs <span class="nt">-o</span> <span class="nv">size</span><span class="o">=</span>300m tmpfs /mnt/testdisk
</code></pre></div></div>
<p>In case you need to unmount and remove that directory later, you can do that with:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>umount /mnt/testdisk
<span class="nb">sudo rm</span> <span class="nt">-rf</span> /mnt/testdisk
</code></pre></div></div>
<h3 id="run-mysql-in-docker-container">Run MySQL in Docker container</h3>
<p>If you’re not familiar with Docker you can skip this section. If you are familiar or you want to get familiar, first <a href="https://docs.docker.com/engine/installation/">install it</a>, and then just set up a MySQL container with the following command:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>docker run <span class="se">\</span>
<span class="nt">--detach</span> <span class="se">\</span>
<span class="nt">--name</span><span class="o">=</span>mysql-test <span class="se">\</span>
<span class="nt">--env</span><span class="o">=</span><span class="s2">"MYSQL_ROOT_PASSWORD=pass"</span> <span class="se">\</span>
<span class="nt">--volume</span><span class="o">=</span>/mnt/testdisk:/var/lib/mysql <span class="se">\</span>
mysql:5.5.54
</code></pre></div></div>
<p>That command creates a new MySQL container named <code class="language-plaintext highlighter-rouge">mysql-test</code> using version <code class="language-plaintext highlighter-rouge">5.5.54</code>. It sets the MySQL root password to <code class="language-plaintext highlighter-rouge">pass</code> and mounts the RAM disk we previously created at <code class="language-plaintext highlighter-rouge">/mnt/testdisk</code> to <code class="language-plaintext highlighter-rouge">/var/lib/mysql</code>, which is the default MySQL datadir.</p>
<p>If the container was set up successfully, find its IP address and check that MySQL is running inside:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>docker inspect mysql-test | <span class="nb">grep</span> <span class="s2">"IPAddress"</span>
<span class="c"># "IPAddress": "172.17.0.2",</span>
mysql <span class="nt">-uroot</span> <span class="nt">-ppass</span> <span class="nt">-h</span> 172.17.0.2 <span class="nt">-P</span> 3306
</code></pre></div></div>
<p>If you can connect, all is good and you are ready to change the host in <code class="language-plaintext highlighter-rouge">database.yml</code> for the test environment and measure the build time.</p>
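<p>For example, a hypothetical <code class="language-plaintext highlighter-rouge">database.yml</code> test section pointing at the container could look like this; the host IP and password come from the commands above, while the database name is illustrative:</p>
<div class="language-yaml highlighter-rouge"><div class="highlight"><pre class="highlight"><code>test:
  adapter: mysql2
  database: myapp_test
  username: root
  password: pass
  host: 172.17.0.2
  port: 3306
</code></pre></div></div>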
<p>If something goes wrong during container bootup, you can start debugging by checking the container logs. I had an issue where the container creation failed when I tried to use a smaller partition size for the tmpfs disk.</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>docker logs mysql-test
</code></pre></div></div>
<p>In case you want to remove the container, you can do that with:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>docker <span class="nb">rm</span> <span class="nt">-f</span> mysql-test
</code></pre></div></div>
<h3 id="configure-local-mysql">Configure local MySQL</h3>
<p>If you’re not familiar with Docker, the other option is to manually change the local MySQL config. The inconvenience is that you’ll need to keep changing the config when switching between development and test, because the RAM disk data will not persist after reboots. Here are the setup steps.</p>
<p>Stop MySQL service:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>service mysql stop
</code></pre></div></div>
<p>Change MySQL data directory:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c"># sudo vi /etc/mysql/my.cnf</span>
datadir <span class="o">=</span> /mnt/testdisk
</code></pre></div></div>
<p>Add apparmor (Linux kernel security module) alias for the new MySQL path:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c"># sudo vi /etc/apparmor.d/tunables/alias</span>
<span class="nb">alias</span> /var/lib/mysql/ -> /mnt/testdisk,
</code></pre></div></div>
<p>Restart apparmor service:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>service apparmor restart
</code></pre></div></div>
<p>Re-configure MySQL to set things up properly in the new data directory:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sudo </span>dpkg-reconfigure mysql-server-5.5
</code></pre></div></div>
<p>The last command will auto-start MySQL and then you should be ready to measure the test time.</p>
<p>Share in the comments what speed improvements you get.</p>What if you could get some speed improvement for your database intensive tests for free?Auto-reconnect for ActiveRecord connections2017-01-20T08:00:00+00:002017-01-20T08:00:00+00:00https://dalibornasevic.com/posts/auto-reconnect-for-active-record-connections<p>ActiveRecord has a special config option <code class="language-plaintext highlighter-rouge">reconnect: true</code> for native auto-reconnect when using a MySQL database. With that option in <code class="language-plaintext highlighter-rouge">database.yml</code>, it will try to reconnect only once as per the <a href="http://dev.mysql.com/doc/refman/5.7/en/auto-reconnect.html">manual</a> before it fails:</p>
<blockquote>
<p>The MySQL client library can perform an automatic reconnection to the server if it finds that the connection is down when you attempt to send a statement to the server to be executed. If auto-reconnect is enabled, the library tries once to reconnect to the server and send the statement again.</p>
</blockquote>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> Post.count
<span class="o">(</span>0.7ms<span class="o">)</span> SELECT COUNT<span class="o">(</span><span class="k">*</span><span class="o">)</span> FROM <span class="sb">`</span>posts<span class="sb">`</span>
ActiveRecord::StatementInvalid: Mysql2::Error: Can<span class="s1">'t connect to local MySQL server through socket '</span>/var/run/mysqld/mysqld.sock<span class="s1">' (2): SELECT COUNT(*) FROM `posts`
</span></code></pre></div></div>
<p>Often we want more control over the reconnect strategy, to give the connection more than one chance to recover. Imagine performing a master-slave fail-over, or a database server that is unstable and needs about 10 seconds of downtime before it becomes available again. To keep the service reliable, we’ll need to avoid dropping requests during that interval.</p>
<p>One way to do that would be to patch ActiveRecord to auto-reconnect with custom wait intervals like:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">module</span> <span class="nn">Mysql2AdapterPatch</span>
<span class="k">def</span> <span class="nf">execute</span><span class="p">(</span><span class="o">*</span><span class="n">args</span><span class="p">)</span>
<span class="c1"># During `reconnect!`, `Mysql2Adapter` first disconnects and sets the</span>
<span class="c1"># @connection to nil, and then tries to connect. When connecting fails,</span>
<span class="c1"># @connection will be left as a nil value, which will cause issues later.</span>
<span class="n">connect</span> <span class="k">if</span> <span class="vi">@connection</span><span class="p">.</span><span class="nf">nil?</span>
<span class="k">begin</span>
<span class="k">super</span><span class="p">(</span><span class="o">*</span><span class="n">args</span><span class="p">)</span>
<span class="k">rescue</span> <span class="no">ActiveRecord</span><span class="o">::</span><span class="no">StatementInvalid</span> <span class="o">=></span> <span class="n">e</span>
<span class="k">if</span> <span class="n">e</span><span class="p">.</span><span class="nf">message</span> <span class="o">=~</span> <span class="sr">/server has gone away/i</span>
<span class="n">in_transaction</span> <span class="o">=</span> <span class="n">transaction_manager</span><span class="p">.</span><span class="nf">current_transaction</span><span class="p">.</span><span class="nf">open?</span>
<span class="n">try_reconnect</span>
<span class="n">in_transaction</span> <span class="p">?</span> <span class="k">raise</span> <span class="p">:</span> <span class="k">retry</span>
<span class="k">else</span>
<span class="k">raise</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="kp">private</span>
<span class="k">def</span> <span class="nf">try_reconnect</span>
<span class="n">sleep_times</span> <span class="o">=</span> <span class="p">[</span><span class="mf">0.1</span><span class="p">,</span> <span class="mf">0.5</span><span class="p">,</span> <span class="mi">1</span><span class="p">,</span> <span class="mi">2</span><span class="p">,</span> <span class="mi">4</span><span class="p">,</span> <span class="mi">8</span><span class="p">]</span>
<span class="k">begin</span>
<span class="n">reconnect!</span>
<span class="k">rescue</span> <span class="no">Mysql2</span><span class="o">::</span><span class="no">Error</span> <span class="o">=></span> <span class="n">e</span>
<span class="n">sleep_time</span> <span class="o">=</span> <span class="n">sleep_times</span><span class="p">.</span><span class="nf">shift</span>
<span class="k">if</span> <span class="n">sleep_time</span> <span class="o">&&</span> <span class="n">e</span><span class="p">.</span><span class="nf">message</span> <span class="o">=~</span> <span class="sr">/can't connect/i</span>
<span class="nb">warn</span> <span class="s2">"Server timed out, retrying in </span><span class="si">#{</span><span class="n">sleep_time</span><span class="si">}</span><span class="s2"> sec."</span>
<span class="nb">sleep</span> <span class="n">sleep_time</span>
<span class="k">retry</span>
<span class="k">else</span>
<span class="k">raise</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="nb">require</span> <span class="s1">'active_record/connection_adapters/mysql2_adapter'</span>
<span class="no">ActiveRecord</span><span class="o">::</span><span class="no">ConnectionAdapters</span><span class="o">::</span><span class="no">Mysql2Adapter</span><span class="p">.</span><span class="nf">prepend</span> <span class="no">Mysql2AdapterPatch</span>
</code></pre></div></div>
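<p>The shift-from-a-list backoff in <code class="language-plaintext highlighter-rouge">try_reconnect</code> can be exercised in isolation. Here is a hypothetical plain-Ruby sketch of the same pattern; the <code class="language-plaintext highlighter-rouge">with_backoff</code> name and the simulated failure are illustrative, not part of ActiveRecord:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code># Retry a block using the same fixed backoff schedule as the patch above.
# When the schedule is exhausted, the last error is re-raised.
def with_backoff(sleep_times = [0.1, 0.5, 1, 2, 4, 8])
  schedule = sleep_times.dup
  begin
    yield
  rescue StandardError
    sleep_time = schedule.shift
    raise if sleep_time.nil?
    warn "Server timed out, retrying in #{sleep_time} sec."
    sleep sleep_time
    retry
  end
end

# Simulate a call that fails three times and then succeeds.
attempts = 0
result = with_backoff([0, 0, 0, 0]) do
  attempts += 1
  raise "can't connect" unless attempts == 4
  :connected
end
# result is :connected and attempts is 4
</code></pre></div></div>
<p>Note that the real patch only retries on connection-related errors; anything else is re-raised immediately.</p>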
<p>When the connection goes down, the patch starts trying to reconnect and finally succeeds once the server is back up.</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">>></span> Post.count
<span class="o">(</span>0.6ms<span class="o">)</span> SELECT COUNT<span class="o">(</span><span class="k">*</span><span class="o">)</span> FROM <span class="sb">`</span>posts<span class="sb">`</span>
Server timed out, retrying <span class="k">in </span>0.1 sec.
Server timed out, retrying <span class="k">in </span>0.5 sec.
Server timed out, retrying <span class="k">in </span>1 sec.
Server timed out, retrying <span class="k">in </span>2 sec.
Server timed out, retrying <span class="k">in </span>4 sec.
<span class="o">(</span>1.1ms<span class="o">)</span> SELECT COUNT<span class="o">(</span><span class="k">*</span><span class="o">)</span> FROM <span class="sb">`</span>posts<span class="sb">`</span>
<span class="o">=></span> 0
</code></pre></div></div>
<p>What’s interesting to note here is that if the connection goes down and reconnects during a transaction block, it will continue executing the subsequent queries and silently lose the queries issued from the start of the transaction up to the moment the connection dropped. That’s why, when reconnecting while <code class="language-plaintext highlighter-rouge">in_transaction</code> as the patch above does, it’s safer to re-raise the connection error.</p>
<p>Here’s an example to demonstrate that edge-case:</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="no">Post</span><span class="p">.</span><span class="nf">transaction</span> <span class="k">do</span>
<span class="no">Post</span><span class="p">.</span><span class="nf">create</span>
<span class="nb">sleep</span> <span class="mi">5</span>
<span class="no">Post</span><span class="p">.</span><span class="nf">count</span>
<span class="k">end</span>
</code></pre></div></div>
<p>If the connection drops during the <code class="language-plaintext highlighter-rouge">sleep</code> call and then reconnects, the patch re-raises the connection error to stop executing the remaining queries, because the record from <code class="language-plaintext highlighter-rouge">Post.create</code> will have been lost.</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code> <span class="o">(</span>0.3ms<span class="o">)</span> BEGIN
SQL <span class="o">(</span>0.2ms<span class="o">)</span> INSERT INTO <span class="sb">`</span>posts<span class="sb">`</span> <span class="o">(</span><span class="sb">`</span>created_at<span class="sb">`</span>, <span class="sb">`</span>updated_at<span class="sb">`</span><span class="o">)</span> VALUES <span class="o">(</span><span class="s1">'2017-01-18 20:18:14'</span>, <span class="s1">'2017-01-18 20:18:14'</span><span class="o">)</span>
<span class="o">(</span>0.2ms<span class="o">)</span> SELECT COUNT<span class="o">(</span><span class="k">*</span><span class="o">)</span> FROM <span class="sb">`</span>posts<span class="sb">`</span>
Server timed out, retrying <span class="k">in </span>0.1 sec.
Server timed out, retrying <span class="k">in </span>0.5 sec.
Server timed out, retrying <span class="k">in </span>1 sec.
Server timed out, retrying <span class="k">in </span>2 sec.
<span class="o">(</span>0.1ms<span class="o">)</span> ROLLBACK
ActiveRecord::StatementInvalid: Mysql2::Error: MySQL server has gone away: SELECT COUNT<span class="o">(</span><span class="k">*</span><span class="o">)</span> FROM <span class="sb">`</span>posts<span class="sb">`</span>
</code></pre></div></div>
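<p>This edge-case can be modeled without a database. The following toy simulation (plain Ruby; the <code class="language-plaintext highlighter-rouge">ToyServer</code> class is purely illustrative) shows why retrying only the failed statement is unsafe: statements buffered before the drop are silently lost.</p>
<div class="language-ruby highlighter-rouge"><div class="highlight"><pre class="highlight"><code># A toy server: statements issued inside a transaction are buffered and
# applied only on COMMIT. A dropped connection discards the buffer,
# much like a real server rolls back an open transaction.
class ToyServer
  attr_reader :rows

  def initialize
    @rows = []
    @buffer = []
  end

  def execute(statement)
    @buffer.push(statement)
  end

  def drop_connection!
    @buffer = []
  end

  def commit
    @rows.concat(@buffer)
    @buffer = []
  end
end

server = ToyServer.new
server.execute("INSERT post")  # succeeds, but is only buffered
server.drop_connection!        # connection dies mid-transaction
server.execute("SELECT count") # a naive retry continues as if nothing happened
server.commit
# server.rows contains only ["SELECT count"]; the INSERT silently vanished
</code></pre></div></div>
<p>Re-raising the error while <code class="language-plaintext highlighter-rouge">in_transaction</code>, as the patch does, surfaces the loss instead of committing a partial transaction.</p>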
<p>I hope you find this info useful; please share your thoughts in the comments.</p>