Active Record Tenanting

Note

This file will eventually become a complete "Rails Guide"-style document explaining Active Record tenanting with this gem.

In the meantime, it is a work-in-progress containing:

skeleton outline for documentation
functional roadmap represented as to-do checklists

Introduction

Tip

If you're not familiar with how Rails's built-in horizontal sharding works, it may be worth reading the Rails Guide on Multiple Databases with Active Record before proceeding.

Documentation outline:

this gem primarily extends Active Record,
- essentially creating a new Connection Pool for each tenant,
- and extending horizontal shard swapping to support these pools.
- also provides test helpers to make it easy to handle tenanting in your test suite
but also touches many other parts of Rails
- integrations for Middleware, Action View Caching, Active Job, Action Cable, Active Storage, Action Mailbox, and Action Text
- support and documentation for Solid Cache, Solid Queue, Solid Cable, and Turbo Rails
a Tenant is just a string that is used for:
- the sqlite database filename (or perhaps the pg/mysql database name in the future)
- configuring either tenant-by-subdomain or a tenant-by-root-path-element
- fragment cache disambiguation
- global id disambiguation
- invalid characters in a tenant name
  - and how the application may want to do additional validation (e.g. ICANN subdomain restrictions)
- #tenant is a readonly attribute on all tenanted model instances
- .current_tenant returns the execution context for the model connection class
talk a bit about busted assumptions about shared state
- database ids are no longer unique
- global ids are no longer global
- cache is no longer global
- cable channels are no longer global
- jobs are no longer global
and what we do in this gem to help manage that "current tenant" state
reference existing approaches/projects, maybe talk about differences
logging
- SQL query logs
- TaggedLogging and config.log_tenant_tag
- suggest how to add to structured logs if people are doing that

Active Record

Configuration

Documentation outline:

how to configure database.yml
- for tenanting a primary database
- for tenanting a non-primary database
how to configure model classes and records
- variations for primary or non-primary records
- how to make a class that inherits from ActiveRecord::Base "subtenant" from a tenanted database
  - and note how we do it out of the box for Rails records
Rails configuration
- explain why we set some options
  - active_record.use_schema_cache_dump = true
  - active_record.check_schema_cache_dump_version = false
- explain gem railtie config options
  - connection_class
  - tenant_resolver
  - tenanted_rails_records
  - log_tenant_tag
- demonstrate how to configure an app with subdomain tenanting
  - app.config.hosts
  - example TenantSelector config
- demonstrate how to configure an app with root path tenanting
  - app.config.hosts
  - example TenantSelector config
migrations
- create_tenant migrates the new database
- but otherwise, creation of the connection pool for a tenant that has pending migrations will raise a PendingMigrationError
- db:migrate will migrate all tenants

TODO:

Tenanting in your application

Documentation outline:

introduce the basics
- explain .tenanted and the ActiveRecord::Tenanted::Tenant module
- explain .subtenant_of and the ActiveRecord::Tenanted::Subtenant module
- explain .with_tenant, .with_each_tenant, .current_tenant=, and current_tenant
- demonstrate how to create a tenant, destroy a tenant, etc.
troubleshooting: what errors you might see in your app and how to deal with it
- specifically when running untenanted

Testing

Documentation outline:

explain the concept of a default tenant
- and that database connection is wrapped in a transaction
explain creating a new tenant
- and how that database is NOT wrapped in a transaction during the test,
- but those non-fixture databases will be cleaned up at the start of the test suite
explain without_tenant
example of:
- unit test with fixtures
- integration test
- sytem test

TODO:

testing
- a without_tenant test helper
- set up test helper to default to a tenanted named "test-tenant"
- set up test helpers to deal with parallelized tests, too (e.g. "test-tenant-19")
- set up integration tests to do the right things ...
  - set the domain name in integration tests
  - wrap the HTTP verbs with without_tenant
  - set the domain name in system tests
- allow the creation of tenants within transactional tests

Caching

Documentation outline:

explain why we need to be careful
explain how active record objects' cache keys have tenanting built in
explain why we're not worried about collection caching and partial caching (?)
explain why we're not worried about russian doll caching
explain why calling Rails.cache directly requires care that it's either explicitly tenanted or global
explain why we're not worried about sql query caching (it belongs to the connection pool)

TODO:

make basic fragment caching work
investigate: is collection caching going to be tenanted properly
investigate: make sure the QueryCache executor is clearing query caches for tenanted pool
do we need to do some exploration on how to make sure all caching is tenanted?
- I'm making the call not to pursue this. Rails.cache is a primitive. Just document it.

Action View Fragment Caching

Documentation outline:

explain how it works (cache keys)

TODO:

extend #cache_key on Base
extend #cache_key on Subtenant

Solid Cache

Documentation outline:

describe one-big-cache and cache-in-the-tenanted-database strategies
- note that cache-in-the-tenanted-database means there is no global cache
- note that cache-in-the-tenanted-database is not easily purgeable (today)
- and so we recommend (?) one big cache in a dedicated database
how to configure Solid Cache for one-big-cache
how to configure Solid Cache for tenanted-cache

TODO:

upstream
- feature: make shard swap prohibition database-specific
  - which would work around Solid Cache config wonkiness caused by rails/solid_cache#219

Action Cable

Documentation outline:

explain why we need to be careful
how to tenant a channel
- make sure to call super if you override #connect
how the global id also contains the tenant
do we need to document each adapter?
- async
- test
- solid_cable
- redis?

TODO:

extend the base connection to support tenanting with a tenanted_connection method
reconsider the current API using tenanted_connection if we can figure out how to reliably wrap #connect
- did this! prefer to force the app to call super() from #connect, it's simpler
test disconnection
- ActionCable.server.remote_connections.where(current_tenant: "foo", current_user: User.find(1)).disconnect
- can we make this easier to use by implying the current tenant?
add tenant to the action_cable logger tags
add integration testing around executing a command (similar to Job testing)

Turbo Rails

Documentation outline:

explain why we need to be careful
explain how it works (global IDs)

TODO:

extend to_global_id and friends for Base
extend to_global_id and friends for Subtenant
some testing around global id would be good here
system test of a broadcast update

Active Job

Documentation outline:

explain why we need to be careful
explain belt-and-suspenders of
- ActiveJob including the current tenant,
- and any passed record being including the tenant in global_id

TODO:

extend ActiveJob to set the tenant in perform_now
extend to_global_id and friends for Base
extend to_global_id and friends for Subtenant
create a tenanted GlobalID locator
inject the tenanted GlobalID locator as the default app locator
make sure the test helper perform_enqueued_jobs wraps everything in a without_tenant block

Active Storage

Documentation outline:

explain why we need to be careful
explain how it works
- if connection_class is set, then Active Storage will insert the tenant into the blob key
  - and the disk service will include the tenant in the path on disk in the root location, like: 'foobar/ab/cd/abcd12345678abcd'
Disk Service can also have a tenanted root path, but it's optional

TODO:

extend Disk Service to change the path on disk
extend Blob to have tenanted keys

ActionMailer

Documentation outline:

explain how to configure the action mailer default host if needed, with a "%{tenant}" format specifier.

TODO:

Interpolate the tenant into a host set in config.action_mailer.default_url_options
Do we need to do something similar for the asset host?
- I'm going to wait until someone needs it, because it's not trivial to hijack.
Do we need to do something similar for explicit host parameters to url helpers?
- I don't think so.
- I'm going to wait until someone needs it, because it's not trivial to hijack.

ActionMailbox

TODO:

I need a use case here around mail routing before I tackle it

Console

Documentation outline:

explain the concept of a "default tenant"
explain usage of the ARTENANT environment variable to control startup

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Active Record Tenanting

Introduction

Active Record

Configuration

Tenanting in your application

Testing

Caching

Action View Fragment Caching

Solid Cache

Action Cable

Turbo Rails

Active Job

Active Storage

ActionMailer

ActionMailbox

Console

FilesExpand file tree

GUIDE.md

Latest commit

History

GUIDE.md

File metadata and controls

Active Record Tenanting

Introduction

Active Record

Configuration

Tenanting in your application

Testing

Caching

Action View Fragment Caching

Solid Cache

Action Cable

Turbo Rails

Active Job

Active Storage

ActionMailer

ActionMailbox

Console