System Design Notes

Birthday Paradox & Base62

Collisions are more likely than expected. Only 23 people are needed for a >50% chance two share a birthday.
For space size N, expect 50% collision chance after about ( \sqrt{N} ) random picks.

Base62 Encoding

Length (power of 62)	Combinations
6	56 billion
7	3.5 trillion
8	218 trillion

Caching: In-Memory vs Disk

Speed

Storage	Latency
Memory	100 nanoseconds (0.0001 ms)
SSD	0.1 ms
HDD	10 ms

Throughput

Storage	IOPS (Reads/sec)
Memory	Millions
SSD	~100,000
HDD	100 - 200

CDN with Edge Computing

CDN = geographically distributed PoPs (Points of Presence).
Edge Computing = logic execution closer to user via Cloudflare Workers or AWS Lambda@Edge.
Benefits: Reduced latency, faster TTFB, global reach.
Cautions: Higher cost with scale; limits on memory, execution time.
Note: Be mindful of cache invalidation and consistency. TTL and CDC can help limit staleness.

Scaling Strategies

Database:
- Replication
- Sharding/Partitioning
- Backups
Services:
- Horizontal scaling
- Auto-scaling groups
Distributed Cache:
- Standalone vs Sentinel vs Cluster
Counters:
- Counter batching to reduce network overhead
Write Optimization:
- Batch writes
Design Patterns:
- CQRS (Command Query Responsibility Segregation)
- Sidecar Pattern
- Bulkhead Pattern
- Circuit Breaker

Uploading Large Files

Use Blob/Object storage with pre-signed URLs for direct upload/download.
Use event triggers (e.g., S3 triggers Lambda) for post-upload actions.
Prefer Chunking / Multipart upload for:
- Fault tolerance
- Parallelism
- Data integrity (via ETag + Part Number)
Watch out for:
- Large file timeouts (e.g., 50 GB on 100 Mbps = ~1.1 hrs)
- Web server limits (e.g., NGINX default 2GB)
Consider compression: GZip, Brotli, ZStandard — better for text files.

File Sync Agents

Platform	Utility
Linux	fswatch, inotify
macOS	FsEvents
Windows	FileSystemWatcher

File/Data Security

Encryption at Rest
Encryption in Transit
Access Control:
- RBAC (Role-Based Access Control)
- IAM roles
- KMS, Vault

Consistency & Transactions

Write-centric considerations:

Single Database (best)
SAGA pattern — with compensating transactions
Distributed Lock + 2PC
Application-level Lock (e.g., Mutex)

Other notes:

Use Redis SETNX for distributed locks
2PC for cross-shard transactions (may need pre-locking)
If race conditions rare: use optimistic concurrency (MVCC)
If possible, normalize schema to a single store to avoid complexity

Pagination

Client-Side

Use lazy loading when dataset is small or partially rendered on demand.

Server-Side

Offset Pagination:
- Simple but problematic under frequent inserts (can cause duplicate/missing rows).
Cursor-Based Pagination:
- Use timestamp + unique ID to avoid duplicates.
- Monotonic counters also work but prevent page jumps (e.g., page 50).
- GraphQL Relay uses base64 cursors.

Latency & Caching Strategies

Latency mitigation techniques:

Caching: Redis, Memcached, CDN
TTL: Reduce staleness window
CDC: Push updates to cache
Geo-Sharding: Serve from nearest region
Pooling: Connection/thread pooling
Prefetching & lazy loading

Caching Strategies

Strategy	Description
Cache Aside	App reads/writes DB, manages cache manually
Read Through	Cache auto-loads data on miss
Write Through	Writes go to cache and DB simultaneously
Write Back	Write to cache only; DB syncs later (risk of data loss on cache failure)

Eviction Policies:

LRU (Least Recently Used)
LFU (Least Frequently Used)
FIFO (First In First Out)

More sections to be added: CAP theorem, Rate limiting, Leader election, Observability, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
design.md		design.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

System Design Notes

Table of Contents

Birthday Paradox & Base62

Base62 Encoding

Caching: In-Memory vs Disk

Speed

Throughput

CDN with Edge Computing

Scaling Strategies

Uploading Large Files

File Sync Agents

File/Data Security

Consistency & Transactions

Pagination

Client-Side

Server-Side

Latency & Caching Strategies

Caching Strategies

About

Uh oh!

Releases

Packages

hemantobora/sd_notes

Folders and files

Latest commit

History

Repository files navigation

System Design Notes

Table of Contents

Birthday Paradox & Base62

Base62 Encoding

Caching: In-Memory vs Disk

Speed

Throughput

CDN with Edge Computing

Scaling Strategies

Uploading Large Files

File Sync Agents

File/Data Security

Consistency & Transactions

Pagination

Client-Side

Server-Side

Latency & Caching Strategies

Caching Strategies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages