Scaling MongoDB Best Practices for Sharding, Indexing and Performance Isolation
Developers and data teams typically make early decisions on data modeling, shard configuration and indexing strategy in MongoDB without full knowledge of future query access patterns and database loads. These decisions have the potential to affect how MongoDB scales to handle data growth and new application requirements, so how should engineering organizations tackle the challenges of scaling MongoDB?
Dai Shi, a former Site Reliability Engineer at Foursquare, spent several years operating MongoDB in production at scale, writing a custom cluster balancer and building automated self-healing tools to ensure cluster health. The lessons learned from his experience culminate in 3 scaling strategies that he’ll cover in this talk:
-
Sharding & cluster balancing: How to choose your shard key and how to make the cluster autobalancer work for you
-
Indexing: Processes for creating new collections and indexes, and how to identify when a production issue is caused by an index
-
Offloading from production MongoDB: Identifying workloads to offload to ensure reliability and isolation of your primary MongoDB database, and methods for building integrations to secondary data stores
The talk will conclude with a live Q&A so you can get your scaling questions answered by Dai.