Pinboard (jm)
https://pinboard.in/u:jm/public/
recent bookmarks from jmOctober 21 post-incident analysis | The GitHub Blog2018-10-31T10:57:33+00:00
https://blog.github.com/2018-10-30-oct21-post-incident-analysis/
jmgithub fail outages failover replication consensus opshttps://pinboard.in/https://pinboard.in/u:jm/b:94432c1d3b19/Cross-Region Read Replicas for Amazon Aurora2016-06-14T09:19:08+00:00
https://aws.amazon.com/blogs/aws/new-cross-region-read-replicas-for-amazon-aurora/
jmCreating a read replica in another region also creates an Aurora cluster in the region. This cluster can contain up to 15 more read replicas, with very low replication lag (typically less than 20 ms) within the region (between regions, latency will vary based on the distance between the source and target). You can use this model to duplicate your cluster and read replica setup across regions for disaster recovery. In the event of a regional disruption, you can promote the cross-region replica to be the master. This will allow you to minimize downtime for your cross-region application. This feature applies to unencrypted Aurora clusters.
]]>aws mysql databases storage replication cross-region failover reliability aurorahttps://pinboard.in/https://pinboard.in/u:jm/b:4bc9688d386b/Chaos Engineering Upgraded2015-09-28T15:33:16+00:00
http://techblog.netflix.com/2015/09/chaos-engineering-upgraded.html
jmarchitecture aws netflix ops chaos-monkey chaos-kong testing availability failover hahttps://pinboard.in/https://pinboard.in/u:jm/b:4dbd4d3af135/Uber Goes Unconventional: Using Driver Phones as a Backup Datacenter - High Scalability2015-09-23T21:54:42+00:00
http://highscalability.com/blog/2015/9/21/uber-goes-unconventional-using-driver-phones-as-a-backup-dat.html
jmscalability failover multi-dc uber replication state crdtshttps://pinboard.in/https://pinboard.in/u:jm/b:400d153ebfed/Aurora for MySQL is coming2014-12-05T22:53:40+00:00
http://smalldatum.blogspot.ie/2014/11/aurora-for-mysql-is-coming.html?showComment=1416603086179
jmvia:highscalability mysql aurora failover fault-tolerance aws replication quorumhttps://pinboard.in/https://pinboard.in/u:jm/b:ae310d17fac7/DynamoDB Streams2014-11-11T10:40:54+00:00
http://aws.amazon.com/blogs/aws/dynamodb-streams-preview/
jmiops dynamodb aws kinesis reliability replication multi-az multi-region failover streaming kafkahttps://pinboard.in/https://pinboard.in/u:jm/b:bd8b590d113c/Game Day Exercises at Stripe: Learning from `kill -9`2014-10-28T20:52:23+00:00
https://stripe.com/blog/game-day-exercises-at-stripe
jmWe’ve started running game day exercises at Stripe. During a recent game day, we tested failing over a Redis cluster by running kill -9 on its primary node, and ended up losing all data in the cluster. We were very surprised by this, but grateful to have found the problem in testing. This result and others from this exercise convinced us that game days like these are quite valuable, and we would highly recommend them for others.
Excellent post. Game days are a great idea. Also: massive Redis clustering fail]]>game-days redis testing stripe outages ops kill-9 failoverhttps://pinboard.in/https://pinboard.in/u:jm/b:0de011811783/Amazon Route 53 Infima2013-11-15T16:58:38+00:00
https://github.com/awslabs/route53-infima
jmInfima provides a Lattice container framework that allows you to categorize each endpoint along one or more fault-isolation dimensions such as availability-zone, software implementation, underlying datastore or any other common point of dependency endpoints may share.
Infima also introduces a new ShuffleShard sharding type that can exponentially increase the endpoint-level isolation between customer/object access patterns or any other identifier you choose to shard on.
Both Infima Lattices and ShuffleShards can also be automatically expressed in Route 53 DNS failover configurations using AnswerSet and RubberTree.
]]>infima dns route-53 fault-tolerance failover multi-az sharding service-discovery colmmacchttps://pinboard.in/https://pinboard.in/u:jm/b:8a967930a6a7/Building a Modern Website for Scale (QCon NY 2013) [slides]2013-06-17T10:37:00+00:00
http://www.slideshare.net/r39132/q-con-ny2013modernwebsitescalabilityfinal-22989785
jmgc-scout gc java scaling scalability linkedin qcon async threadpools rest slas timeouts networking distcomp netty tcp udp failover fault-tolerance packet-losshttps://pinboard.in/https://pinboard.in/u:jm/b:8766348f43f5/Is Your MySQL Buffer Pool Warm? Make It Sweat!2013-04-16T21:28:43+00:00
https://engineering.groupon.com/2013/mysql/mysql-buffer-pool-warming/
jmvia:dave-doran mysql databases warm-spares spares failover groupon percona replicationhttps://pinboard.in/https://pinboard.in/u:jm/b:6c770a73f651/High Scalability - geo-aware traffic load balancing and caching at CNBC.com2013-02-01T14:26:51+00:00
http://highscalability.com/blog/2010/2/6/geo-aware-traffic-load-balancing-and-caching-at-cnbccom.html
jmanycast dns scalability dyn failover geographical load-balancinghttps://pinboard.in/https://pinboard.in/u:jm/b:4d1a14c70348/Fault Tolerance in a High Volume, Distributed System2012-03-02T14:02:22+00:00
http://techblog.netflix.com/2012/02/fault-tolerance-in-high-volume.html
jmnetflix architecture concurrency distributed failover ha resiliency fail-fast failsafe soa fault-tolerancehttps://pinboard.in/https://pinboard.in/u:jm/b:d34c188eb195/