The Data Infrastructure team is responsible for the datastores we use, including MySQL, HBase, ElasticSearch, and Kafka. We aim to provide four nines (99.99%) or better of reliability for all of them, and to help our developers use them easily and safely.
We spend a lot of time building automation, tooling, and monitoring so that developers can understand and optimize their datastore usage, and so that operational issues have as little impact as possible on customers and developers alike.
We're also operationally responsible for a huge volume of traffic to and from these datastores. Our HBase clusters serve over 3 million requests/second across 220+ tables, while our ElasticSearch clusters serve over 20k searches/second and 50k index operations/second across 90+ billion documents. Streaming that data to and from applications amounts to more than 3 GB/sec through our Kafka clusters, with hundreds of producers and consumers.
If you're:
Get in touch! We'd love to talk to you about our Data Infrastructure team.