Instagram architecture & database – How does it store & search billions of images
Instagram is the most popular photo-oriented social network on the planet today. With over a billion users, it has become the first choice for businesses to run their marketing campaigns on. This write-up is a deep dive into its platform architecture and addresses questions like…
An Insight into Bazaarvoice Scalable Architecture with Over 300 Million Visitors
Bazaarvoice is a digital marketing company based out of Austin, Texas. The service it offers enables retailers to add customer reviews to their websites. Also, it helps big brands like Adidas & Samsung syndicate the customer reviews, who purchased their product, across multiple retail e-commerce…
How Does PayPal Process Billions of Messages Per Day with Reactive Streams?
Chances are you have already come across this phrase “Data is the new Oil” A business’s worth today is gauged by the amount of data it contains pertaining to a certain niche or market segment. In this social media era petabytes of data generation on online…
How HotStar Scaled With 10.3 Million Concurrent Users – An Architectural Insight
Hotstar is the leading streaming media & video-on-demand service in India with a user base of approx. 200 million users. It lets users stream popular shows in several different languages & genres over the web. But the primary, the most popular feature of the service…
How Evernote Migrated & Scaled their Workload with Google Cloud Platform
Evernote is a leader when it comes to offering a service which enables us to save notes, capture our thoughts, project ideas, to-do lists, organize our data, also to collaborate with our teams online. The business has over 200 million happy customers. In the recent…
Distributed Systems and Scalability Feed
Facebook photo storage architecture
Facebook built Haystack, an object storage system designed for storing photos on a large scale. The platform stores over 260 billion images which amounts to over 20 petabytes of data. One billion new photos are uploaded each week which is approx—60 terabytes of data. At peak, the platform serves over one million images per second.
In the original NAS-based photo storage architecture, Facebook faced throughput and latency issues as the photos and the associated metadata lookups in NAS caused excessive disk operations almost upto ten just for retrieving a single image.

Tail latency in distributed systems
Tail latency is that tiny percentage of responses from a system that are the slowest in comparison to most of the responses. They are often called as the 98th or 99th percentile response times. This may seem insignificant at first but for large applications like LinkedIn, this has noticeable effects. This could mean that for a page having a million views per day 10,000 of those page views would experience the delay. Read how LinkedIn deals with longtail network latencies.
There can be multiple causes of tail latency: increasing load on the system, complex and distributed systems, application bottlenecks, slow network, slow disk access and more. Read more on it.
RobinHood: Tail latency-aware caching
RobinHood is a research caching system for application servers in large distributed systems having diverse backends. The cache system dynamically partitions the cache space between different backend services and continuously optimizes the partition sizes.
Microsoft research has a talk on getting rid of long-tail latencies.
Zero to Software Architect Learning Track - Starting from Zero to Designing Web-Scale Distributed Applications Like a Pro. Check it out.
Master system design for your interviews. Check out this blog post written by me.
Looking for developer, software architect jobs? Try Jooble. Jooble is a job search engine created for a single purpose: To help you find the job of your dreams!!
Zero to Software Architect Learning Track - Starting from Zero to Designing Web-Scale Distributed Applications Like a Pro. Check it out.
Recent Posts
- How Actor model/Actors run in clusters facilitating asynchronous communication in distributed systems
- Understanding the Actor model to build non-blocking, high-throughput distributed systems
- Technical Consultant: How can I become one? Explained with an example
- IT consultant: How do I become one? Explained with a real-world use case
- Software architecture course – From zero to mastering the fundamentals
Follow On Social Media