
Facebook is Collecting Your Data — 500 Terabytes a Day

How much of your data is Facebook collecting every day? Some new stats from the company reveal just how large its user base is, and what big data means to a company with 950 million users.

With more than 950 million users, Facebook is collecting a lot of data. Every time you click a notification, visit a page, upload a photo, or check out a friend’s link, you’re generating data for the company to track. Multiply that by 950 million people, who spend on average more than 6.5 hours on the site every month, and you have a lot of information to deal with.

Here are some of the stats the company provided Wednesday to demonstrate just how big Facebook’s data really is:

  • 2.5 billion content items shared per day (status updates + wall posts + photos + videos + comments)
  • 2.7 billion Likes per day
  • 300 million photos uploaded per day
  • 100+ petabytes of disk space in one of FB’s largest Hadoop (HDFS) clusters
  • 105 terabytes of data scanned via Hive, Facebook’s Hadoop query language, every 30 minutes
  • 70,000 queries executed on these databases per day
  • 500+ terabytes of new data ingested into the databases every day
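
To put those figures in per-user terms, here is a rough back-of-the-envelope sketch. It uses only the numbers quoted above; the decimal definition of a terabyte (10**12 bytes), the assumption that the Hive scan rate holds around the clock, and the function names are illustrative choices, not anything Facebook published. Since the company reported "500+" terabytes, the per-user result is best read as a lower bound.

```python
# Back-of-the-envelope arithmetic on the figures quoted in the article.
# Assumption: decimal units, 1 TB = 10**12 bytes.
TB = 10**12

USERS = 950_000_000              # reported user count
NEW_DATA_TB_PER_DAY = 500        # "500+" TB ingested per day (lower bound)
HIVE_SCAN_TB_PER_WINDOW = 105    # TB scanned via Hive every 30 minutes


def per_user_bytes_per_day(tb_per_day: float, users: int) -> float:
    """Average bytes of new data ingested per user per day."""
    return tb_per_day * TB / users


def scan_tb_per_day(tb_per_window: float, window_minutes: int = 30) -> float:
    """Daily scan volume, assuming the per-window rate is sustained all day."""
    windows_per_day = (24 * 60) // window_minutes
    return tb_per_window * windows_per_day


if __name__ == "__main__":
    per_user = per_user_bytes_per_day(NEW_DATA_TB_PER_DAY, USERS)
    print(f"New data per user per day: about {per_user / 1000:.0f} KB")
    print(f"Hive scan volume per day: {scan_tb_per_day(HIVE_SCAN_TB_PER_WINDOW):,.0f} TB")
```

Under these assumptions, that works out to roughly 526 KB of new data per user per day, and about 5,040 terabytes (around 5 petabytes) scanned through Hive daily.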

“If you aren’t taking advantage of big data, then you don’t have big data, you have just a pile of data,” said Jay Parikh, VP of infrastructure at Facebook, on Wednesday. “Everything is interesting to us.”

Parikh said the company is constantly trying to figure out how to better analyze and make sense of the data, including doing extensive A/B testing on all potential updates to the site, and making sure it responds in real time to user input.

“We’re growing fast, but everyone else is growing faster,” he said.

Via: GigaOM

Prateek Panda

Prateek is the Founder of TheTechPanda. He's passionate about technology startups and entrepreneurship and enjoys speaking to new founders every day. Prateek has also been consistently regarded as one of the top marketing experts in the region.
