The provenance of the Big Data is the wide scale prevalence of huge Volume of Data being stored & processed by wide cross-section of the society .
To better comprehend the 'Big Data' and to understand how 'Big Data' differentiates itself from 'Traditional Databases' , we have to understand the 3 V's of Big Data .
Gartner analyst Doug Laney introduced the 3Vs concept in a 2001 MetaGroup research publication, 3D data management: Controlling data volume, variety and velocity.
What are these 3 V's of Big Data :-
Now , lets try to get some perspective on each of the above mentioned V of Big Data :-
The moment we discuss about 'Big Data' , the first thing which flashes through our brain is 'Volume'.
Volume is the most dominant aspect / attribute of 'Big Data'.
With the advent of IOT ( Internet of Things ) & widespread use of Social Media , world now is
dealing with insanely large amounts of data.
- Social Media Data - Can you imagine that Facebook has 2.23 Billion Active users as of Q2 2018 ( little shy of the combined population of China & India ) . with this high number of users , Facebook is storing roughly 250 billion images & 2.5 trillion posts.
- IOT Devices Data - As per the Expert's prediction the total volume of data generated by IoT will reach 600 ZB per year by 2020. Virgin Atlantic has embraced IoT with a slew of connected Boeing 787 aircraft and connected cargo devices. Each plane has multiple internet-connected parts generating a large volume of data ( each connected flight can produce more than a half a terabyte of data ) .The data is being used to predict maintenance requirements or to improve flight and fuel efficiency.
The second vector 'V' of Big Data stands for 'Velocity.
Velocity refers to Speed of Data Processing .
The importance of data's velocity — the rate at which data flows into an organization or flows out of
an Organization to the end-user has phenomenally increased over the years.
Industry terminology for such fast-moving data tends to be either "streaming data," or "complex
It's not just about input data stream Velocity which is critical , the velocity of a system's outputs can
matter too. The tighter the feedback loop, the greater the competitive advantage.
Variety refers to various types of Data being stored into the Big Data Store.
Traditional Databases can only store a specific type of Structured Data Type , whereas Big Data
can store any type and structure of Data.
One no longer has control over the input data format. Structure can no longer be imposed like in the
past in order to keep control over the analysis. As new applications are introduced new data formats
come to life.