
Introduction to Big Data

Big data refers to larger, more complex data sets, especially those from new data sources. These data sets are so voluminous that traditional data processing software cannot manage them. Yet these massive volumes of data can be used to address business problems that could not be tackled before. (Oracle.com, 2014)

Types of Big Data

1. Structured Data
Any data that can be stored, accessed, and processed in a fixed format is called structured data.
2. Unstructured Data
Any data whose form or structure is unknown is labelled unstructured data. A typical example is a heterogeneous data source that contains a combination of simple text files, images, videos, etc.
3. Semi-Structured Data
Semi-structured data may contain both forms of data. It can look structured in form, but it is not explicitly defined by, for example, a table definition in a relational DBMS. Data described in an XML file is a typical example.
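The three types above can be illustrated with a small sketch. This is only an illustration using Python's standard library; the sample records, names, and field choices are made up for the example and are not taken from the cited sources.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: every record follows the same fixed schema (id, name, age).
# The sample rows are hypothetical.
structured_csv = "id,name,age\n1,Asha,31\n2,Liam,27\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))

# Semi-structured: tags describe the data, but records need not share
# the same fields (the second customer has no <email> element).
semi_structured_xml = """
<customers>
  <customer id="1"><name>Asha</name><email>asha@example.com</email></customer>
  <customer id="2"><name>Liam</name></customer>
</customers>
"""
root = ET.fromstring(semi_structured_xml)
customers = [
    {"id": c.get("id"), "name": c.findtext("name"), "email": c.findtext("email")}
    for c in root.findall("customer")
]

# Unstructured: free text with no predefined fields; any structure has
# to be derived by processing the content itself.
unstructured_text = "Asha called on Monday to say the delivery arrived late."
word_count = len(unstructured_text.split())

print(rows[0]["name"], customers[1]["email"], word_count)
```

The CSV rows can be queried by column name straight away, the XML records have to be checked for missing fields, and the free text offers nothing but its raw content to work with.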

Advantages and Disadvantages of Big Data
Generally speaking, having more data on one's clients (and potential customers) helps companies refine their offerings and marketing efforts to create the highest level of loyalty and repeat business. Organizations capable of collecting massive amounts of data can perform deeper and richer analysis. (Guru99.com, 2020)
While better analysis is a positive thing, big data can also generate complexity and noise. Companies must be able to handle larger volumes of data while determining which data represents signal as opposed to noise. A key factor will be deciding what makes the data important. (Guru99.com, 2020)

Big Data Tools

Based on popularity and usability, the following is a list of open-source tools:

2. Apache Spark
3. Apache Storm
4. Cassandra
5. RapidMiner
6. MongoDB
8. Neo4j
9. Apache SAMOA (Guru99.com, 2020)

       
References

  • Oracle.com. (2014). What Is Big Data? | Oracle Ireland. [online] Available at: https://www.oracle.com/ie/big-data/guide/what-is-big-data.html [Accessed 22 Jan. 2020].
  • Guru99.com. (2020). Introduction to BIG DATA: What is, Types, Characteristics & Example. [online] Available at: https://www.guru99.com/what-is-big-data.html [Accessed 22 Jan. 2020].
  • Verma, A. (2018). Top 10 Open Source Big Data Tools in 2020 [Updated]. Whizlabs Blog. [online] Available at: https://www.whizlabs.com/blog/big-data-tools/ [Accessed 23 Jan. 2020].

