Question

Critically evaluate and discuss the challenges and opportunities of implementing big data which could have arisen...

Critically evaluate and discuss the challenges and opportunities of implementing big data which could have arisen in view of social networking revolution.

Homework Answers

Answer #1

COMMENT IF ANYTHING NEEDED.

With the advent of Internet of Things (IoT) and Web 2.0 technologies, there has been a tremendous growth in the amount of data generated. This chapter emphasizes on the need for big data, technological advancements, tools and techniques being used to process big data are discussed. Technological improvements and limitations of existing storage techniques are alsopresented. Since, the traditional technologies like Relational Database Management System (RDBMS) have their own limitations to handle big data, new technologies have been developed to handle them and to derive useful insights.

With the digitization of most of the processes, emergence of different social network platforms, blogs, deployment of different kind of sensors, adoption of hand-held digital devices, wearable devices and explosion in the usage of Internet, huge amount of data are being generated on continuous basis. No one can deny that Internet has changed the way businesses operate, functioning of the government, education and lifestyle of people around the world. Today, thistrend is in a transformative stage, where the rate of data generation is very high and the type of data being generated surpasses the capability of existing data storage techniques. It cannot be denied that these data carry a lot more information than ever before due to the emergence and adoption of Internet.

Over the past two decades, there is a tremendous growth in data. This trend can be observed in almost every field. According to a report by International Data Corporation (IDC), a research company claims that between 2012 and 2020, the amount of information in the digital universe will grow by 35 trillion gigabytes (1 gigabyte equivalent to 40 (four-drawer) file cabinets of text, or two music CDs). That‟s on par with the number of stars in the physical universe! (Forsyth, 2012).

In the mid-2000s, the emergence of social media, cloud computing, and processing power (through multi-core processors and GPUs) contributed to the rise of big data (Manovich, 2011; Agneeswaran, 2012). As of December 2015, Facebook has an average of 1.04 billion daily active users, 934 million mobile daily active users, available in 70 languages, 125 billion friend connections, 205 billion photos uploaded every day 30 billion pieces of content, 2.7 billion likes, and comments are being posted and 130 average number of friends per Facebook user (Facebook, 2015). This has created new pathways to study social and cultural dynamics.Though big data has gained attention due to the emergence of the Internet, but it cannot be compared with it. It is beyond the Internet, though, Web makes it easier to collect and share knowledge as well data in raw form. Big Data is about how these data can be stored, processed, and comprehended such that it can be used for predicting the future course of action with a great precision and acceptable time delay.

The current and emerging focus of big data analytics is to explore traditional techniques such asrule-based systems, pattern mining, decision trees and other data mining techniques to develop business rules even on the large data sets efficiently. It can be achieved by either developing algorithms that uses distributed data storage, in-memory computation or by using cluster computing for parallel computation. Earlier these processes were carried out using grid computing, which was overtaken by cloud computing in recent days.

The concept of big data dates back to the year 2001, where the challenges of increasing data were addressed with a 3Vs model by Laney (2001). 3Vs, also known as the dimensions of big data, represent the increasing Volume, Variety, and Velocity of data (Assunção et al., 2015). The model was not originally used to define big data but later has been used eventually by various enterprises including Microsoft and IBM to define the same (Meijer, 2011).

In 2010, Apache Hadoop defined big data as “datasets, which could not be captured, managed, and processed by general computers within an acceptable scope” (p.173, Chen et al., 2014). Following this, in 2011, McKinsey Global Institute defined big data as "datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze" p.1 (Manyika et al., 2011). International Data Corporation (IDC) defines “big data technologies as a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis” (p. 6, Gantz and Reinsel, 2011).

Digitization of content by industries is the new source of data (Villars et al., 2011). Advancements in technology also lead to high rate of data generation. For example, one of the biggest surveys in Astronomy, Sloan Digital Sky Survey (SDSS) has recorded a total of 25TB data during their first (2000-2005) and second surveys (2005-2008) combined. With the advancements in the resolution of the telescope, the amount of data collected at the end of their third survey (2008-14) is 100 TB. Use of “smart” instrumentation is another source of big data. Smart meters in the energy sector record the electricity utilization measurement every 15 minutes as compared to monthly readings before. The data produced from Social Media sectors are Blog posts, tweets, social networking sites, log details which is used to analyze the customer behavior patterns.

Tools that are being used to collect data encompass various digital devices (for example, mobile devices, camera, wearable devices, and smart watches) and applications that generate enormous data in the form of logs, text, voice, images, and video. In order to process these data, several researchers are coming up with new techniques that help better representation of the unstructured data, which makes sense in big data context to gain useful insights that may not have been envisioned earlier.

R: is an open-source statistical computing language that provides a wide variety of statistical and graphical techniques to derive insights from the data. It has an effective data handling and storage facility and supports vector operations with a suite of operators for faster processing. It has all the features of a standard programming language and supports conditional arguments, loops, and user-defined functions.

Despite the growth in these technologies and algorithms to handle big data, there are there are

few limitations, which are discussed in this section.

1. Scalability and Storage Issues: The rate of increase in data is much faster than the existing processing systems. The storage systems are not capable enough to store these data (Chen et al., 2014; Li and Lu, 2014; Kaisler et al., 2013; Assunção et al., 2015). There is a need to develop a processing system that not only caters to today's needs but also future needs.

2. Timeliness of Analysis: The value of the data decreases over time. Most of the applications like fraud detection in telecom, insurance and banking, require real time or near real time analysis of the transactional data (Chen et al., 2014; Li and Lu, 2014).

3. Representation of Heterogeneous Data: Data obtained from various sources are heterogeneous in nature. Unstructured data like Images, videos and social media data cannot be stored and processed using traditional tools like SQL. Smartphones now record and share images, audios and videos at an incredibly increasing rate, forcing our brains to process more. However, the process for representing images, audios and videos lacks efficient storage and processing (Chen et al., 2014; Li and Lu, 2014; Cuzzocrea et al., 2011).

4. Data Analytics System: Traditional RDBMS are suitable only for structured data and they lack scalability and expandability. Though non-relational databases are used for processing unstructured data, but there exist problems with their performances. There is a need to design a system that combines the benefits of both relational and non-relational database systems to ensure flexibility (Chen et al., 2014; Li and Lu, 2014;).

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Discuss ethical, global, and security challenges involved with implementing an ERP system. In your experience either...
Discuss ethical, global, and security challenges involved with implementing an ERP system. In your experience either as a customer or within your own organization, describe what led to the need to implement an ERP system. Explain the challenges you have faced when a new ERP system was implemented or attempted to be implemented. In any of your experiences with E-Business and E-Commerce, how could an ERP system come into play to create a better shopping experience for the customer?
Could Blockbuster's destiny have been changed by simply using Big Data to their advantage?
Could Blockbuster's destiny have been changed by simply using Big Data to their advantage?
One of the challenges the accounting profession faces is that the tools accountants have traditionally used...
One of the challenges the accounting profession faces is that the tools accountants have traditionally used are ill-equipped for analyzing the types and quantity of data present in big data. True or Flase Information systems that collect data regarding the business events of entity and support its day-to-day business requirements originate from the following broad source: a. mechanical b. electronic c. operational d. social Given the tools and training that accountants currently possess, which of the following sources of data...
What are three areas within data communication of which managers need to have a clear understanding?...
What are three areas within data communication of which managers need to have a clear understanding? What are three business applications of social networking sites? What are three major components of a social media information system? What are three suggestions for making a company’s Web site more appealing to a global audience? What are three reasons that an information system may fail? What are four ways that a knowledge management system could help an organization? What are four recommendations for...
Which of the following data sets or plots could have a regression line with a negative...
Which of the following data sets or plots could have a regression line with a negative slope? Select all that apply. Select all that apply: The number of tons of trash in a landfill as a function of the number of years since the landfill was built. The number of loads of trash a dump truck hauls per month as a function of the number of years since the dump truck was built. The number of tons of trash produced...
Consider the following data, which represent the number of times individuals have visited a physician in...
Consider the following data, which represent the number of times individuals have visited a physician in the last 12 months. 2, 5, 4, 3, 3, 1, 0, 1, 0, 0, 7, 13, 5, 4, 3, 6, 9 i. Physicians’ offices in the area budget for 3 visits per year, per patient. Suppose researchers are interested in whether individuals do not visit their physicians exactly 3 times per year. Construct an appropriate pair of hypotheses that could be used to test...
In this second portion of the Final Exam, you will critically evaluate a quantitative research study...
In this second portion of the Final Exam, you will critically evaluate a quantitative research study on a social science topic. Your instructor will post an announcement with the reference for the article assigned for the exam. The study will be from a peer-reviewed journal and published within the last 10 years. In the body of your critique, describe the statistical approaches used, the variables included, the hypothesis(es) proposed, and the interpretation of the results. In your conclusion, suggest other...
CASE STUDY REWARD ENCOURAGES BEAST.....OOPS, BEST! Challenges and accomplishments Neera and Vijit’s experience proved to be...
CASE STUDY REWARD ENCOURAGES BEAST.....OOPS, BEST! Challenges and accomplishments Neera and Vijit’s experience proved to be an asset in the project. Both were technically qualified and committed to the work. Both wanted to excel in their work. Neera knew that with a big team, verbal instructions will get diluted and therefore written instructions were important. Manuals were prepared for each aspect of assessment and the entire field staff was meticulously trained so that clarity about the work and mechanism to...
Note:  100% plagiarism in the above paragraph please remove the plagiarism less than 15 % . CHALLENGES...
Note:  100% plagiarism in the above paragraph please remove the plagiarism less than 15 % . CHALLENGES / OPPORTUNITIES One of the major challenges is to change the people’s perspective of PepsiCo as an unhealthy soft drink producer. Due to the link of soft drinks to obesity and diabetes, the new CEO wants to reinvent Pepsi as a healthy food producer rather than a snacks producer. Although this is a good plan for the PepsiCo to consider, people who are used...
What tools could AA leaders have used to increase their awareness of internal and external issues?...
What tools could AA leaders have used to increase their awareness of internal and external issues? ???ALASKA AIRLINES: NAVIGATING CHANGE In the autumn of 2007, Alaska Airlines executives adjourned at the end of a long and stressful day in the midst of a multi-day strategic planning session. Most headed outside to relax, unwind and enjoy a bonfire on the shore of Semiahmoo Spit, outside the meeting venue in Blaine, a seaport town in northwest Washington state. Meanwhile, several members of...