Big data acquisition, preparation, and analysis using apache software foundation tools

Document Type

Book Chapter

Publication Title

Big Data Analytics: Tools and Technology for Effective Planning

Abstract

Challenges in Big Data analysis include data inconsistency, incompleteness, scalability, timeliness, and data security. The fundamental challenge is the existing computer architecture. For several decades, the latency gap between multicore CPUs and mechanical hard disks has increased each year, making the challenges of data-intensive computing harder to overcome (Hey et al. 2009). A systematic and general approach to these problems with a scalable architecture is required. Most of the Big Data is unstructured or of a complex structure, which is hard to represent in rows and columns. A good candidate for a large design space can efficiently solve the Big Data problem in different disciplines. This chapter highlights two specific objectives.

First Page

195

Last Page

228

DOI

10.1201/b21822

Publication Date

1-1-2017

This document is currently not available here.

Share

COinS