Big data

Big data is a field concerned with ways to analyze, systematically extract information from, or otherwise handle data sets that are too large or complex for traditional data-processing application software. Data with many cases (rows) offer greater statistical power, while data with higher complexity (more attributes or columns) may lead to a higher false discovery rate. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer, visualization, querying, updating, information privacy, and data source. Big data was originally associated with three key concepts: volume, variety, and velocity. When handling big data, we may not sample but simply observe and track what happens. As a result, big data often includes data sets whose size exceeds the capacity of traditional software to process within an acceptable time and still yield value.
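To make the false discovery point concrete, here is a minimal sketch of how the number of spurious findings grows with the number of attributes tested. It assumes each attribute is tested independently at the same significance level and that none has a real effect; the function names and the values 100 and 0.05 are illustrative assumptions, not taken from any particular study.

```python
def expected_false_positives(num_tests, alpha):
    """Expected count of spurious 'significant' results when no real effect exists."""
    return num_tests * alpha


def prob_at_least_one_false_positive(num_tests, alpha):
    """Chance that at least one of the independent tests is a false alarm."""
    return 1.0 - (1.0 - alpha) ** num_tests


# Hypothetical scenario: 100 unrelated attributes, each tested at the 5% level.
expected = expected_false_positives(100, 0.05)
at_least_one = prob_at_least_one_false_positive(100, 0.05)
print(expected, at_least_one)
```

Under these assumptions, testing 100 attributes yields about 5 expected false positives and a better than 99% chance of at least one, which is why wide data sets usually call for a multiple-testing correction.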

Apache Hadoop

Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Hadoop was originally designed for computer clusters built from commodity hardware, which remains its most common use, although it has since also found use on clusters of higher-end hardware. All the modules in Hadoop are designed on the fundamental assumption that hardware failures are common and should be handled automatically by the framework.
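The MapReduce programming model mentioned above can be sketched in a few lines of plain Python. This is an in-memory illustration of the map, shuffle, and reduce steps for a word count, not Hadoop's API: the function names and the sample input are assumptions made for the example, and a real cluster would run the map and reduce steps on different machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run the three phases in sequence over a list of input splits."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input: each string stands in for one split of a large file.
splits = ["big data big clusters", "data on commodity clusters"]
counts = word_count(splits)
print(counts)
```

Because each call to `map_phase` depends only on its own split and each call to `reduce_phase` depends only on one key, both steps can be spread across many machines and rerun on another node if one fails.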

SAS (software)

SAS is a statistical software suite developed by SAS Institute for data management, advanced analytics, multivariate analysis, business intelligence, criminal investigation, and predictive analytics.

QlikView

QlikView is a business intelligence platform for turning data into knowledge. It helps users gain unexpected business insights by showing how data is associated and which data is not related. Users can conduct direct and indirect searches across all data anywhere in the application, either globally or within a single field. When the user clicks on a data point in a field, no queries are fired. Instead, all the other fields instantaneously filter themselves based on the selection the user made.
QlikView also offers guided exploration and discovery, as well as collaborative analytics for sharing insights. Additionally, the program allows users to build and deploy analytic apps without professional development skills, which helps organizations respond faster to changing business requirements, shortens time to value, and spreads insight across the organization.
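The selection behavior described above, where one click splits every other field into related and unrelated values, can be illustrated with a small sketch. This is not QlikView's engine or API; the `associate` function, the field names, and the sample rows are all hypothetical and only show the idea on an in-memory table.

```python
# Hypothetical sales table; every field name and value is made up for illustration.
rows = [
    {"region": "North", "product": "Laptop", "year": 2020},
    {"region": "North", "product": "Phone", "year": 2021},
    {"region": "South", "product": "Laptop", "year": 2021},
    {"region": "West", "product": "Tablet", "year": 2020},
]


def associate(rows, field, value):
    """Split every other field's values into associated and excluded sets."""
    matching = [row for row in rows if row[field] == value]
    result = {}
    for other in rows[0]:
        if other == field:
            continue
        all_values = {row[other] for row in rows}
        associated = {row[other] for row in matching}
        result[other] = {
            "associated": associated,
            "excluded": all_values - associated,
        }
    return result


# Simulate the user clicking "Laptop" in the product field.
state = associate(rows, "product", "Laptop")
print(state)
```

Here selecting "Laptop" marks the North and South regions as associated and West as excluded, so the user sees both where laptops were sold and where they were not.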

Tableau

Tableau Desktop is an analytics tool aimed at a broad audience and designed to support how people think. Users combine data by drag and drop to spot trends, identify opportunities, and make data-guided decisions with confidence. Interactive presentations let the audience explore the data in a self-explanatory way. Tableau also supports working offline by extracting data, which allows ad hoc analysis of massive data sets in seconds. It combines advances in database and computer graphics technology, making it possible to analyze huge data sets even on a laptop.
