| Number | Component Name     | Description                                                        |
|--------|--------------------|--------------------------------------------------------------------|
| 1      | Inceptor (Spark)   | Distributed in-memory computing engine                             |
| 2      | Hyperbase (HBase)  | Distributed, real-time online NoSQL data service engine            |
| 3      | Stream (Streaming) | Real-time data processing engine                                   |
| 4      | Discover (R)       | Encapsulates the R language                                        |
| 5      | Manager            | Independently developed graphical cluster management tool          |
| 6      | HDFS               | Hadoop distributed file system                                     |
| 7      | MapReduce          | Distributed data computing model and execution environment         |
| 8      | YARN               | Unified resource management system                                 |
| 9      | ZooKeeper          | Distributed, highly available coordination service                 |
| 10     | Sqoop              | Data synchronization tool between Hadoop and relational databases  |
| 11     | Flume              | Distributed large-scale log collection system                      |
| 12     | Oozie              | Workflow engine                                                    |
| 13     | Elasticsearch      | Full-text search service                                           |