Top
Interstage Big DataParallel Processing ServerV1.0.1 User's Guide
FUJITSU Software

Preface


Purpose of this document

This document gives an overview of the features of Interstage Big Data Parallel Processing Server (hereafter, referred to as "this product"). It also describes the operations required during installation and the settings and operations of this product.

Intended readers

This document is intended for administrators building Big Data analysis systems using this product and who have the knowledge of building infrastructure, along with the knowledge of building and operating Apache Hadoop systems, and of developing Apache Hadoop applications.

Structure of this document

This document is structured as follows:

Part 1 - Product Overview

Chapter 1 Overview

Provides an overview of this product

Chapter 2 Functions

Describes the features provided by this product

Part 2 - Installation

Chapter 3 System Configuration and Design

Explains the server configuration, file system configuration, network configuration, and user account design that should be considered when using this product

Chapter 4 System Requirements

Explains the hardware and software requirements for using this product

Chapter 5 Preparing to Build the System

Describes the preparatory tasks that should be performed prior to installing this product

Chapter 6 Installation

Explains how to install and set up this product

Chapter 7 Uninstallation

Explains how to uninstall this product

Part 3 - Operations

Chapter 8 Starting and Stopping

Explains how to start and stop Hadoop on this product

Chapter 9 Developing and Registering Applications

Describes how to develop applications executed by Hadoop

Chapter 10 Executing and Stopping Jobs

Explains how to use this product to execute and stop Hadoop jobs

Chapter 11 Managing Job Execution Users

Explains the management of job execution users who perform Hadoop job operations

Chapter 12 Adding and Deleting Slave Servers

Explains how to add and delete slave servers after installing this product (after starting operations)

Chapter 13 Adding and Deleting Storage Systems

Explains how to add and delete storage systems after installing this product (after starting operations)

Chapter 14 Backup and Restore

Explains how to back up and restore the system configuration of this product

Chapter 15 Operations when There are Errors

Describes the corrective action to take when an error occurs on a system that uses this product

Chapter 16 Troubleshooting

Describes the troubleshooting data to collect when an issue occurs with a system that uses this product

Appendixes

Appendix A Commands

Explains the commands provided by this product

Appendix B Definition Files

Describes the definition files for the system configuration information used by this product

Appendix C Hadoop Configuration Parameters

Explains the various parameters to configure for using Hadoop on this product

Appendix D Port List

Describes the ports used by this product

Appendix E Messages

Explains the meaning of messages output by this product and the corresponding action that should be taken

Appendix F Mandatory Packages

Covers the packages required for the system software that runs this product

Glossary

Explains the terminology used for this product.


Conventions

The following notation is used in this document:

Interstage Big Data Parallel Processing Server website

The latest manuals and technical information is published on the Interstage Big Data Parallel Processing Server website.

It is recommended to refer to that website before using this product. The URL is shown below.

URL: 
	http://www.fujitsu.com/global/services/software/interstage/solutions/big-data/bdpp/ (as of October2013)

Related documents

The following manuals are bundled with this product:

To refer to the contents of the manuals bundled with this product, refer to the manuals stored at the following locations in the product media:

DISK1: PRIMECLUSTER manuals

dvdDrive:\DISK1\products\PCL\documents\manuals\En

DISK1: ServerView Resource Orchestrator Virtual Edition manual

dvdDrive:\DISK1\products\ROR\DISK1\Manual\en\VirtualEdition

DISK1: Primesoft Distributed File System for Hadoop manual

dvdDrive:\DISK1\products\PDFS\documents\manuals\en

In the bundled manuals, only the features provided by Interstage Big Data Parallel Processing Server can be used.


Abbreviation

The following abbreviations are used in this document:

Abbreviation

Product

Linux
or
Red Hat Enterprise Linux

Red Hat(R) Enterprise Linux(R) 5.6 (for Intel64)
Red Hat(R) Enterprise Linux(R) 5.7 (for Intel64)
Red Hat(R) Enterprise Linux(R) 5.8 (for Intel64)
Red Hat(R) Enterprise Linux(R) 5.9 (for Intel64)
Red Hat(R) Enterprise Linux(R) 6 (for Intel64)
Red Hat(R) Enterprise Linux(R) 6.1 (for Intel64)
Red Hat(R) Enterprise Linux(R) 6.2 (for Intel64)
Red Hat(R) Enterprise Linux(R) 6.3 (for Intel64)
Red Hat(R) Enterprise Linux(R) 6.4 (for Intel64)


Export restriction

If this document is to be exported or provided overseas, confirm legal requirements for the Foreign Exchange and Foreign Trade Act as well as other laws and regulations, including U.S. Export Administration Regulations, and follow the required procedures.

Trademarks

Note that registration symbols (TM or R) are not appended to system names or product names in this manual.


Issue date and version

Edition

Manual code

October 2013: Second edition

J2UL-1563-02ENZ0(00)


Notice

No part of the content of this manual may be reproduced without the written permission of Fujitsu Limited.

The contents of this manual may be changed without notice.


Copyright 2013 FUJITSU LIMITED