Purpose of this document
This document gives an overview of the features of Interstage Big Data Parallel Processing Server (hereafter referred to as "this product"). It also describes the operations required during installation, and the settings and operations of this product.
Intended readers
This document is intended for administrators who build Big Data analysis systems using this product. Readers are assumed to have knowledge of building infrastructure, of building and operating Apache Hadoop systems, and of developing Apache Hadoop applications.
Structure of this document
This document is structured as follows:
Provides an overview of this product
Describes the features provided by this product

Explains the server configuration, file system configuration, network configuration, and user account design that should be considered when using this product
Explains the hardware and software requirements for using this product
Describes the preparatory tasks that should be performed prior to installing this product
Explains how to install and set up this product
Explains how to uninstall this product

Explains how to start and stop Hadoop on this product
Describes how to develop applications executed by Hadoop
Explains how to use this product to execute and stop Hadoop jobs
Explains the management of job execution users who perform Hadoop job operations
Explains how to add and delete slave servers after installing this product (after starting operations)
Explains how to add and delete storage systems after installing this product (after starting operations)
Explains how to back up and restore the system configuration of this product
Describes the corrective action to take when an error occurs on a system that uses this product
Describes the troubleshooting data to collect when an issue occurs with a system that uses this product

Explains the commands provided by this product
Describes the definition files for the system configuration information used by this product
Explains the various parameters to configure for using Hadoop on this product
Describes the ports used by this product
Explains the meaning of messages output by this product and the corresponding action that should be taken
Covers the packages required for the system software that runs this product
Explains the terminology used for this product
Conventions
The following notation is used in this document:
Where features differ in accordance with the system software required to use this product, information is distinguished as shown below.
[Master server] | Information intended for master servers |
[Slave server] | Information intended for slave servers |
[Development server] | Information intended for development servers |
[Collaboration server] | Information intended for collaboration servers |
Unless indicated otherwise, "rack server" in this document refers to the PRIMERGY RX Series.
References are enclosed in " ".
Variable information or content that can be modified is italicized and written in mixed case (for example: newBkpDir).
GUI elements, such as window, menu, and tab names, are formatted bold (for example: File).
Key names are enclosed in < >.
Strings and numeric values requiring special emphasis are enclosed in double quotation marks (").
In usage examples, the prompt is represented by the Linux "#".
Interstage Big Data Parallel Processing Server website
The latest manuals and technical information are published on the Interstage Big Data Parallel Processing Server website.
Refer to that website before using this product. The URL is shown below.
URL: http://www.fujitsu.com/global/services/software/interstage/solutions/big-data/bdpp/ (as of October 2013)
Related documents
The following manuals are bundled with this product:
PRIMECLUSTER 4.3A10
ServerView Resource Orchestrator Virtual Edition V3.1.0
Primesoft Distributed File System V1
The manuals bundled with this product are stored at the following locations on the product media:
DISK1: PRIMECLUSTER manuals
dvdDrive:\DISK1\products\PCL\documents\manuals\En
DISK1: ServerView Resource Orchestrator Virtual Edition manual
dvdDrive:\DISK1\products\ROR\DISK1\Manual\en\VirtualEdition
DISK1: Primesoft Distributed File System for Hadoop manual
dvdDrive:\DISK1\products\PDFS\documents\manuals\en
Of the features described in the bundled manuals, only those provided by Interstage Big Data Parallel Processing Server can be used.
Abbreviations
The following abbreviations are used in this document:
Abbreviation | Product |
---|---|
Linux | Red Hat(R) Enterprise Linux(R) 5.6 (for Intel64) |
Export restriction
If this document is to be exported or provided overseas, confirm the legal requirements of the Foreign Exchange and Foreign Trade Act and of other laws and regulations, including the U.S. Export Administration Regulations, and follow the required procedures.
Trademarks
Apache Hadoop, Hadoop, HDFS, HBase, Hive, and Pig are trademarks of The Apache Software Foundation in the United States and/or other countries.
Adobe, Adobe Reader, and Flash are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States and/or other countries.
Linux is a registered trademark of Linus Torvalds.
Red Hat, RPM, and all Red Hat-based trademarks and logos are trademarks or registered trademarks of Red Hat, Inc. in the United States and other countries.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Microsoft, Windows, MS, MS-DOS, Windows XP, Windows Server, Windows Vista, Windows 7, Excel, and Internet Explorer are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries.
VMware, the VMware logo, VMware vSphere, VMware vCenter, ESXi, vMotion, Storage DRS, Server Appliance, and DirectPath I/O are trademarks or registered trademarks of VMware, Inc. in the United States and other countries.
Interstage, ServerView, Symfoware, and Systemwalker are registered trademarks of Fujitsu Limited.
Other company names and product names used in this document are trademarks or registered trademarks of their respective owners.
Note that registration symbols (TM or R) are not appended to system names or product names in this manual.
Issue date and version
Edition | Manual code |
---|---|
October 2013: Second edition | J2UL-1563-02ENZ0(00) |
Notice
No part of the content of this manual may be reproduced without the written permission of Fujitsu Limited.
The contents of this manual may be changed without notice.
Copyright 2013 FUJITSU LIMITED