TEMARIO
DIA 2
OBJECTIVES
• Describe HDFS Architecture and Operation
• Manage HDFS using Ambari Web, NameNode and DataNode UIs
• Manage HDFS using Command-line Tools
• Summarize the Purpose and Benefits of Rack Awareness
• Summarize Hadoop Backup Considerations
• Summarize the Purpose and Operation of HDFS Centralized Caching
• Identify HDFS NFS Gateway Use Cases
• Install and Configure an HDFS NFS Gateway
MANAGING HDFS STORAGE, RACK AWARENESS, HDFS SNAPSHOTS AND HDFS CENTRALIZED CACHE
LABS
• Managing HDFS Storage
• Managing HDFS Quotas
• Configuring Rack Awareness
• Managing HDFS Snapshots
• Using DistCP
• Configuring HDFS Storage Policies
• Configuring HDFS Centralized Cache
• Configuring an NFS Gateway
TEMARIO
DIA 4
OBJETIVE
• Describe Apache Hadoop
• List Hadoop Cluster Management Choices
• Identify Hadoop Cluster Deployment Options
• Perform an Interactive HDP Installation using Apache Ambari
• Manage Users, Groups and Permissions
• Summarize Operations of the Web UI Tool
• Perform HDFS Shell Operations
HIGH AVAILABILITY WITH HDP, DEPLOYING HDP WITH BLUEPRINTS, AND THE HDP UPGRADE PROCESS
LABS
• Configuring NameNode HA
• Configuring Resource Manager HA
• Adding, Decommissioning and Re-commissioning a Worker Node
• Configuring Ambari Alerts
• Deploying an HDP Cluster Using Ambari Blueprints
• Performing an HDP Upgrade – Express
REQUISITOS PREVIOS
Los estudiantes deben tener experiencia trabajando en un entorno Linux con comandos estándar del sistema Linux. Los estudiantes deberían poder leer y ejecutar scripts de shell básicos de Linux.
AUDIENCIA
El público objetivo de este curso incluye administradores de Linux y operadores de sistemas responsables de instalar, configurar y administrar un clúster HDP.