Skip to main content

Training Big Data with Apache Hadoop

Big Data with Apache Hadoop

Duration : 4 Days (09.00 – 16.00)
Venue  & Price : www.purnamaacademy.com 
Registration : www.purnamaacademy.com

Description :
Hadoop adalah framework atau platform open source berbasis Java di bawah lisensi Apache untuk support aplikasi yang jalan pada Big Data. Hadoop menggunakan teknologi Google MapReduce dan Google File System (GFS) sebagai fondasinya.

Pada awalnya Hadoop dikembangkan oleh Doug Cutting dan Mike Cafarella pada tahun 2005 yang saat itu bekerja di Yahoo. Nama Hadoop berdasarkan mainan 'Gajah' anak dari Doug Cutting.

Beberapa point penting Hadoop :
1. Hadoop merupakan framework/Platform open source berbasis Java
2. Hadoop di bawah lisensi Apache
3. Hadoop untuk support aplikasi yang jalan pada Big Data
4. Hadoop dikembangkan oleh Doug Cutting
5. Hadoop gunakan teknologi Google MapReduce dan Google File System (GFS)

Hadoop optimal digunakan untuk menangani data dalam jumlah besar baik data Structured, Semi-structured, maupun Unstructured. Hadoop mereplikasi data di beberapa komputer (Klustering), sehingga jika salah satu komputer mati/problem maka data dapat diproses dari salah satu komputer lainnya yang masih hidup

Topics include:

Introduction to Hadoop and Big Data:

• What is Big Data?

• What are the challenges for processing big data?

• What technologies support big data?

• What is Hadoop?

• Why Hadoop?

• History of Hadoop

• Use cases of Hadoop

• RDBMS vs Hadoop

• When to use and when not to use Hadoop

• Ecosystem tour

• Vendor comparison

• Hardware Recommendations & Statistics


HDFS: Hadoop Distributed File System:

Significance of HDFS in Hadoop

• Features of HDFS

• 5 daemons of Hadoop

• Data Storage in HDFS

Introduction about Blocks
Data replication
• Accessing HDFS

CLI (Command Line Interface) and admin commands
Java Based Approach
• Fault tolerance

• Download Hadoop

• Installation and set-up of Hadoop


Start-up & Shut down process
• HDFS Federation

Map Reduce:

• Map Reduce Story

• Map Reduce Architecture

• How Map Reduce works

• Developing Map Reduce

• Map Reduce Programming Model

• Creating Input and Output Formats in Map Reduce Jobs

PIG:

• Introduction to Apache Pig

• Map Reduce Vs. Apache Pig

• SQL vs. Apache Pig

• Different data types in Pig

• Modes of Execution in Pig

• Grunt shell

• Loading data

• Exploring Pig

• Latin commands

HIVE:

• Hive introduction

• Hive architecture

• Hive vs RDBMS

• HiveQL and the shell

• Managing tables (external vs managed)

• Data types and schemas

• Partitions and buckets

HBASE:

• Architecture and schema design

• HBase vs. RDBMS

• HMaster and Region Servers

• Column Families and Regions

• Write pipeline

• Read pipeline

• HBase commands

Flume

SQOOP

Participants :  (System Architecture, Database Administrator, IT Developer, IT Manager, CTO, CIO)




Comments

Popular posts from this blog

Smoke Test

Smoke Testing Explanation In computer programming and software testing, smoke testing (also confidence testing, sanity testing, build verification test (BVT) and build acceptance test) is preliminary testing to reveal simple failures severe enough to, for example, reject a prospective software release SMOKE TESTING, also known as "Build Verification Testing", is a type of software testing that comprises of a non-exhaustive set of tests that aim at ensuring that the most important functions work. The result of this testing is used to decide if a build is stable enough to proceed with further testing. The term 'smoke testing', it is said, came to software testing from a similar type of hardware testing, in which the device passed the test if it did not catch fire (or smoked) the first time it was turned on. Smoke Testing Elaboration Smoke testing covers most of the major functions of the software but none of them in depth. The result of t...

How to Pass ISACA CGEIT Certification Exam Dumps Practice Test

ISACA CGEIT certification is mainly targeted to those candidates who want to build their future in IT Governance domain. CGEIT exam can provide those within an information technology related governance field with great benefits, both with potential employers, and unlock a network of like-minded individuals. ISACA's Certified in the Governance of Enterprise IT (CGEIT) exam certification is framework agnostic and the only IT governance certification for the individual. CGEIT can put you in the role of a trusted advisor to your enterprise. CGEITs maintain an adequate level of current knowledge and proficiency in the field of information systems audit, control and security. The technical skill requirement of CGEIT is that the candidate has relevant professional work experience supporting organizational enterprise information technology. CGEIT is a vendor-neutral enterprise IT governance certification which can help take your career to new heights. Free Exam Practice Question and Answer...

TRAINING ENTERPRISE ARCHITECTURE DEVELOPMENT WITH TOGAF-BANDUNG

TRAINING ENTERPRISE ARCHITECTURE DEVELOPMENT WITH TOGAF By Purnama Academy - Training Center January 21, 2018  No comments ENTERPRISE ARCHITECTURE DEVELOPMENT WITH TOGAF Syllabus Overview TOGAF®, introduced by The Open Group, is a proven enterprise architecture methodology and framework used by the world's leading organizations to improve business efficiency. It is an enterprise architecture standard, ensuring consistent standards, methods, and communication among enterprise architecture professionals, so that we can conduct our enterprise architecture work in a better way, including: ·         An iterative process model supported by best practices ·         A re-usable set of existing architecture assets ·         Methods and tools for the planning, development, implementation, and maintenance of an enterprise architecture Togaf Core Concepts : Business Architect...