GitHub - DOCUTEE/HaMu: 🚀 A tool for quickly deploying a fully containerized pseudo-distributed Hadoop cluster, making Hadoop setup faster and easier.

🚀 HaMu is a tool for quickly deploying a fully containerized pseudo-distributed Hadoop cluster, making Hadoop setup faster and easier.

📜 Table of Contents

📖 My Story
👥 Authors
✨ Features
✨ OS support
✨ Prerequisites
🚀 Installation Guide
🔄 Modify the Owner Name
🌐 Interact with the Web UI
✨ Contributors
📞 Contact

📖 My Story (feel free to skip)

Setting up a Hadoop cluster manually can be frustrating, especially for beginners. My friends and I faced several challenges when deploying a multi-node Hadoop cluster on VMware, such as configuration issues, poor scalability, and inefficient resource usage. To solve these problems, I developed HaMu, a tool that simplifies Hadoop deployment using Docker containers.

I first learned about Hadoop in the Introduction to Big Data course at my university. The subject itself was not difficult, but it became challenging when my friends and I had to deploy a multi-node Hadoop cluster on VMware. While Hadoop is powerful, using VMware introduced several inconveniences, such as:

Inability to scroll up to read errors in the terminal
Unfriendly communication with external systems
Weak performance due to virtualization overhead
Difficulty in scaling out
High storage usage for backups
Long rebuild times when making mistakes

To address these issues, I decided to containerize the Hadoop system using Docker. This project serves as a final summary of my knowledge in both Hadoop and Docker.

💡 I hope HaMu helps you quickly set up a Hadoop multi-node cluster, making it easier and more efficient to practice Hadoop. 🚀

👥 Authors

@Quang Nguyen

✨ Features

Deploy a Hadoop multi-node cluster with a single command.
Customize the number of slave nodes.
Set the cluster owner's name.
Interact with the cluster via CLI or Web UI.

🖥️ OS Support

🪟 Windows (via WSL2 or Docker Desktop)
🐧 Linux (Ubuntu, CentOS, Debian, etc.) – ⏳ Coming Soon

📌 Prerequisites

🐳 Docker
🗃️ Basic Knowledge of Hadoop

🚀 Installation Guide

Please select one of the two options !

Modify the Owner Name

If you need to change the owner name, run the rename-owner.py script and enter your new owner name when prompted.

⏳ Note: If you want to check the current owner name, it is stored in OwnerName.txt.

📌 There are some limitations; you should use a name that is different from words related to the 'Hadoop' or 'Docker' syntax. For example, avoid names like 'hdfs', 'yarn', 'container', or 'docker-compose'.

python rename-owner.py

🌐 Interact with the Web UI

You can access the following web interfaces to monitor and manage your Hadoop cluster:

YARN Resource Manager UI → http://localhost:9004
Provides an overview of cluster resource usage, running applications, and job details.
NameNode UI → http://localhost:9870
Displays HDFS file system details, block distribution, and overall health status.

Contributors ✨

Thanks goes to these wonderful people (emoji key):

_{Nguyen Quoc Huy} 💻
Add your contributions

This project follows the all-contributors specification. Contributions of any kind welcome!

Contact

📧 Email: quangforwork1203@gmail.com

💬 My project still has many aspects that need improvement. I would greatly appreciate your feedback!

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
config-hadoop		config-hadoop
linux		linux
windows		windows
.all-contributorsrc		.all-contributorsrc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compose-dynamic.yaml		compose-dynamic.yaml
compose.yaml		compose.yaml
hamu-config.json		hamu-config.json
rename-owner.py		rename-owner.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📜 Table of Contents

📖 My Story (feel free to skip)

👥 Authors

✨ Features

🖥️ OS Support

📌 Prerequisites

🚀 Installation Guide

Modify the Owner Name

🌐 Interact with the Web UI

Contributors ✨

Contact

About

Uh oh!

Releases 3

Packages

Contributors 3

Uh oh!

Languages

License

DOCUTEE/HaMu

Folders and files

Latest commit

History

Repository files navigation

📜 Table of Contents

📖 My Story (feel free to skip)

👥 Authors

✨ Features

🖥️ OS Support

📌 Prerequisites

🚀 Installation Guide

Modify the Owner Name

🌐 Interact with the Web UI

Contributors ✨

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 3

Uh oh!

Languages

Packages