Strategy & Roadmap For Bigtop & Ambari
Strategy & Roadmap For Bigtop & Ambari
Strategy & Roadmap For Bigtop & Ambari
steps below:
• Configure security groups and access permissions to allow communication between services.
• Install the Bigtop repository on the AWS VM following the instructions provided by Bigtop.
• Configure the Bigtop repository to use the desired version (which is mentioned on the provided
requirement file) of Apache Hadoop, Apache Spark, and other services.
3. Install Ambari:
• Follow the Ambari documentation to configure and set up the Ambari server.
• Configure the core Hadoop components, including MapReduce (MR), Yet Another Resource
Negotiator (YARN), and Hadoop Distributed File System (HDFS).
• Configure HBase by specifying the necessary properties, such as ZooKeeper connection details and
data directories.
• Configure Hive by setting up the Hive Metastore (HMS) and specifying the necessary properties.
• Configure Spark by specifying the necessary properties, such as the Spark master URL and
executor settings.
• Configure ZooKeeper by setting up the ZooKeeper data directory and connection details.
NOTE:
1. Install OS, then install bigtop and configure it. After that, install all services via BIGTOP repository
manager. No custom installation, later-on it will be hard to manage.
3. even Hadoop and its distribution will be installed via big top. There are plenty of videos available on
YouTube pls refer.
Bigtop and Ambari play important roles in the installation and management of the Open Source Big Data
Apache services stack.
1. Bigtop:
• Bigtop is an open-source project that provides a framework and packaging system for Apache Big
Data components.
• It simplifies the installation and configuration process by providing pre-built packages for various
Apache projects, including Hadoop, HBase, Hive, Spark, ZooKeeper, and others.
• Bigtop ensures that all the components in the stack are compatible and work together seamlessly.
• It manages dependencies, configurations, and integration between the different services, allowing
for easier deployment and maintenance of the entire stack.
• By using Bigtop, you can leverage its package manager to install, upgrade, and manage the
different Apache services consistently.
2. Ambari:
• Ambari is an open-source management and monitoring platform for Apache Hadoop clusters.
• It provides a web-based interface to simplify the administration and monitoring of the Apache Big
Data stack.
• With Ambari, you can easily deploy and manage the various services, configure their properties,
and monitor their health and performance.
• Ambari integrates with the underlying infrastructure to automate the provisioning, configuration,
and management of the cluster nodes.
• It also offers features like service alerts, metrics collection, and centralized logging for better
cluster monitoring and troubleshooting.
• Ambari supports role-based access control, allowing different users or teams to have specific
permissions and access levels for managing the services.
In the context of installing and configuring the Open-Source Big Data Apache services, Bigtop helps by
providing the packages and framework to easily install and manage the services as a cohesive stack.
Ambari, on the other hand, acts as a management and monitoring tool that allows you to configure,
deploy, and monitor the services using its user-friendly web interface.
Both services together ambari and bigtop help data engineers to take more control on administration of
big data env.
https://www.youtube.com/results?search_query=installation+hadoop+via++apache+bigtop
https://www.youtube.com/watch?v=7aC98dZHQkU
HIGH LEVEL CONCEPTUAL DIAGRAM
Internet
CentOS VM
BIGTOP