A step-by-step guide to installing Hadoop on CentOS.
      Hadoop; Hadoop Technologies, Distributed Computing; Apache Hadoop; MapReduce and Hadoop
Cloud computing is one of the emerging techniques for processing big data. Cloud computing is also known as service on demand. A large set or large volume of data is known as big data. Processing big data (MRI images and DICOM images)...
      Computer Engineering; Cloud Computing; Big Data; MapReduce and Hadoop
Cancer is a major disease that has become the greatest risk to public health because it is difficult to detect early. According to a study by the WHO, from 2019 to date there are four million new cases of cancer and 28.69 million...
      Datamining; HDFS; Namenode; Datanode
Hadoop is the main framework used to process and manage large amounts of data. Anyone who works in programming or data science should become familiar with the platform.
      Hadoop; Data Science; Map Reduce; Big Data
Big data is the large volume of data generated by the public web, social media and other networks, business applications, scientific instruments, various mobile devices, and sensor technologies. Data mining involves knowledge...
      Hadoop; Clustering; Mapreduce; Big Data
Big data is so large that it has become difficult to handle, so it requires special technology. Hadoop is the Apache Foundation's framework that aims to provide efficient storage and analytics of big data;...
      Multi-Agent Systems; Hadoop; Big Data; Kerberos
Dissertation report on HDFC LIC LTD
      HDFS
In today's world, many players in digital technology produce endless quantities of data. Sensors, social networks, or e-commerce: they all generate information that grows in real time according to...
      NoSQL; Hadoop; Big Data; HDFS
With increased use of the internet, data usage is also growing exponentially year on year. Handling such enormous volumes of data required a better processing platform, so a programming model was introduced...
      Computer Science; Information Security; Machine Learning; Database Systems
Big data is the collection of large amounts of data generated by various applications such as social media and e-commerce. Such large amounts of data have proved tedious to store and analyze. Nowadays various tools...
      Hadoop; Big Data; Pig; Hive
With the tremendous amount of research done in the field of numerical methods for engineering, a sharp rise in the number of new algorithms and software tools (academic and commercial) has been observed in the past decades. The advent of...
      Interoperability; HDFS; Integrated Computational Materials Engineering
Hadoop and Spark are widely used distributed processing frameworks for large-scale data processing in an efficient and fault-tolerant manner on private or public clouds. These big-data processing systems are extensively used by many...
      Distributed Computing; Mapreduce; Big Data; MapReduce and Hadoop
The efficiency and scalability of the cluster depend heavily on the performance of the single NameNode.
      Computer Science; Data Mining; Database Systems; Databases
As technology grows and data volumes explode, the need to manage and process data in real time is also increasing. All incoming data therefore needs to be handled within a fraction of a second to extract value from it. This type of data...
      Hadoop; Big Data; Kafka; Hive
The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached...
      Computer Science; Computer Architecture; Distributed Computing; Database Systems
Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google's...
      Mapreduce; HDFS
Nowadays, petabytes of data have become the norm in industry. Handling and analyzing such big data is a challenging task. Even frameworks like Hadoop (an open-source implementation of the MapReduce paradigm) and NoSQL databases like Cassandra, HBase...
      Hadoop; HDFS; Heterogeneous clusters; Iaeme Ijcet
Firms are producing data at ever-increasing rates and finding new ways to use it to create business value. The volumes of data generated create the need for better and cheaper storage options that allow...
      Data Warehousing; Hadoop; Data Modelling; Map Reduce
The Apache Hadoop framework is an open-source implementation of MapReduce for processing and storing big data. However, getting the best performance from it is a big challenge because of its large number of configuration parameters. In this...
      Machine Learning; Hadoop; Mapreduce; Parameters
Hadoop is an open-source virtualization technology that allows the distributed processing of large data sets across standardized server clusters. With two modules, the Hadoop Distributed File System (HDFS) and the MapReduce framework, it is...
      Hadoop; Mapreduce; Big Data; HDFS
The combination of two fast-developing research areas, the Semantic Web and Web Mining, is called Semantic Web Mining. The immense increase in the amount of Semantic Web data has made it an ideal target for some...
      RDF; Pig Farming; Hive; HDFS
Big data is a collection of structured and unstructured data sets that includes huge quantities of data, social media analytics, data management capabilities, and real-time data. For big data processing, Hadoop uses the MapReduce paradigm....
      Hadoop; Map Reduce; Bid Data; Parameters
Big data refers to large volumes of structured, unstructured, and semi-structured data that are difficult to manage and costly to store. Using explanatory analysis techniques to understand such raw data, carefully balance the benefits in...
      Mapreduce; Big Data; HDFS
Data analytics has been growing rapidly in a variety of application areas, such as mining business intelligence from huge amounts of data. The MapReduce programming paradigm lends itself well to these data-intensive analytics jobs, given...
      Computer Science; Hadoop; Map Reduce; Big Data
Quantitative trace element data from high-purity gem diamonds from the Victor Mine, Ontario, Canada as well as near-gem diamonds from peridotite and eclogite xenoliths from the Finsch and Newlands mines, South Africa, acquired using an...
      Diamonds; Trace Elements; HDFS
Big data is a collection of structured, semi-structured, and unstructured data sets that contain large amounts of data, social media analytics, data management capabilities, and real-time data. For big data processing, Hadoop...
      Machine Learning; Hadoop; Map Reduce; Big Data
In recent years, the Hadoop framework has become popular for providing cost-effective solutions for processing large-scale, data-intensive applications in a distributed manner. Storage imbalance during replica placement in Hadoop is harmful,...
      Hadoop; Load Balancing; HDFS
The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached...
      Computer Science; Database Systems; Computer Networks; Databases
The MapReduce model has become an important parallel processing model for large-scale data-intensive applications like data mining and web indexing. Hadoop, an open-source implementation of MapReduce, is widely applied to support cluster...
      Computer Science; Machine Learning; Cloud Computing; Distributed
Log analysis is a critical process in most system and network activities, where log data is used for various purposes such as performance monitoring, security auditing, or even reporting and profiling....
      Hadoop; Map Reduce; Log File Analysis; Hadoop Technologies, Distributed Computing
Lung cancer patients are at heightened risk from COVID-19, and the high mortality rate reported among lung cancer patients with COVID-19 has given pause to oncologists who are faced...
      Data Mining; SPSS; K-means; Decision Tree
Clustering is the process of grouping objects that are similar to one another but dissimilar to objects in other groups. Clustering a large dataset is a challenging, resource- and data-intensive task. The key to scalability and performance benefits it...
      Clustering; Mapreduce; Big Data; HDFS
In today's world, where the Internet is essential and petabytes of data are produced per hour, there is a pressing need to improve the performance and throughput of cloud systems. Traditional cloud systems were not able to give...
      Wireless Communications; Computer Networks; Benchmarking; Hadoop
Sentiment analysis is the process of using text analytics to mine various data sources for opinions. Often, sentiment analysis is performed on data obtained from the Internet and from various social media platforms. Because the content...
      Sentiment Analysis; Big Data; Key generation; Hive
There is an explosion in the volume of data in the world. The amount of data is increasing by leaps and bounds. The sources are individuals, social media, organizations, etc. The data may be structured, semi-structured or unstructured....
      Scheduling; Hadoop; Big Data; HDFS
Hadoop is an open-source virtualization technology that allows the distributed processing of large data sets across standardized server clusters. With two modules, the Hadoop Distributed File System (HDFS) and the MapReduce framework, it is...
      Computer Science; Hadoop; Mapreduce; Big Data
This paper describes the outcome of an attempt to implement the same transitive closure (TC) algorithm for Apache MapReduce running on different Apache Hadoop distributions. Apache MapReduce is a software framework used with Apache...
      Computer Science; Distributed Computing; Information Technology; Machine Learning
Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and...
      Hadoop; Mapreduce; Big Data; HDFS
Big data is so large that it has become difficult to handle, so it requires special technology. Hadoop is the Apache Foundation's framework that aims to provide efficient storage and analytics...
      Hadoop; Multi Agent Systems; Big Data; Kerberos
Nowadays, producing streams of data is not helpful if you cannot store them somewhere. Applications, software, and objects generate huge masses of data, which need to be collected, stored, and made available for analysis. Moreover, these...
      Big Data; Hadoop, Bigdata, NoSQL; Database NoSQL; NoSQL Databases
The objective of the proposed system is to integrate high volumes of data while addressing important considerations such as monitoring a wide array of heterogeneous security. When a real-time cyber attack occurs, the Intrusion Detection...
      Clustering and Classification Methods; Intrusion Detection Systems; Lambda Calculus; Anomaly Detection
With the explosion of data in applications all around us, erasure-coded storage has emerged as an attractive alternative to replication because, even with significantly lower storage overhead, it provides better reliability against data...
      Information Theory and coding; Distributed System; Distributed Systems; Reed-Solomon Codes
Erasure codes are an integral part of many distributed storage systems aimed at Big Data, since they provide high fault-tolerance for low overheads. However, traditional erasure codes are inefficient at replenishing lost data (vital for...
      Hadoop; Big Data; Erasure Coding; Distributed Storage System
New inventions in advanced technology, the enhanced capacity of storage media, the maturity of information technology, and the popularity of social media, business intelligence, and scientific invention produce huge amounts of data, which made ample set...
      Hadoop; HDFS