Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2021, IRJET
…
6 pages
1 file
Data science is a process of extracting knowledge from different structure and unstructured data using algorithms, mathematical methods. Data Science is concept related to big data, machine learning, artificial intelligence and data mining. Data Mining consists of data in statistics format which is extracted from useful information of data. Difference between Data mining and R programming is that, data mining is a concept related to the large data set while R is object oriented programming language to process such big amount of data sets. Everything stored in Object form. This object consists of information related to object and functions which are required to process that data. While considering the applications of R programming it includes finance, banking, healthcare, social media, manufacturing, E-commerce etc. Advantage of R is it is fee of cost and it provides huge number of packages and libraries that contain mathematical, statistical and other methods. R is very simple language which provided powerful tool for data mining and data processing. These tools are applicable in almost each and every filed of research.
American Journal of Software Engineering and Applications
Data mining is a set of techniques and methods relating to the extraction of knowledge from large amounts of data (through automatic or semi-automatic methods) and further scientific, industrial or operational use of that knowledge. Data mining is closely related to the statistics as an applied mathematical discipline with an analysis of data that could be defined as the extraction of useful information from data.The only difference between the two disciplines is that data mining is a new discipline that is related to significant or large data sets. R is an object-oriented programming language. This means that everything what is done with R can be saved as an object. Every object has a class. It describes what the object contains and what each function does. Application of R as a programming language and statistical software is much more than a supplement to Stata, SAS, and SPSS. Although it is more difficult to learn, the biggest advantage of R is its free-of-charge feature and the wealth of specialized application packages and libraries for a huge number of statistical, mathematical and other methods. R is a simple, but very powerful data mining and statistical data processing tool and once "discovered", it provides users with an entirely new, rich and powerful tool applicable in almost every field of research.
2020
In today’s era, R and Python are one of the most promising tools used in all futuristic technologies. Both R and Python are open source programming languages with an abundant collection of libraries that are added continuously to their catalogue. The focus of this paper is to compare the two technologies and address the confusion of many individuals regarding the choice programming language to be used. This review proposes a comparative study between Python and R, explaining the benefits and highlighting the differences between the two. Both Python and R are well evaluated based on their performance parameters with reference to topics like Big Data, Data Analysis, Internet of Things, Machine Learning and other domains related to Data Science. Python being an object oriented programming language, is a good tool to execute algorithms that are used in production. On the contrary, R is a programming language which is used widely by professional such as statisticians, data analysts. This...
SSRN Electronic Journal, 2019
It has been 23 years since the initial version of R was launched. In this duration R has been modified several times, and now it has evolved as a giant in the field of data analysis. Nowadays R is one of the best known tool for data mining, statistics and machine learning. Today R is enjoying a vast community that provides quick response and support. With over 10000 packages available to download as per our requirement, R appears as complete solution for all data science related task .In this paper we have discussed brief history of R project. We have also described the present status of R and distinguish features of R. We have tried to explain why R is the first choice for data analysis by comparing it with other languages available for data science. We have also discussed its limitations and solutions to deal with these. This paper will be beneficial for researchers to gain an insight of R, who are going to work on data analysis related project.
Data Mining is the extraction of knowledge from the large databases. Data Mining had affected all the fields from combating terror attacks to the human genome databases. For different data analysis, R programming has a key role to play. R Studio, an effective GUI for R Programming is used extensively for generating reports based on several current trends models like random forest, support vector machine etc. It is otherwise hard to compare which model to choose for the data that needs to be mined. This paper analyses the performance of B.A. students of Dibrugarh University with respect to caste and gender.
IRJET, 2020
We all know that R and Python, both are open-source programming languages, developed in early 1990's and the two most popular programming tools for Data Science work. While both languages are competing to be the Data Scientist's language of choice, it is hard to pick the best one out of these two languages i.e. R and Python. Yes, it is true if you just stepped into Data Science and looking for the best language to start with. We cannot pick one but can figure out some strengths and weaknesses of both languages. Even you know their pros and cons, it is your choice to pick one that suits best for your use case. This paper deals with the pros and cons of both the languages in deep by taking some examples. The machine learning algorithm which is being used in this example is already known by all of us. Datasets which we used during this project i.e. in the examples are inbuilt and some are taken in natural.
IRJET, 2021
The present data upset isn't just about big data, it is about data, all things considered, and types. While the issues of volume and velocity introduced by the ingestion of gigantic measures of data stay predominant, it is the quickly creating difficulties being introduced by the third v, variety, which requires more consideration. The requirement for a farreaching way to deal with find, access, repurpose, and genuinely coordinate every one of the assortments of data is the thing that has driven us to the improvement of a data science system that structures our establishment of doing data science. Remarkable highlights in this system incorporate issue distinguishing proof, data revelation, data administration and ingestion, and morals. A contextual investigation is utilized to show the structure in real life. We close with a conversation of the significant job for data keenness.
This Paper gives an introduction of Data Mining System and development of different data mining systems. How many data mining tools or systems really exist? What are different platform under which these tools or systems has been designed? What are different areas for which these data mining system has been devised or developed. After doing study and research on data mining system. we have to decide which platform is best for the development of Data Mining System and to understand the importance of data mining system and platform on which it has been developed. As we know what is importance of data mining system for big and large organization, when the company is working in different parts of country and producing large amount of data then it very important to analyze that data which account for productivity and cost benefits from that data. Now it is important to understand that which part of the organization is giving benefit and where the organization is losing. So to take any decision, by the help of Data Mining tools the overall data is combined and given as input to produce as output in the form of high level managerial decision so that decision can increase the profit of organization. Profit of the organization can be seen in terms of working efficiency, productivity etc.
International Journal of Engineering and Advanced Technology (IJEAT), 2020
The terms machine learning, deep learning and data science are buzz words now a days. The usage of these techniques with some technologies like R and Python is most common in the industry and academics. The current work is dealing with the inherent logics existing in the algorithms like Classification, Dimensionality reduction and Recommender systems along with the suitable examples. Some of the applications mentioned here like Facebook, Twitter and LinkedIn to exploit the usage of these algorithms in their daily usage. The discussion about online platforms like Amazon, Flipkart are other areas where the recommender systems were most commonly used algorithms. The outcome of the work is the logical things hidden in the usage of the algorithms and the implementation wise which are packages and functions helpful for the implementation of the algorithms. The belief is the work will be helpful for the researchers and academicians in the context of algorithmic perspective and they can extend the work by contributing their thoughts and views on the same work. Unlike in the normal programming, R/Python simplifies the logic of algorithms so that the lines of code and understanding of the problem is bit simple when compared with general programming languages. The work explains the mail respondents related to the allocation of the house by the company as a response to their mail by considering Urban, semi-urban and rural areas of the customers, the income range of the customers also observed in the allocation of the house. The implementations are with R by using classification and the corresponding results were published with the explanation of the values found in the implementation.
Baltic-Pontic Studies 25, 2021
Contrapunto. Revista de musicología del Programa de Licenciatura en Música, 2022
Journal of Theatre Criticism and Dramaturgy, 2022
Research Journal of Aleppo University, No. 49, 2023
Raptors worldwide, 2004
HispanismeS: Revue de la Société des Hispanistes Français, 2022
Journal of Consulting and Clinical Psychology., 2023
Library Philosophy and Practice, 2018
Asian Journal of English Studies, 2018
Incontro per un Dialogo Fraterno organizzato dalla Consultazione Carismatica Italiana (CCI), Bari 27-28 maggio 2022.
Journal of the American Chemical Society, 2012
The Journal of Geology, 2016
Agrikan Jurnal Agribisnis Perikanan
Australian and New Zealand Journal of Public Health, 2000
Revista Espaco Academico, 2014
Hydrometallurgy, 2018
Nonconvex Optimization and Its Applications, 2000