A Performance Comparison of C 2013 Delph PDF
A Performance Comparison of C 2013 Delph PDF
A Performance Comparison of C 2013 Delph PDF
3, July 2015
ABSTRACT
C# 2013, Delphi XE6, and Python 3.4 are the newest and most popular programming languages. These
programming languages become more popular every passing day. In this study, the response times,
memory usages, and code lengths of these languages were tested in various workloads. Whether or not
there was any significant difference between the data obtained from workloads was tested via the Friedman
test. The test indicated a significant difference. In addition, the Wilcoxon signed rank test was used for
determining the effect size. This test showed that the level of the significant difference found in the
Friedman test was high.
KEYWORDS
1.INTRODUCTION
Most of the developers are choosing the managed programming language. Because, managed
language has the following features.
Further, most of these programming languages are object-oriented programming languages [1].
The languages C# 2013, Delphi XE6 are managed languages. The Python is unmanaged
language. Memory method of unmanaged programming languages is not automatic. Therefore
they are not safe.
Microsoft .NET platform has provided a robust framework. Thus, Windows applications have
been developed so much easier. [2]. The NET Framework is a complete “application”
development platform, which has been developed by the Microsoft and, which has been
established on open Internet protocols and standards. The scope of application concept here is
very broad. Everything from a desktop application to a web browser has been considered within
this platform and everything has been supported. [3].
Programmers and computer scientists conduct research on the advantages and disadvantages of
various programming languages. Gillings and compared C, C++, C#, Java, Perl, and Python
languages in terms of memory usage and speed of execution by using 3 standard bioinformatics
methods. C was found to be the strongest [4]. Arudchelvam et al. compared FORTRAN, C, and
Java programming languages in terms of memory usage and run-time. These three languages
were found to be equal in terms of run-time. FORTRAN was found to be using less memory [5].
Lutz Prechelt compared C, C++, Java, Perl, Python, Rexx, and Tcl programming languages in
terms of program length, programming effort, runtime efficiency, memory consumption, and
DOI : 10.5121/ijpla.2015.5301 1
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
reliability [6]. Konstantin Berlin et al. compared Pthreads, OpenMP, MPI, and UPC parallel
computing APIs [7].
To contribute to works in this field, C# 2013, Delphi XE6, and Python 3.4 programming
languages were compared in terms of response time, memory usage, and code length in the
present study. These programming languages were chosen because they are popular and
commonly used.
2. EXPERIMENTAL STUDY
Of the 400 research articles in software engineering, which need experimental validation, 40% do
not include experiment at all and this ratio in other disciplines 15% [8]. Therefore, in this article,
in order to test the performance of programming languages, experimental algorithms have been
prepared. Whether there is a significant difference among the experimental results obtained have
been tested with the nonparametric Friedman test.
Python is a popular programming language that can develop both independent and script
applications. Python is a portable, strong, easy-to-use, and free programming language. Python is
an object-oriented, interpreted, modular, and interactive programming language. Its indentation-
based simple syntax facilitates learning and remembering it. Thus, it allows starting programming
without wasting any time in the details of syntax. It supports modular structure, class system, and
any input field. It is compatible with almost all platforms (e.g. Unix, Linux, Mac, Windows,
Amiga, Symbian Os). Python allows developing software in many fields such as systems
programming, user interface (GUI) programming, web programming, application, and database
software programming [11].
The features of the computer used in the performance test are as in the following:
2
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
2.2. Workloads
The concept of workload is very important in modeling of computer systems [12]. The
performance evaluation using workload reduces quantity and cost of simulation [13].
Experimental system evaluations generally include workload programs. Every performance
evaluation program is worked with systems having different properties. The behavior of the
system is measured and its performance is interpreted [14]. Workload includes a list of services
that should be fulfilled by the system [2]. Workloads in this study are made up of different
programs. Each of these programs measures a different property of the programming language.
These workloads are shown in Table 1.
Table 1. Workloads
Workload
Explanation
Code
Hello [1] Printing of “Hello World” on the screen for 5000 times
Matrix [2] Multiplication of two matrices of 500 x 500 dimensions
Sorting of the series with 10000 elements, the element values of which are in the
Sorting [3]
worst situation with the Selection Sorting algorithm.
Estimation of the prime numbers at the interval of [1..8193] with the sieve
Sieve [4]
algorithm for 10.000 times
Empty
The empty cycle at the interval of [1.. 100000000]
Cycle [5]
Estimation of the mean of the numbers at the interval of [1..3000] for 30000
Mean [6]
times
Writing and Reading of the character knowledge
Table [7] “abcdefghijklmnopqrstuvwxyz1234567890abcdefghijklmnopqrstuvwxyz123456
7890abcdefgh” with a text file for 10000 times
The algorithms used as workloads have been coded in every language the performance of which
will be tested by using standard properties in a way that they are equal to each other. These
program codes have been converted into executable code (.exe) and their memory consumptions
and response times have been obtained from Windows 7 Command Prompt.
The Hello [1] program tests writing on the screen and loading performance of the program,
Matrix [2] and Mean [6] programs integer arithmetic performance, Sorting [3] program cycle and
logical decision performance. The Sieve [4] program to predict prime numbers by using the
classical Sieve Eratosthenes algorithm. The Sieve program tests the basic logical comparison and
integer arithmetic operation [15]. The Empty Cycle [5] program tests the loop performance of
programming languages. Table [7] program tests reading in the text file and writing performance.
[16].
Metrics of Performance
1 Length of Code (LOC or CLOC)
2 Time of Response (ms-mili second)
3 Memory Usege (KB (kilobyte))
3
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
The density of the lines compiled can be calculated with CLOC*100/LOC formula [18]. The line
numbers of program codes used as a workload in this study have been calculated in line with the
explanations stated above and they have been shown in Table 3.
Table 3. Code Lengths of the Programs Written
* L:LOC CL:CLOC
The graphic of CLOC/LOC values given in Table III are shown in Figure 1.
The concept of response time is very important in computer systems performance test studies.
Response time is the measurement of the time for which a user or an application has to wait until
a command requested is completed [12]. In this study, response times of workloads run in the
programming languages desired to be measured are given in Table 4.
5
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
Figure 3. Mean Response Times for All the Workloads by the Programming Languages
6
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
Minimal descriptive statistics contains the following for a data set: total observation number,
mean, median, standard deviation, minimal value, maximum value and number of observations.
Presentation of descriptive statistics data on dependent variable is significant [16, 19]. One-Way
ANOVA is used when searching the effect of one independent variables on a dependent variable
[20]. In this study, the dependent variable is response time, the independent variable is
programming languages. Significant results have been obtained by applying one-way ANOVA on
these dependent and independent variables. However, because variance equality assumption has
not been ensured, these results have not been presented in the article. Instead, the Friedman test, a
non-parametric method, has been used.
The Friedman test is the non-parametric correspondence of one-way ANOVA test. When the
same samples belonging to the subjects have been treated and when these samples have been
measured at three or more points, the Friedman test is used [20, 21].
The Friedman test has been used in order to find whether there is a significant difference among
response times obtained as a result of running of every workload on C# 2013, Delphi XE6 and
Python 3.4 programming languages. The Friedman test showed that there is a significant
difference between the response times of 3 different programming languages in all workloads
except for Hello[1]. The test data obtained from the Friedman test are shown in Table 6. In
addition, mean rank values and the general average of these values clearly indicate that Delphi
XE6 is the fastest programming language in terms of response time. Statistical results regarding
Hello[1] demonstrate that there is no significant difference for this workload in terms of response
time, χ 2 (2) = 1.4, p > .05.
7
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
Mean Rank
In this study, effect size was calculated, too. The calculation of effect size is important because it
allows identifying the group(s) from which the significant difference between groups arises [20].
Following the Friedman test, the Wilcoxon signed rank test was used for calculating the effect
sizes of the programming languages used. In this way, type 1 mistake was kept at minimum [22].
The Bonferroni correction value was calculated in order to calculate the significance coefficient
of posthoc tests. The α/number of comparisons formula was applied to determine the significance
value of Wilcoxon tests used for posthoc, according to Field (2005). New significance value was
found to be 0.05/3=0.017. Effect size (r) and significance value (p) obtained through the posthoc
test are showed in Table VII.
The below-mentioned formula was used for calculating the effect size: [23].
Post Hoc
Delphi XE6-C# 2013 Python 3.4-C# 2013 Python 3.4-DelphiXE6
Wordload
Z r p Z r p Z r p
Matrix [2] -2,81 -0,63 0,005 -2,8 -0,63 0,005 -2,80 -0,63 0,005
Sorting [3] -2,02 -0,45 0,043 -2,8 -0,63 0,005 -2,8 -0,63 0,005
N
Sieve [4] -2,67 -0,60 0,008 -2,8 -0,63 0,005 -2,8 -0,63 0,005
Empty Loop [5] -2,82 -0,63 0,005 -2,8 -0,63 0,005 -2,8 -0,63 0,005
Mean [6] -2,82 -0,63 0,005 -2,8 -0,63 0,005 -2,8 -0,63 0,005
Table [7] -2,85 -0,64 0,004 -2,8 -0,63 0,005 -2,7 -0,60 0,007
The average of
-0,60 -0,63 -0,63
the r
An effect size (r) closer to zero means a lower insignificant difference between the response times
of programming languages. However, p and r values in the Table VII indicate that the difference
between binary groups is big enough to be considered significant. However, Hello [1] workload is
ignored here. This is because; the Friedman test also indicated no significant difference in this
8
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
workload. The average effect sizes (r) for all workloads also demonstrate that the difference
between binary groups is big enough to be considered significant. In other words, the difference
between the programming languages in terms of response time is big and statistically significant.
The general averages of the response times related to all workloads obtained from the
performance tests indicate that Delphi XE6 is considerably faster than C# 2013 and Python. Also
Delphi XE6 is 3 times as fast as C# 2013 and 49 times as fast as Python. However, C# 2013
writes and reads text file averagely 4.5 times as fast as Delphi XE6. All these results are
statistically significant in all workloads except for Hello [1]. Thus, the results concerning this test
are not interpreted here.
The detailed comparison of the response times of the programming languages by workload shows
the results given below.
The general averages of the response times regarding Matrix [2] and Average [6] workloads
indicate that Delphi XE6 is almost 3 times as fast as C# 2013. This result shows that the integer
arithmetic performance of Delphi XE6 is faster than that of C# 2013.
The results concerning Sort [3] workload indicate that C# is 1.07 times as fast as Delphi XE6 in
logical decision performance. The results related to Sieve [4] workload, which tests basic integer
arithmetic and logical comparison, point out that Delphi XE6 is 1.1 times as fast as C# 2013. C#
2013 is faster only in logical decision performance. However, when both logical decision and
integer arithmetic come into play, Delphi XE6 is faster.
In terms of response time, Python remains falls behind the other two languages in all tests.
Consequently, the programs written in Python run much slower than those written in C# 2013 and
Delphi XE6.
In terms of memory usage, Delphi XE6 uses 50% less memory than C# 2013 and almost 54% less
memory than Python. That is to say, the programs coded in Delphi XE6 can run faster by using
less memory.
Although Delphi XE6 is strong in terms of memory usage, it is weaker than Python and slightly
stronger than C# 2013 in terms of code density. The averages of compiled code density obtained
from all tests show that Python is the strongest. The averages of compiled code density of Python,
Delphi XE6, and C# 2013 are 83%, 55%, and 48% respectively. These figures indicate that code
density not used by Python during compilation is low. In other words, Python does not produce
many codes other than the program codes to be used for solving the problem. Delphi XE6 is
stronger than C# 2013 in terms of code density.
4.CONCLUSION
Delphi XE6 is considerably faster than C# 2013 and Python 3.4 in terms of response time. Python
is stronger than the other two languages only in terms of code density. Delphi XE6 uses 50% less
memory than C# 2013 and almost 54% less memory than Python. That is to say, the programs
coded in Delphi XE6 can run faster by using less memory.
9
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
When all performance measurements are considered, Delphi XE6 is seen to be stronger than the
other two languages. The present study proved that statistically.
Python is quite weak in terms of both memory usage and response time. However, Python is
distinctive relative to C# 2013 and Delphi XE6 in that it has an indentation-based simple syntax,
is compatible with different platforms (e.g. Linux, Pardus, Windows), and is free.
REFERENCES
[1] Blackburn, S. M., McKinley, K. S.,e.g., “Wake Up and Smell the Coffee :Evaluation Methodology
for the 21st Century”, Communications of the ACM, 51:83-89, (2008).
[2] Dick ,J.R., Kent, K. B. And Libby, J. C., “A quantitative analysis of the .NET common language
runtime”, Journal of Systems Architecture, 54:679–696, (2008).
[3] Wikipedi World Wide Web site, http://tr.wikipedia.org/wiki/.NET_Framework
[4] Fourment, M., Gillings, M. R., “A comparison of common programming languages used in
bioinformatics”, BMC Bioinformatics, 9(82):1-9, (2008).
[5] Arudchelvam, T., Wijayakulasooriya, J., Hoole, S. R. H., “Comparison of Performance of Finite
Element Codes in Different Programming Languages Converted From Legacy Finite Element Codes”,
3rd International Conference on Electronics, Biomedical Engineering and its Applications
(ICEBEA'2013), Singapore, 147-151 , (2013).
[6] Prechelt, L., “An empirical comparison of seven programming languages” ,Computer, 33: 23 – 29,
(2000).
[7] Berlin K. , Mary Jacob, J., Kochhar, G. , eg., “Evaluating the Impact of Programming Language
Features on the Performance of Parallel Applications on Cluster Architectures”, Languages and
Compilers for Parallel Computing Lecture Notes in Computer Science, 2958:194-208, (2004).
[8] Juristo, N., Moreno, A., Basics of Software Engineering Experimentation, Kluwer Academic , South
America, Boston, (2001).
[9] Albahari, J., Albahari , B., C# 5.0 in a Nutshell, Fifth Edition, O’Reilly Media, O’Reilly Media, Inc,
United States of America, (2012).
[10] Wikipedi World Wide Web site, [Online]. Available:
http://en.wikipedia.org/wiki/Delphi_(programming_language)#cite_note-23
[11] [Online]. Available:http://pythontr.org/forum/5-gr/7-python-nedir.html
[12] Fortier, P., Michel, H., Computer Systems Performance Evaluation and Prediction, Digital Pres, USA,
Burlington, (2003).
[13] Conte, T. M., Hwu, W., “Benchmark Characterization”, System Sciences, Proceedings of the Twenty-
Fourth Annual Hawaii International Conference on, 1:364-372, (1991).
[14] Conte, T. M., Hwu, W., “Benchmark Characterization for Experimental System Evaluation”, Proc.
Hawaii Int'l Conf. System Science I, 6-18, (1990).
[15] Morin, R. C., Managed C# versus Unmanaged C++ (2009): [Online].
Available::http://www.csharphelp.com/archives2/archive458.html
[16] Karacı, A., “Performance Comparison of Managed C# and Delphi Prism in Visual Studio and
Unmanaged Delphi 2009 and C++ Builder 2009 Languages”, International Journal of Computer
Applications, 26(1): 9-15, (2011).
[17] ŞAHİN, M., “Java, Python Ve Ruby Dillerinin Performans Karşılaştırması”, Akademik Bilişim 2007,
Dumlupınar University, Kütahya, 529-532,(2007)
[18] Chiş, M., “Evolutionary Decision Trees and Software Metrics for Module Defects
Identification”, Proceedıngs Of World Academy Of Science, Engineering And Technology, 28:273-
277, (2008).
[19] Emam, K. El, A Methodology for Validating Software Product Metrics, National Research Council of
Canada,Ottawa, Ontario, Canada NCR/ERC-1076, (2000).
[20] Kalaycı, Ş.,SPSS Uygulamalı Çok Değişkenli İstatistik Teknikleri, Asil Yayın Dağıtım, Turkey,
Ankara, (2008).
[21] Sá, J.M., Applied Statistics Using SPSS, STATISTICA, MATLAB and R , Springer Berlin
Heidelberg, USA, Newyork, (2007).
10
International Journal of Programming Languages and Applications ( IJPLA ) Vol.5, No.3, July 2015
[22] Pallant, J., SPSS Survival Manual, Mc Graw Hill Open University Press, Newyork, (2007).
[23] Field, A. , Discovering Statistics using IBM SPSS Statistics , SAGE Publications, Washington DC,
(2005).
Authors
11