M2EC2019-MultiCloud research paper 4
M2EC2019-MultiCloud research paper 4
M2EC2019-MultiCloud research paper 4
net/publication/331760455
CITATIONS READS
46 10,479
4 authors, including:
All content following this page was uploaded by Thomas Dreibholz on 20 April 2020.
Jiangshui Hong, Thomas Dreibholz, Joseph Adam Schenkel, and Jiaxi Alessia Hu
Abstract The purpose of this paper is to provide a brief overview of cloud comput-
ing technologies, particularly with respect to multi-cloud networks. First, the basics
of cloud computing concepts are discussed. Next we outline some challenges fac-
ing cloud computing, and discuss how multi-cloud systems including multi-clouds,
hybrid clouds, federated clouds, and cross-clouds may be used to deal with some of
these issues. Finally, multi-cloud systems may also be used in conjunction with new
developing technologies such as Big Data and Machine Learning, leading to excit-
ing innovations. These are reviewed in brief. Our goal is to provide a modern look
at the state of the art in multi-cloud computing and review open issues in the field.
The goal is that this paper will help the reader to understand challenges facing cloud
computing, how multi-cloud computing addresses some of these issues, and inspire
community excitement at the future integration of multi-cloud platforms with other
novel technologies.
The National Institute of Standards and Technology (NIST) defines cloud compu-
tation as a “model for granting users access to a shared pool of configurable com-
putational resources including networks, servers, storage, applications, and services
that can be rapidly provisioned (adapted for new services) with little management
or service provider interaction” [25]. Cloud computing is not a new technology in
itself, per se, but rather a new way to incorporate existing technologies [26]. In other
words, cloud computing is simply a way to remotely access and manage computing
resources [34]. This technology has now been integrated into many businesses, as
third-party cloud networks allow firms to focus on core business elements without
1
2 Hong, Dreibholz, Schenkel, Hu
the need to worry about maintenance or the computer infrastructure, essentially turn-
ing computation into the “fifth utility” after water, electricity, gas, and telephony [6].
A major advantage of this technology is that computational resources can be used
far more efficiently, resulting in quicker results due to economies of scale. [6] il-
lustrates this point by comparing the costs of operating one server for 1000 hours
versus 1000 servers for one hour. Though the monetary costs may be the same, the
speed of 1000 servers operating in unison to solve a problem greatly reduces the
time needed for the task.
Clouds have five essential qualities that make them useful to consumers [25].
First of all, they can be remotely accessed at any point by a single individual
without requiring any additional human interaction (on-demand self-service). Ad-
ditionally, these networks can be used through various interfaces, including mo-
bile phones, laptops, and desktop computers (broad network access). Cloud Ser-
vice Providers (CSP) also usually rely on multi-tenant models, where resources are
pooled to serve multiple customers (resource pooling). Another feature of cloud
computing is that these networks can be provisioned easily and released to suit con-
sumer needs (rapid elasticity). Finally, cloud systems feature metering capabilities
for their services that can be used for transparent feedback for both provider and
user (measured service). These characteristics make cloud computing attractive for
small businesses as no upfront investment is needed. Additionally, they also reduce
operation costs, are easily accessible and decrease the pressure of having to main-
tain company networks by outsourcing the infrastructure to providers, allowing a
company to focus on core business elements [26].
Currently there are three cloud computing service models that dominate the in-
dustry: Software as a Service (SaaS), Platform as a Service (PaaS), and Infras-
tructure as a Service (IaaS). In SaaS, the consumer can operate a software on the
provider’s cloud system, typically over the Internet [25, 26]. In PaaS, the provider
allows the consumer to use the cloud network as a platform for their own developed
or acquired programs. Finally, in IaaS, the consumer is provided with virtual storage
and machines.
Now that the basics of cloud computing have been reviewed, it is possible to dis-
cuss some of the issues facing this technology. Though cloud computing is useful
to businesses as it removes the need for planned provisioning and grants small busi-
nesses the ability to scale their computing needs growing in line with their business
requirements, there are some issues and challenges facing cloud systems that could
be the subject of future research [26]. Though still in its infancy, the widespread use
of cloud computing has allowed many researchers and businesses to begin to weigh
in on potential issues facing the technology.
Security: One subject of concern cited by many authors is that of data security
and privacy [6,15,26,27]. In a study by [15], security concerns were flagged as a key
issue in 66 research papers reviewed, with the next highest issue being infrastructure
at 46, followed by data management at 15. In this study, practitioners were also
interviewed, expressing additional concerns, however still indicated that security
was a practical issue, highlighting that this is a pressing matter for many researchers
and businesses.
4 Hong, Dreibholz, Schenkel, Hu
3 Multi-Cloud Solutions
To help deal with some of the issues mentioned above, the next logical progres-
sion in cloud computing is in multi-cloud computing, or cloud systems that utilise
numerous cloud networks and services simultaneously [13]. In short, multi-cloud
systems simply use more than one CSP, though they come in many subcategories
that will be described below. This is similar to the definition of cross-cloud archi-
tectures, defined as systems that span across multiple provisioning boundaries [14].
Though the exact distinction between multi-clouds and cross-clouds is lacking (as
described in further detail below), it can be understood that in multi-clouds, the user
or business in question utilizes different cloud services for different applications in
their business. For example, they may store data on a private cloud, share documents
on the Google Cloud platform, and perform data analysis on yet another cloud. On
the other hand, cross-cloud architectures are designed to make the transfer of data
and utilisation of apps across the clouds more streamlined and cohesive [14].
6 Hong, Dreibholz, Schenkel, Hu
ever, with multi-cloud the components are all unique cloud systems, not deployment
methods, as is the case with hybrid clouds (see below). As discussed above, some
have used “multi-cloud” as an umbrella term to describe some of the other multi-
cloud systems described below, leading to some ambiguity. So, for the definition
presented here, multi-clouds will specifically refer to a cloud system where multiple
different cloud networks are combined for different roles, with the aim of reducing
the necessary trust requirements among the CSPs [24, 30]. In other words, in multi-
cloud networks, unique CSPs are used for a single business or organisation needs,
and they may all have differing levels of application as well as differing SLAs.
Again, an example of this may be a company that uses a small amount of one cloud
to store or send documents, another private cloud for sensitive company data, and
yet another for data analysis. All the clouds in this case may have different SLAs,
different costs, and different degrees of utilisation by the company.
The reasons for using the multi-cloud approach are varied. For example a busi-
ness may seek to use a multi-cloud approach for legal considerations, where the
business may operate on one cloud, but requires a different cloud to legally store
data. An- other reason is to avoid using a single CSP and [9] proposes an archi-
tectural framework and principles for Programmable Network Clouds hosting Soft-
ware Defined Networking (SDN) and Network Function Virtualisation (NFV) for
geographically distributed multi-cloud computing environments. Costs and SLA-
aware resource provisioning and scheduling that minimised the operating cost with-
out violating the negotiated SLAs were investigated with respect to techniques for
autonomic and timely NFV composition, deployment and management across mul-
tiple Clouds. The authors strongly suggested for future works to investigate opti-
misation techniques that simultaneously optimise the VM/container placement and
traffic consolidation, maximising both SLA satisfaction and reducing costs [9].
Arguably the most useful and well-researched application of multi-cloud archi-
tecture offer enhanced security. [5, 7, 18, 30] describe several multi-cloud architec-
tures with a primary focus on security, specifically with medical records. In this
paper, the authors discuss four mechanisms for enhanced security. The first is appli-
cation imitation, where the clouds within the multi-cloud essentially “double-check”
each other, as results from one cloud can be compared with those in a different cloud,
ensuring data integrity. Layer-wise application partition allows for a separation be-
tween logic and data, protecting against flaws in logic. The application provider
cannot recreate logic entirely from user data unless it is entirely executed on the
cloud users system. Partitioning into segments refers to splitting the application into
smaller segments, which are then run on different clouds, and this process involves
two phases: in the first phase, trusted private clouds receive small computational
parts, while untrusted public clouds handle higher loads. In the second phase, the
computation is shared amongst the untrusted clouds. The final mechanism for in-
creasing security is the distribution of chunks, or simply that data is split amongst
the different clouds within the multi-cloud. In this case, different methods for cryp-
tographic data segmentation have been trialed for the storage and retrieval of data.
Several models have been suggested for these mechanisms. For example, a coop-
erative provable data possession (CPDP) scheme was tested by [36]. However, it had
a problem in that the security parameter π 0 is independent of other parameters, al-
lowing authentication bypass via forgery of π 0 in the response sequence [18]. More
recently, [18] have expanded on integrity checking methods using the Co-Check
scheme, which is based on Boneh-Lynn-Shacham (BLS) signature and homomor-
phic tags [18]. As defined by [8], homomorphism is a property by which a problem
8 Hong, Dreibholz, Schenkel, Hu
involve divvying up data amongst clouds for rapid data storage. In Hadoop, an open
source implementation of Google’s method for data storage, large volumes of data
are cut into more manageable chunks across thousands of computers. A parallelised
programming API is then used to distribute the computations to where the data is
located and to aggregate the results. This method has tremendous application for
bioinformatics and genomics. However, the technical savvy and lack of bioinfor-
matics tools that run in parallel has proven to be a barrier up until very recently [4].
They reviewed various big data cloud computing frameworks including Hadoop,
Spark and Flink. It will be interesting to see what further improvements will be
made and how multi-cloud computing architectures will integrate these approaches.
Finally [29] also mentions the use of Amazon Elastic Map Reduce, IBM BigIn-
sights, and Microsoft Azure HDInsight as methods for allowing accessible large-
scale data processing frameworks, namely Apache Hadoop, and Apache Spark.
Machine Learning: Although related to big data, machine learning presents its
own sets of challenges and applications for use with cloud computing technologies.
Given the availability of data stored in clouds (e.g. photographs and videos), there
is a wealth of information that can be processed by machine learning algorithms.
Techniques such as virtual machine optimisation for healthcare services are being
trialed and studied, and show promise, but must be trialed with other diseases to
demonstrate consistency [1]. [16] has explored the use of hybrid cloud storage and
machine learning for use in the oil and gas industry, where technical documents
contain valuable information from disciplines like geoscience and engineering and
are in general stored in a unstructured format. To improve data extraction and util-
isation, the authors propose a machine-learning-enabled platform, consisting of a
carefully selected sequence of algorithms, developed as a hybrid cloud container
that automatically reads and understands the technical documents with little human
supervision. The user can upload raw data to the platform, which is stored on a pri-
vate local server. Structured data is generated as output, which is pushed through to
a search engine that is accessible to the user via the cloud. This allows for a user to
quickly identify the most important parts of technical documents, automate the ex-
traction or relevant data from the documents, present the data in a meaningful way
for further analysis, and finally, allows the user to share it easily and port it to other
platforms.
Future directions for machine learning and multi-cloud computing research in-
clude a shift towards the wide-scale adoption of auto-tuners, especially for the SaaS
layer of the Cloud. [10] also anticipate the advent of new automated tools for cloud
users to benefit from the experience of other users through partially automated ap-
plication builders, automated database sharers, query optimisers, or smart load bal-
ancers and service replicators. In other words, the interface between cloud, applica-
tion, and user, is expected to become more streamlined and intuitive based on what
is learned from the user experience. As security has always been a key concern for
cloud networks of any kind, novel machine learning methods may be used for in-
creased security measures. However, the authors did not specify what these may
look like or on what they would be based, representing a future area of research
application.
An Overview of Multi-Cloud Computing 11
5 Conclusion
References
1. Abdelaziz, A., Elhoseny, M., Salama, A.S., Riad, A.M.: A Machine Learning Model for Im-
proving Healthcare Services on Cloud Computing Environment. Measurement 119 (2018)
2. Abdo, J.B., Demerjian, J., Chaouchi, H., Barbar, K., Pujolle, G.: Broker-based Cross-Cloud
Federation Manager. In: 8th International Conference for Internet Technology and Secured
Transactions (ICITST-2013). London/United Kingdom (2013)
3. Aggarwal, R.: Resource Provisioning and Resource Allocation in Cloud Computing Envi-
ronment. International Journal of Scientific Research in Computer Science, Engineering and
Information Technology 3 (2018)
4. Aisling, O., Jurate, D., D., S.R.: Big data, Hadoop and Cloud Computing in Genomics. Journal
of Biomedical Informatics 46(5) (2013)
5. AlZain, M.A., Soh, B., Pardede, E.: A Survey on Data Security Issues in Cloud Computing:
From Single to Multi-Clouds. Journal of Software 8(5) (2013)
6. Armando, F., Rean, G., Anthony, J., Randy, K., Andrew, K., Gunho, L., David, P., Ariel, R.,
Ion, S.: Above the Clouds: A Berkeley View of Cloud Computing. Tech. Rep. UCB/EECS-
2009-28, EECS Department, University of California, Berkeley (2009)
7. Awasthi, P., Mittal, S., Mukherjee, S., Limbasiya, T.: A Protected Cloud Computation Algo-
rithm Using Homomorphic Encryption for Preserving Data Integrity. In: Recent Findings in
Intelligent Computing Techniques. Singapore (2019)
8. Benzekki, K., Fergougui, A.E., Alaoui, A.E.B.E.: A Secure Cloud Computing Architecture
using Homomorphic Encryption. International Journal of Advanced Computer Science and
Applications 7(2) (2016)
9. Buyya, R., Son, J.: Software-Defined Multi-Cloud Computing: A Vision, Architectural Ele-
ments, and Future Directions. In: Proceedings of the 18th International Conference on Com-
putational Science and Applications (ICCSA) (2018)
10. Buyya, R., Srirama, S.N., Casale, G., Calheiros, R., Simmhan, Y., Varghese, B., Gelenbe,
E., Javadi, B., Vaquero, L.M., Netto, M.A.S., Toosi, A.N., Rodriguez, M.A., Llorente, I.M.,
Vimercati, S.D.C.D., Samarati, P., Milojicic, D., Varela, C., Bahsoon, R., Assuncao, M.D.D.,
Rana, O., Zhou, W., Jin, H., Gentzsch, W., Zomaya, A.Y., Shen, H.: A Manifesto for Future
Generation Cloud Computing: Research Directions for the Next Decade. ACM Comput. Surv.
51(5) (2018)
11. Dreibholz, T.: Big Data Applications on Multi-Clouds: An Introduction to the MELODIC
Project. Keynote Talk at Hainan University, College of Information Science and Technology
(CIST) (2017)
12. Dreibholz, T., Mazumdar, S., Zahid, F., Taherkordi, A., Gran, E.G.: Mobile Edge as Part of
the Multi-Cloud Ecosystem: A Performance Study. In: Proceedings of the 27th Euromicro In-
ternational Conference on Parallel, Distributed and Network-Based Processing (PDP). Pavia,
Lombardia/Italy (2019)
12 Hong, Dreibholz, Schenkel, Hu
13. Elkhatib, Y.: Mapping Cross-Cloud Systems: Challenges and Opportunities. In: Proceedings
of the 8th USENIX Conference on Hot Topics in Cloud Computing. Berkeley/United State
(2016)
14. Elkhatib, Y., Blair, G.S., Surajbali, B.: Experiences of Using a Hybrid Cloud to Construct an
Environmental Virtual Observatory. In: Proceedings of the 3rd International Workshop on
Cloud Data and Platforms. Prague/Czech Republic (2013)
15. Ghanam, Y., Ferreira, J., Maurer, F.: Emerging Issues and Challenges in Cloud Computing –
A Hybrid Approach. Journal of Software Engineering and Applications 5(11A) (2012)
16. Hernandez, N.M., Lucañas, P.J., Graciosa, C.M.J.C., Caezar, I.P.L., Maver, K.G., Yu, C.,
Maver, M.G.: An Automated Information Retrieval Platform For Unstructured Well Data
Utilizing Smart Machine Learning Algorithms Within A Hybrid Cloud Container. In: First
EAGE/PESGB Workshop Machine Learning (2018)
17. Hong, K., Lillethun, D., Ramachandran, U., Ottenwälder, B., Koldehofe, B.: Opportunistic
Spatio-temporal Event Processing for Mobile Situation Awareness. In: Proceedings of the 7th
ACM International Conference on Distributed Event-based Systems (2013)
18. Mao, J., Tian, W., Zhang, Y., Cui, J., Ma, H., Bian, J., Liu, J., , Zhang, J.: Co-Check: Collab-
orative Outsourced Data Auditing in Multicloud Environment. Security and Communication
Networks 2017 (2017)
19. Modi, K.J.: Securing Healthcare Information over Cloud Using Hybrid Approach. In: P.C.
Rani, P.A. K., M. Sudip, P. Bibudhendu, L. Kuan-Ching (eds.) Progress in Advanced Comput-
ing and Intelligent Engineering. Springer, Singapore (2019)
20. Opara-Martins, J.: Taxonomy of Cloud Lock-in Challenges. In: M. Khatib, N. Salman (eds.)
Mobile Computing. IntechOpen, Rijeka (2018)
21. Opara-Martins, J., Sahandi, R., Tian, F.: Critical Analysis of Vendor Lock-in and its Impact
on Cloud Computing Migration: A Business Perspective. Journal of Cloud Computing 5(1)
(2016)
22. Ouyang, C., Moura, M.: A Vision of Hybrid Cloud for Big Data and Analytics. Tech. rep.,
IBM Big Data and Analytic Hub (2017)
23. Petcu, D.: Portability and Interoperability between Clouds: Challenges and Case Study. In:
Towards a Service-Based Internet. Springer, Berlin/Heidelberg (2011)
24. Petcu, D.: Multi-Cloud: Expectations and Current Approaches. In: Proceedings of the Interna-
tional Workshop on Multi-cloud Applications and Federated Clouds. Prague/Czech Republic
(2013)
25. Peter, M., Tim, G.: The NIST Definition of Cloud Computing. Tech. rep., National Institute of
Standards and Technology Gaithersburg, MD 20899-8930 and U.S. Department of Commerce
(2011)
26. Qi, Z.: Cloud Computing: State-of-the-Art and Research Challenges. Journal of Internet Ser-
vices and Applications 1(1) (2010)
27. Sun, Y., Zhang, J., Xiong, Y., Zhu, G.: Data Security and Privacy in Cloud Computing. Inter-
national Journal of Distributed Sensor Networks 10(7) (2014)
28. Taherizadeh, S., Jones, A.C., Taylor, I., Zhao, Z., Stankovski, V.: Monitoring Self-Adaptive
Applications within Edge Computing Frameworks: A State-of-the-Art Review. Journal of
Systems and Software 136 (2018)
29. Taherkordi, A., Zahid, F., Verginadis, Y., Horn, G.: Future Cloud Systems Design: Challenges
and Research Directions. IEEE Access 6 (2018)
30. Thillaiarasu, N., ChenthurPandian, S.: Enforcing Security and Privacy over Multi-Cloud
Framework using Assessment Techniques. In: Proceedings of the 10th International Con-
ference on Intelligent Systems and Control (ISCO). IEEE, Coimbatore/India (2016)
31. Vaquero, L.M., Rodero-Merino, L.: Finding Your Way in the Fog: Towards a Comprehensive
Definition of Fog Computing. SIGCOMM Comput. Commun. Rev. 44(5) (2014)
32. Varghese, B., Buyya, R.: Next Generation Cloud Computing: New Trends and Research Di-
rection. Future Generation Computer Systems 79 (2018)
33. Vurukonda, N., Rao, B.T.: A Study on Data Storage Security Issues in Cloud Computing.
Procedia Computer Science 92 (2016)
34. Winans, T.B., Brown, J.S.: Cloud Computing: A collection of working papers. Deloitte LLC
(2009)
35. Yi, S., Li, C., Li, Q.: A Survey of Fog Computing: Concepts, Applications and Issues. In:
Proceedings of the 2015 Workshop on Mobile Big Data. ACM, Hangzhou/China (2015)
36. Zhu, Y., Hu, H., Ahn, G.J., Han, Y., Chen, S.: Collaborative Integrity Verification in Hybrid
Clouds. In: 7th International Conference on Collaborative Computing: Networking, Applica-
tions and Worksharing (CollaborateCom). IEEE, Orlando/USA (2012)