El Gwekwerere C23156717L MScBDA624 AssignmentOne
El Gwekwerere C23156717L MScBDA624 AssignmentOne
El Gwekwerere C23156717L MScBDA624 AssignmentOne
Assignment Details
Student ID : C23156717L
❖ FJK Consultancy (Pvt) is a tech company that deals in Software Engineering and
Cyber Security. A new director who was recently appointed wants to improve the
company’s engagement of employees across the global centers of excellence (GCE)
to drive innovation, research, and university partnerships.
• Explain how you can accomplish the following using the BDAL, in the description
include how you would:
o Store formal and informal data
o Mine the data for patterns and insights to improve the team’s operations
and strategy
o Present results to stakeholders [30 Marks]
_____________________________________________________________________________
1
C23156717L MSCDA624 Assignment One
Phase 1: Discovery
As FJK Consultancy (Pvt) we will have to identify key roles of the operation. The
roles will define and segregate tasks among key members for a successful analytics
project. For instance a good team for FJK Consultancy will be set as below:
Business user, project sponsor, project manager – whom will understand
and provide key knowledge on main domain areas either fund the project
financially or analytical technicalities reporting to the Chief Technical
Officer (CTO).
Business Intelligence analyst – provides business domain expertise based on
deep understanding of the data.
Data engineer and DataBase Administrator – key functions will be to install
and maintain the performance of database servers and develop processes for
2
C23156717L MSCDA624 Assignment One
optimizing database security and set and maintain database standards and
manage database storage and access.
Data scientist – provides analytic techniques and modeling, and this
involves include extracting data from multiple sources, using machine
learning tools to organize data, process, clean, and validate the data, analyze
the data for information and patterns, develop prediction systems, present
the data in a clear manner, and propose solutions
The data fell into two categories
FJK Consultancy (Pvt) data may fell into two categories which will need to
brainstorm for example; Five years of idea submissions from internal innovation
contests to come up with an agreed content of data to start with and also have
minutes and notes representing innovation and research activity from around the
world for analytical benchmarking at the end of the project.
Their hypotheses can be grouped into two categories namely:
Descriptive analytics of what is happening to spark further creativity,
collaboration, and asset generation
Predictive analytics to advise executive management of where it should be
investing in the future
3
C23156717L MSCDA624 Assignment One
There are many variables to be considered which will need to prove the validity and
accuracy of the model on the test data. Other variable is the dependability of the model
output to domain experts as we ll as well the context of the parameter values of the
4
C23156717L MSCDA624 Assignment One
domain. Data model building also measure accuracy and if additional data inputs are
required, to support the model in runtime environment in case of any change on any
variable
Model building also benchmarks its operations in the event of change of the model used if
it can alter the results as shown by tables. It defines and monitors behavioral characteristics
of each data input variable as shown by the tables up and below, in case of a change in any
variable.
Social graph of top innovation influencers
5
C23156717L MSCDA624 Assignment One
Phase 6: Operationalize
This enables FJK Consultancy analysis plant or project team to consume the outcomes with
the process context and relevant recommendations. It converts abstracts of concepts into
measurable and observable conclusion. Major findings are that in requires more data in
future as some of the data maybe very sensitive. Another parallel test run initiative ie
required to enhance Business Intelligence activities, and to develop a mechanism to
constantly and continuously evaluate the model after its commissioning.
On the FJK Consultancy analytical project, different tools and techniques were used to
define the where the processing was hosted and it includes distributed cloud servers e.g.
Amazon EC2, as data could be stored on distributed storages to enable parallel process of
data across multiple nodes.
The programming model could be by the use of processing e.g. MapReduce for the
purpose of analytical provisions as the data is huge and of complex nature. • This
programming paradigm enables massive scalability across hundreds or thousands of
servers in a Hadoop cluster.
References;
- https://www.researchgate.net/figure/1-Big-Data-Analytics-Lifecycle-by-EMC-
Education_fig31_334588183
- Reeve, A. (2013) Managing Data in Motion: Data Integration Best Practice, Techniques
and Technologies, M. Kaufmann
- Map Reduce Video — Google Chrome . YouTube. 2018 [cited 2023Sep.26]. https://
https://www.youtube.com/watch?v=EmHc9hV5Xi8