© 2018 IJSRST | Volume 4 | Issue 2 | Print ISSN: 2395-6011 | Online ISSN: 2395-602X
Themed Section: Science and Technology
A Review on Big Data Application in Health Care
1
Sridhar Gujjeti1, Dr. Suresh Pabboju2
Assistant Professor, Department of Computer Science and Engineering, Kakatiya Institute of Technology and
Science, Warangal, Telangana, India
2
Professor, Department of Information Technology, Chaitanya Bharathi Institute of Technology(CBIT),
Hyderabad, Telangana, India
ABSTRACT
Big data technologies are progressively utilized for biomedical and health-care informatics research. A lot of
biological and clinical data have been created and gathered at a phenomenal speed and scale. For instance, the
new age of sequencing technologies empowers the handling of billions of DNA sequence data every day, and the
application of electronic health records (EHRs) is archiving a lot of patient data. The cost of getting and breaking
down biomedical data is required to diminish drastically with the assistance of innovation redesigns, for example,
the rise of new sequencing machines, the advancement of novel equipment and programming for parallel
computing, and the broad extension of EHRs. Big data applications introduce new chances to find new
information and make novel strategies to enhance the nature of health care. The application of big data in health
care is a quickly developing field, with numerous new disclosures and philosophies distributed over the most
recent five years. In this paper, we review and talk about big data application in four noteworthy biomedical sub
disciplines: (1) bioinformatics, (2) clinical informatics, (3) imaging informatics, and (4) general health informatics.
In particular, in bioinformatics, high-throughput tests encourage the research of new expansive affiliation
investigations of diseases, and with clinical informatics, the clinical field benefits from the immense measure of
gathered patient data for settling on smart choices. Imaging informatics is presently more quickly incorporated
with cloud stages to share medical image data and work processes, and general health informatics use big data
methods for foreseeing and observing infectious disease flare-ups, for example, Ebola. In this paper, we review the
current advance and achievements of big data applications in these health-care domains and condense the
difficulties, holes, and chances to enhance and progress big data applications in health care.
Keywords: Big Data, Literature Review, Health Care, Data-Driven Application
I. INTRODUCTION
the Swiss-Prot database. ProteomicsDB has a data
volume of 5.17 TB. In the clinical domain, the
In the biomedical informatics domain, big data is
advancement of the HITECH Act9 has about tripled the
another worldview and a biological system that changes
reception rate of electronic health records (EHRs) in
case-based investigations to huge scale, data-driven doctor's facilities to 44% from 2009 to 2012. Data from
research. It is generally acknowledged that the qualities a great many patients have just been gathered and put
of big data are characterized by three noteworthy
away in an electronic arrangement, and these collected
highlights, ordinarily known as the 3Vs: volume,
data
assortment, and speed. To start with and most
administrations
essentially,
developing
opportunities.10,11 what's more, medical imaging (eg,
exponentially in the biomedical informatics fields. For
MRI, CT filters) produces tremendous measures of data
with considerably more perplexing highlights and more
the
volume
of
data
is
instance, the ProteomicsDB8 covers 92% (18,097 of
could
possibly
improve
and
increment
health-care
research
19,629) of known human qualities that are explained in
IJSRST1841286 | Received : 11 Feb 2018 | Accepted : 21 Feb 2018 | January-February-2018 [ (4) 2 : 1231-1238 ]
1231
extensive measurements. One such case is the Visible
"biomedical." Then, we chose papers in light of the
Human Project, which has documented 39 GB of
accompanying consideration criteria:
female datasets.12 These efficient devices for finding
1. The paper was composed in English and
new examples among populace bunches utilizing web-
distributed inside the previous five years (2000–
based social networking data.
2015).
2. The paper talked about the outline and
II. BIG DATA TECHNOLOGIES
utilization of a big data application in the
Biomedical scientists are confronting new difficulties of
biomedical and health-care domains.
3. The paper detailed another pipeline or strategy
putting away, overseeing, and investigating monstrous
for handling big data and talked about the
measures of datasets. The attributes of big data require
execution of the technique.
capable and novel technologies to extricate helpful data
and
empower
more
wide
based
health-care
4. The paper assessed the execution of new or
existing big data applications.
arrangements. In the greater part of the cases revealed,
The accompanying avoidance criteria were utilized to
we found various technologies that were utilized
sift through immaterial papers:
together, for example, artificial intelligence (AI),
1. The paper did not examine a particular big data
alongside Hadoop®, and data mining devices. Parallel
applications (eg, general remarks about big data).
computing is one of the central foundations for
2. The paper was an instructional exercise or a
overseeing big data errands. It is equipped for executing
calculation undertakings at the same time on a group of
course material.
3. The paper was not in the four concentration
machines or supercomputers. As of late, novel parallel
territories: bioinformatics, clinical informatics,
computing models, for example, MapReduce by Google,
general
have been proposed for another big data framework.
informatics.
health
informatics,
and
imaging
All the more as of late, an open-source MapReduce
package called Hadoop was released by Apache for Two hunts were performed. In the principal look, the
distributed
data management. The
Hadoop primary author (JL) and the second author (MW) of
Distributed File System (HDFS) supports concurrent the present investigation started the inquiry procedure
data access to clustered machines. As such, cloud in view of the principle watchwords. All conceivably
computing is a novel model for sharing configurable
related papers were gathered by reviewing the title
computational resources over the network and can
serve as an infrastructure, platform, and/or software
and unique. This underlying inquiry brought about
755 papers from 2000 to 2015. In the second pursuit,
for providing an integrated solution. Many new big
the second author (MW) and the third author (DG)
data applications are based on cloud technologies.
screened the papers in light of the previously
mentioned consideration and prohibition criteria and
III. RESEARCH METHODS
hence chose 94 candidate papers. At long last, each
author of the present examination assessed the last
We sought four bibliographic databases to discover
determination by perusing the substance of the papers,
related research articles: (1) PubMed, (2)
and agreement was come to review 68 papers for this
ScienceDirect, (3) Springer, and (4) Scopus. In looking
investigation.
through these databases, we utilized the fundamental
catchphrases "big data," "health care," and
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1232
IV. BIG DATA APPLICATIONS
have grown new instruments in view of CloudBurst to
help biomedical research, for example, Contrail for
Bioinformatics applications. Bioinformatics research collecting vast genomes and Crossbow for
dissects biological framework variations at the distinguishing single nucleotide polymorphisms (SNPs)
molecular level. With current patterns in customized
from sequencing data.
solution, there is an expanding need to create, store,
and break down these huge datasets in a reasonable
Clinical informatics applications. Clinical informatics
time period. Next-generation sequencing innovation centers around the application of information
empowers genomic data obtaining in a brief timeframe. innovation in the health-care domain. It incorporates
The part of big data methods in bioinformatics movement based research, analysis of relationship
applications is to give data vaults, computing
between patient principle determination (MD) and
foundation,
hidden reason for death (UCD), and storage of data
and
productive
data
manipulation
EHRs
and
different
sources
(eg,
apparatuses for investigators to gather and break down from
biological information. Taylor examines that Hadoop electrophysiological [such as EEG] data). In this area,
and MapReduce are presently utilized broadly inside
we ordered big data technologies/devices into four
the biomedical field.
categories:
(1)
data
storage
and
retrieval,
(2)
interactive data retrieval for data sharing, (3) data
Big data technologies are divided into four categories :
security, and (4) data analysis. Contrasted and
(1) data storage and retrieval, (2) error identification,
(3) data analysis, and (4) platform integration
bioinformatics, clinical informatics does not offer
numerous instruments for error identification but
deployment. These categories are correlated and may
rather gives careful consideration to data-sharing and
cover; for example, most data input applications may
data security issues. Its data analysis strategy is
bolster straightforward data analysis, or the other way
altogether different from bioinformatics, as clinical
around. Be that as it may, our classification in the informatics works with both organized and
present examination is construct just with respect to unstructured data, creates particular ontologies, and
the fundamental elements of every innovation.
utilizations natural dialect preparing widely.
Data storage and retrieval. These days, a sequencing
Data storage and retrieval. It is basic to talk about the
machine can deliver a great many short DNA
manners by which big data systems (eg, Hadoop,
sequencing data amid one run. The sequencing data NoSQL database) are utilized for putting away EHRs.
should be mapped to particular reference genomes The proficient storage of data is particularly
with a specific end goal to be utilized for extra analysis, imperative when working with clinical ongoing
for example, genotype and articulation variation
stream data. Dutta et al evaluated the capability of
analysis. Downpour is a parallel computing model that
utilizing Hadoop and HBase as data distribution
facilitates the genome mapping process. Torrent centers for putting away EEG data and talked about
parallelizes the short-read mapping procedure to their high-performance qualities. Jin et al. broke down
extensive
the capability of utilizing Hadoop HDFS and HBase
sequencing data. The CloudBurst demonstrate was
for appropriated EHRs. Besides, Sahoo et al. and
enhance
the
versatility
of
perusing
evaluated utilizing a 25-center bunch, and the Jayapandian et al. proposed an appropriated structure
outcomes indicate that the speed to process seven for putting away and questioning a lot of EEG data.
million short-peruses was just about 24 times quicker
Their framework, Cloudwave, utilizes Hadoop-based
than a solitary center machine. The CloudBurst group
data preparing modules to store clinical data, and by
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1233
utilizing the handling energy of Hadoop, they built up
Community Living, have gathered and broke down a
an electronic interface for ongoing data visualization
lot of population health data. In this segment, no new
and retrieval. The Cloudwave group evaluated a
methodologies are presented. Rather, we show an
dataset of 77-GB EEG flag data and contrasted
integrated perspective of big data and health from a
Cloudwave
the
population point of view rather than a solitary
outcomes demonstrate that Cloudwave prepared five
medical/clinical movement viewpoint. This segment
and
a
stand-alone
framework;
EEG thinks about in 1 minute, while the stand-alone centers around four territories:
framework took over 20 minutes. Contrasted and a
customary relational database that handles organized
(1) infectious disease surveillance, (2) population
health management, (3) mental health management,
data well, the novel NoSQL is a great prospect for
and (4) chronic disease management.
putting away unstructured data. Mazurek proposed a
and
Infectious disease surveillance. Roughage talked about
NoSQL
the open doors for utilizing big data for worldwide
storehouses to empower data mining systems and give
infectious disease surveillance. They built up a
framework
that
multidimensional
joins
both
technologies
relational
with
adaptability and speed in data handling. Nguyen et al. framework that gives continuous risk checking on
exhibited a model framework for putting away clinical
delineate, out that machine learning and group
flag data, where the time arrangement data of clinical
sourcing have opened new conceivable outcomes for
sensors are put away inside HBase in a way that the
building up a persistently updated atlas for disease
line key fills in as the time stamp of a solitary esteem,
and the segment stores patient physiological esteems
observing. Feed et al trusted that online web-based
social networking joined with epidemiological
that relate with the line key time stamp. To enhance
information is a significant new data hotspot for
the availability and read-capacity of the HBase data
facilitating public health surveillance. The utilization
mapping, the metadata are put away in MongoDB,
of web-based social networking for disease observing
which is a report based NoSQL database. Google Web
was demonstrated by Young et al., in which they
Toolkit is incorporated into the framework to picture
gathered 553,186,016 tweets and separated more than
the clinical flag data.
9,800 with HIV risk-related catchphrases (eg, sexual
practices and medication utilize) and geographic
Public health information. As depicted by Short-liffe annotations. They demonstrated that there is a huge
and Cimino, public health has three center capacities:
positive correlation (P , 0.01) between HIV-related
(1)
assessment, (2) policy development, and (3)
assurance. Among these, evaluation is the essential
tweets and HIV cases in light of predominance
analysis, illustrating the significance of online
and fundamental capacity. Evaluation essentially
networking (eg, Twitter, Facebook) and its potential
includes gathering and breaking down data to track
effect on checking worldwide disease event.
and screen public health status, along these lines
giving
proof
to
basic
leadership
and
policy
Population health management. To think about the
validate appropriation and effect of sociodemographic and
whether the ser-indecencies offered by health medico-administrative variables, Lamarche-Vadel et al.
development. Assurance is utilized
to
foundations have accomplished their underlying
broke down the free association of patient MD and
target objectives for expanding public health results;
UCD. The MD was distinguished by ICD10 code,
all things considered, numerous huge public health
while the UCD was separated from a death registry. In
organizations, for example, the Centers for Disease
the event that MD and UCD were diverse occasions, at
Control and Prevention and the Administration of
that point those occasions were observed to be
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1234
data,
recognizing chances to enhance the essential and
information from 421,460 perished patients was
optional avoidance of cardiovascular occasions in
removed
Ontario's
autonomous.
Utilizing
from
2008
health
to
protection
2009.
The
outcomes
demonstrate that 8.5% of in doctor's facility deaths
assorted
multiethnic
population.
The
examination included data from
and 19.5% of out-of-healing center deaths were free
occasions and that autonomous death was more typical
9.8 million Ontario grown-ups matured $20 years.
in elderly patients. The outcomes demonstrate that
Data
were
gathered
by
connecting
numerous
expansive scale data analysis can be utilized to databases, for example, electronic reviews, health
adequately dissect the association of medical occasions. administration, clinical, laboratory, medicate, and
electronic medical record databases utilizing encoded
Mental
health
management.
Nambisan
et
al.
discovered that messages posted via web-based
individual identifiers. Follow-up clinical occasions
were gathered
V. CONCLUSION
networking media could be utilized to screen for and
conceivably
distinguish
sadness.
Their
analysis
depends on past research of the association between
We are right now in the period of "big data," in which big
depressive issue and monotonous musings/ruminating
data innovation is in effect quickly connected to
conduct. Big data examination devices assume an
biomedical and health care fields. In this review, we
imperative part in their work by mining shrouded demonstrated different cases in which big data innovation
behavioral and passionate patterns in messages, or has assumed a vital part in cutting edge health-care upset,
"tweets," posted on Twitter. Inside these tweets, we as it has totally changed individuals' perspective of healthmight have the capacity to distinguish a disease-
care action. The initial three areas of this review
related feeling pattern, which is a formerly concealed
uncovered that big data applications facilitate three
manifestation. The authors foresee that future imperative clinical activities, while the last segment
research could dive further into the conversations of (particularly the chronic disease management segment)
the discouraged clients to understand more about their
draws an integrated picture of how separate clinical
concealed feelings and notions. What's more, Dabek
activities are finished in a pipeline to oversee singular
and Caban introduced a neural system demonstrate
patients from numerous viewpoints. We outlined late
that can anticipate the probability of creating mental
advance in the most pertinent zones in each field,
conditions, for example, nervousness, behavioral
including big data storage and retrieval, error
scatters, melancholy, and post-traumatic pressure issue.
identification, data security, data sharing and data analysis.
They additionally investigated the adequacy of their
Besides, in this review, we discovered that bioinformatics
model against a dataset of 89,840 patients, and the
is the essential field in which big data examination are as
outcomes demonstrate that they can accomplish a
of now being connected, generally because of the
general exactness of 82.35% for all conditions.
monstrous volume and multifaceted nature of
bioinformatics data. Big data application in bioChronic disease management. Tu et al. presented the
Cardiovascular Health in Ambulatory Care Research
Team (CANHEART), a one of a kind, populationbased observational research initiative went for
estimating and enhancing cardio-vascular health and
the nature of ambulatory cardiovascular care gave in
informatics is relatively mature, with sophisticated
platforms and apparatuses as of now being used to help
investigate
biological
data,
for
example,
quality
sequencing mapping devices. Be that as it may, in other
biomedical
research
fields,
for
example,
clinical
informatics, medical imaging informatics, and public
Ontario, Canada. The research concentrated on
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1235
health informatics, there is huge, undiscovered potential In its latest industry analysis report, McKinsey and
for big data applications.
Company anticipated that big data examination for the
medical field will possibly spare more than $300 billion
This literature review likewise demonstrated that: (1) every year in US health-care costs. Future development of
integrating
distinctive
wellsprings
of
information big data applications in the biomedical fields holds
empowers clinicians to delineate another perspective of foreseeable guarantee since it is subject to the headway of
patient care forms that think about a patient's all new data standards, important research and innovation,
encompassing health status, from genome to conduct; (2) cooperation in research foundations and organizations,
the benefit capacity of novel portable health technologies and solid government motivations.
facilitates ongoing data gathering with more exactness; (3)
VI. REFERENCES
the implementation of dispersed platforms empowers data
filing and analysis, which will additionally be produced
for choice help; and (4) the incorporation of land and [1].
Stratton MR, Campbell PJ, Futreal PA. The
environmental information may additionally build the
capacity to translate gathered data and concentrate new [2].
cancer genome. Nature. 2009; 458(7239):719-24.
Shendure J, Ji H. Next-generation DNA
learning.
sequencing. Nat Biotechnol. 2008; 26(10):113545.
While big data holds critical guarantee for enhancing [3].
Metzker ML. Sequencing technologies - the next
health care, there are a few basic difficulties confronting
all the four fields in utilizing big data innovation; the most [4].
generation. Nat Rev Genet. 2010;11(1):31-46.
huge issue is the integration of different databases. For
next-generation sequencing on genomics. J
instance, the VHA's database, VISTA, isn't a solitary
Genet Genomics. 2011;38(3):95-109.
framework; it is an arrangement of 128 interlinked [5].
Lynch C. Big data: how do your data grow?
frameworks. This turns out to be much more complicated
Nature. 2008;455(7209):28-9.
when databases contain distinctive data composes (eg, [6].
Wilhelm M, Schlegl J, Hahne H, et al. Mass-
integrating an imaging database or a laboratory test comes
spectrometry-based
about database into existing frameworks), subsequently
proteome. Nature. 2014;509(7502):582-7.
constraining a framework's capacity to make questions [7].
Blumenthal D, Tavenner M. The "meaningful
against all databases to gain every single patient datum.
use" regulation for electronic health records. N
The absence of standardization for laboratory protocols
and qualities additionally creates challenges for data [8].
Engl J Med. 2010;363(6):501-4.
integration. For instance, image data can experience the ill
Comparison of Anonymous Techniques for
effects of innovative batch impacts when they originate
Privacy Preserving in Big Data" in "International
from various laboratories under various proto-cols. Efforts
Journal of Advanced Research in Computer and
are made to standardize data when there is a batch impact;
Communication Engineering", Vol 6, Issue 12,
this might be less demanding for image data, yet it is
December
inherently more hard to standardize laboratory test data.
10.17148/IJARCCE.2017.61212
Security and protection concerns additionally stay as
2278-1021, ISSN(print) : 2319-5940
obstacles to big data integration and utilization in all the [9].
Shoban Babu Sriramoju, " Review on Big Data
four fields, and in this way, secure platforms with better
and Mining Algorithm" in "International Journal
communication standards and protocols are greatly
for
required.
Engineering Technology", Volume-5, Issue-XI,
Zhang J, Chiodini R, Badr A, et al. The impact of
Shoban
Babu
Research
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
draft
of
Sriramoju,
the
human
"Analysis
2017,
in
Applied
and
DOI
ISSN(online) :
Science
1236
and
November 2017, 1238-1243 ISSN : 2321-9653],
www.ijraset.com
data processing on large clusters. Commun
[10]. Ackerman MJ. The Visible Human Project: a
resource
for
education.
Acad
Med.
1999;74(6):667-70.
ACM. 2008;51(1):107-13.
[21]. Armbrust M, Fox A, Griffith R, et al. A view of
cloud
[11]. Feldman R, Sanger J., Advanced Approaches in
Analyzing
[20]. Dean J, Ghemawat S. MapReduce: simplified
Unstructured
Data.
Cambridge:
Cambridge University Press; 2007.
[12]. Xu H, Rosenbloom ST, Denny JC, Data from
clinical notes: a perspective on the tension
computing.
Commun
ACM.
2010;53(4):50-8.
[22]. Schuster
SC.
Next-generation
transforms
today’s
2007;200(8):16-8.
sequencing
biology.
Nature.
[23]. Morozova O, Marra MA. Applications of next-
between structure and flexible documentation. J
generation
Am Med Inform Assoc. 2011;18(2):181-6.
functional genomics. Genomics. 2008;92(5):255-
[13]. Weng C, Wu X, Luo Z, An approach for
eligibility criteria extraction and representation.
sequencing
technologies
in
64.
[24]. Taylor
R.
An
overview
of
the
J Am Med Inform Assoc. 2011;18:i116-24.
Hadoop/MapReduce/HBase framework and its
[14]. Hanna M, McKenna A, Banks E. The genome
cur- rent applications in bioinformatics. BMC
analysis toolkit: a MapReduce framework for
Bioinformatics. 2010;11(suppl 12):S1.
analyzing next-generation DNA sequencing
[25]. Schatz MC. CloudBurst: highly sensitive read
data. Genome Res. 2010;20(9):1297-303.
[15]. Chou W-YS, Hunt YM, Beckjord EB, et al.
mapping
with
MapReduce.Bioinformatics.
2009;25(11):1363-9.
Social
media
use
in
the
United
States:
[26]. Schatz M, Sommer D, Kelley D, et al. Contrail:
implications for health communication. J Med
assembly
Internet Res. 2009;11(4):e48.
computing. In: CSHL Biology of Genomes
[16]. Carneiro HA, Mylonakis E. Google trends: a
web-based tool for real-time surveillance of
disease
outbreaks.
Clin
Infect
Dis.
2009;49(10):1557-64.
[17]. Margolis R, Derr L, Dunn M, et al. The National
Institutes of Health’s Big Data to Knowledge
(BD2 K) initiative: capitalizing on biomedical
big data. J Am Med Inform Assoc.
2014;21(6):957-8.
[18]. White T. Hadoop: The Definitive Guide.
of
large
genomes
using
cloud
Conference, Cold Spring Harbor, New York;
CSHL. 2010.
[27]. Gurtowski
J,
Schatz
MC,
Langmead
B.
Genotyping in the cloud with crossbow. Curr
Protoc
Bioinformatics.
2012;Chapter
15:Unit15.3.
[28]. Pandey RV, Schlötterer C. DistMap: a toolkit for
distributed short read mapping on a Hadoop
cluster. PLoS One. 2013;8(8):e72614.
[29]. George
L.
HBase:
The
Definitive
Guide.
Sebastopol, CA: O’Reilly Media, Inc.; 2012.
Sebastopol, CA: O’Reilly Media, Inc.; 2011.
[19]. Shoban Babu Sriramoju, "Mining Big Sources
[30]. Huang W, Li L, Myers JR, et al. ART: a next-
Using Efficient Data Mining Algorithms" in
generation
sequencing
read
"International Journal of Innovative Research in
Bioinformatics. 2012;28(4):593-4.
simu-
lator.
Computer and Communication Engineering"
[31]. Shoban Babu Sriramoju, "OPPORTUNITIES
Vol 2, Issue 1,January 2014 ISSN(online) : 2320-
AND SECURITY IMPLICATIONS OF BIG
9801, ISSN(print) : 2320-9798
DATA MINING" in "International Journal of
Research in Science and Engineering", Vol 3,
Issue 6, Nov-Dec 2017 ISSN : 2394-8299.
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1237
[32]. Siripuri Kiran, 'Decision Tree Analysis Tool
with the Design Approach of Probability
Density Function towards Uncertain Data
Classification', International Journal of Scientific
Research in Science and Technology(IJSRST),
Print ISSN : 2395-6011, Online ISSN : 2395602X,Volume 4 Issue 2, pp.829-831, JanuaryFebruary
2018.
URL
http://ijsrst.com/IJSRST1841198
[33]. Shoban
Babu
Sriramoju,
Chandran,
"UP-Growth
Knowledge
Discovery
Databases"
in
Madan
:
Kumar
Algorithms
from
for
Transactional
"International
Journal
of
Advanced Research in Computer Science and
Software Engineering", Vol 4, Issue 2, February
2014 ISSN : 2277 128X
[34]. Amitha Supriya. "Implementation of Image
Processing System using Big Data in the Cloud
Environment." International Journal for
Scientific Research and Development 5.10
(2017): 211-217.
[35]. SA Supriya. "A Survey Model of Big Data by
Focusing on the Atmospheric Data Analysis."
International Journal for Scientific Research and
Development 5.10 (2017): 463-466.
[36]. Ajmera Rajesh, Siripuri Kiran, " Anomaly
Detection Using Data Mining Techniques in
Social Networking" in "International Journal for
Research in Applied Science and Engineering
Technology", Volume-6, Issue-II, February
2018,
1268-1272 ISSN : 2321-9653],
www.ijraset.com
[37]. Dr. Shoban Babu Sriramoju, "A Review on
Processing Big Data" in "International Journal of
Innovative
Research
in
Computer
and
Communication Engineering" Vol-2, Issue-1,
January
2014
ISSN(online)
:
2320-9801,
ISSN(print) : 2320-9798
International Journal of Scientific Research in Science and Technology (www.ijsrst.com)
1238