International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
Crypto-Asset Trading Analysis
Harsh Wasan1, Rajdeep Khandelwal2, Hemita Haldankar3, Prof. Swati Sharma4
1,2,3Student,
IT Department, Vidyalankar Institute of Technology, Mumbai, India
4Professor, IT Department, Vidyalankar Institute of Technology, Mumbai, India
----------------------------------------------------------------------------------***--------------------------------------------------------------------------Abstract: Initial coin offerings (ICOs) are a new method of raising investment in early stage private ventures (Hall and Lerner
capital for early stage ventures, an alternative to more traditional
2010).
sources of start-up funding. Similar to the stock market the Crypto
currency market has experienced a growth in various investing
options for the investors. On the block chain one can trade crypto
currencies with actual real-life assets and one can also invest in
various fundraisers of startups for ICO’s which can be traded for
monetary value in the startup’s environment. Investing in ICOs can
be very profitable. Picking good token for investment requires
careful research. We are building a software which helps to predict
the prices of ICO’s so that the investors can decide whether to invest
in ICO or not. Since this is an emerging field there are no major
applications or websites which provide consulting services for the
crypto assets. We can access past data and use AI to forecast and
predict future trends and advise investors accordingly
Keywords: Crypto-currency, ICO, AI and ML, Block chain,
Funds.
1. Introduction
An initial coin offering (ICO) or initial currency offering may
be a sort of funding using crypto currencies. Mostly the
method is completed by crowd funding but private ICOs are
getting more common. In ICO, a quantity of crypto currency is
sold in the form of "tokens" ("coins") to speculators or
investors, in exchange for legal tender or other crypto
currencies such as Bitcoin or Ethereum. The tokens sold are
promoted as future functional units of currency if or when
the ICO's funding goal is met and therefore the project
launches. In ICO, a block chain-based issuer sells
cryptographically secured digital assets, usually called
tokens. Explosive growth in the ICO market has attracted
interest from entrepreneurs, investors, and regulators.
According to one estimate, between January 2014 and
December 2018 ICOs raised over $28 billion, and at least 15
individual ICOs to date have taken in more than $100 million.
At the same time, the market has become notorious for
scams, jokes, and frauds. This paper asks which venture and
ICO process characteristics predict real and financial success
for ICO issuers, focusing on whether the market exhibits
dynamics consistent with existing theoretical literature about
entrepreneurial finance and, more recently, about ICOs. ICOs
can provide more security, liquidity and transparency than
conventional financing instruments These features
potentially mitigate costs of asymmetric information and
agency problems that have long deterred arms-length retail
© 2020, IRJET
|
Impact Factor value: 7.34
|
There are three types of digital assets.
1. The first is a general-purpose medium of exchange and store
of value crypto currency, such as Bit coin; these are often
termed coins.
2. The second is a security token, which represents a
conventional security that is recorded and exchanged on a
block chain to reduce transaction costs and create a record of
ownership.
3. The third is a utility token, which gives its holder
consumptive rights to access a product or service.
Utility tokens comprise the largest and most well-regarded
ICOs and are the primary focus of who want to invest in
crypto currency. For example, Ether (the token of the
Ethereum block chain) is a utility token, but its widespread
circulation has led it to become also a store of value. Utility
token ICOs somewhat resemble crowd funding pre-sales of
products on platforms such as Kick starter. Perhaps a closer
analogy is selling tradable ownership rights to stadium seats
before a sports or entertainment venue is built, a practice
that goes back to the 19th century. While utility tokens can
be simple “corporate coupons” that give the holder the right
to an issuer’s product or service, the most well-known ICOs
employ them as the means of payment in a new marketplace.
In this case, we can extend the analogy to suppose that the
unbuilt stadium’s games were to be played (or at least
watched) by people in the grandstands.
Proponents argue that block chains with native tokens
permit disintermediation of Internet marketplaces such as
Uber or Facebook. In these traditional models, the developing
firm’s control over the platform enables it to extract a large
share of the platform’s surplus, and this control also raises
concerns over the developer’s use of transacting party data.
The token’s value is often expected to increase with the value
of the decentralized network. This correspondence enables
three features, though not every ICO makes use of all three.
First, the token can reward the network creators without
giving them control after the network has launched.
ISO 9001:2008 Certified Journal
|
Page 4587
International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
Second, token buyers can fund the platform’s development,
speculating on the long-term value of the service it will
provide in the future.
using ml and then the forecasted costs are shown to the
investors to act as an aid.
Third, like concert tickets, food stamps, or stock certificates,
the token’s value is tied to access to a future good or service,
creating customer commitment.
2. Literature Survey
In a study a sample of 100 geographically dispersed ICOs, for
which they collected data on a wide range of characteristics,
such as whether the token has utility value, previous VC
financing, and founder professional backgrounds. The
practice of raising capital from prospective customers by
selling ownership rights for future seats in an not built arena
dates back at least to Royal Albert Hall in London in the
1860s. Others trace the practice to the time of the
Reformation or even earlier, when European church
construction began to be financed by the advance sales of
pews that were owned in perpetuity by their sponsors and
could be re-sold for profit. For utility tokens to have value,
the issuer must commit to a cap on the total supply, and this
is easily done in a sports arena or church where adding new
seats are physically difficult. Smart contracts can impose
these limits for ICO tokens. They analyzed that ICO possess
characteristics for a subsample of some successful offerings
that subsequently traded on secondary market exchanges for
at least 10 days. Following From the perspective of an early
stage investor, liquidity represents a central benefit of ICOs
relative to conventional financing instruments. Liquidity also
reflects market depth and interest in a token. Sockin and
Xiong (2018) show that token trading enables information
aggregation from potential customers about demand for a
platform’s service, and they conclude that an individual’s
decision to join a token-based platform depends positively on
the token’s trading volume. However, the liquidity of ICOs
may have a dark side if issuers’ ability to cash out quickly
undercuts their incentives to build successful businesses, or
if investors do not have incentives to monitor intensively.
Consistent with the previous analysis of real outcome
predictors, they found that liquidity and trading volume are
higher for tokens that offer voluntary disclosure in a white
paper, credibly commit to the project through insider vesting
restrictions, and signal quality via prior VC investment and
past entrepreneurial success of the CEO. For example,
success is associated with token sales that use dynamic
pricing mechanisms, that promote transparency and crowd
source development by publicly posting source code on
Github, and that have large Telegram user groups. In
contrast, asset management and other crypto financial
services are if anything negatively associated with success.
These results shed light on where the market has perceived
opportunities for value creation. So using some of the afore
mentioned factors we analyze the current live ico’s to predict
their costs in order to advice the investors whether they
should invest in a particular ico or not. We create a web- app
which acquires data from an api and then the data is analyzed
© 2020, IRJET
|
Impact Factor value: 7.34
|
Factors such as-target proceeds, fraction of total token
supply sold, pricing mechanism, distribution method, lockups and set-asides, token rights, and selection of secondary
market exchange [1] affect the cost of the ICO’s. A significant
predictor of survival and employment is whether a token has
apparent utility value, which has strong relevance for current
policy debates over whether ICO tokens are investment
securities in disguise, or whether they represent an
innovation that enables a new venture to raise funds in a way
that promotes future product adoption and loyalty, while also
offering liquidity. Additional factors associated with ICO
success reflect longstanding theories in corporate finance
about the importance of reducing information asymmetry
and the use of bonding and certification strategies to reduce
agency costs. First, they examined factors that predict listing
these largely parallel those that predict survival and
employment. Second, noted that listing is itself an interesting
characteristic, with a strong connection to token liquidity,
they instrument for listing to assess its effect on employment.
Specifically, they used price changes in the Ethereum Classic
(ETC) token around the time of an ICO, focusing on periods
when Bit coin prices are high
They also assessed which characteristics exhibit significant
associations with secondary market liquidity and trading
volume.
Piotr Płoński in his medium post displays how to use ML to
predict the cost of the ICO’s he says that the following factors
[2] affect the ICO’s cost.
ICO Price
Number of ICO tokens (number of tokens available
for sale in ICO)
Total Supply Ratio (number of tokens in ICO divided
by total token number)
ICO Market Cap
Information if a prototype is available
Number of points for team scored by Ian(analyst)
Number of points for advisors scored by Ian(analyst)
Number of points for ICO idea
The output of the model will be Max_CMC_x which is
defined as:
Max_CMC_x = Max_CMC / ICO_Price
From a banker’s perspective, the problem with crypto
currencies [3] is not having insight into cashflows, which is
needed if one wants to both take deposits and lend - cashflow
performance equals credit worthiness. In this sense, banks
ISO 9001:2008 Certified Journal
|
Page 4588
International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
have a competitive advantage over crowdfunding and other
decentralised innovations, including Bitcoin (Dimon, 2016)
The first step in collecting data about each project is to collect
information from the most used Internet sources as icobench,
TokenData or similar. In this step we look for general
characteristics such as the name, the token symbol, start and
end dates of the crowdfunding [4], the country of origin,
financial data such as the total number of issued token, the
initial price of the token, the platform used, data on the team
proposing the ICO, data on the advisory board, data on the
availability of the website, availability of white paper and
social channels. Some of these data, such as short and long
description, and milestones are textual descriptions. Others
are categorical variables, such as the country, the platform,
the category (which can assume many values), and variables
related to the team members (name, role, group). The
remaining variables are numeric, with different degrees of
discretization. Unfortunately, not all ICOs record all variables,
so there are several missing data. The ICO web databases that
we use are fully checked in order to minimize the missing
values of one of the platforms, therefore we validate the
information checking for the details on the website and on
the white paper. As a result, the complete set of reliable
information comes from the matching between the website
and the white paper
The independent variable for this study is the closing price of
Bitcoin in US Dollars taken from the [5] Coindesk Bitcoin
Price Index. Rather than focusing on one specific exchange
this price index takes the average prices from five major
Bitcoin exchanges; Bitstamp, Bitfinex, Coinbase, OkCoin and
itBit. If one were to implement trades based on the signals it
would be beneficial to focus on one exchange.
The standard approach for asset value predictions is based
on market analysis with an LSTM(Long short-term memory)
[6] neural network. Block chain technologies, however, give
us access to vast amounts of public data, like the executed
transactions and therefore the account balance distribution.
We explore whether analysing this data with modern Deep
Leaning techniques leads to higher accuracies than the
quality approach
Initial ICOs [7] showed that one among the most important
problem of this fundraising method was the danger of fraud
and absence of trust to founding teams because the whole
process was administered entirely online. Quickly, some
individuals offered escrow services by acting as a trusted
third party that collected, held, disbursed funds according to
predetermined rules, usually by reaching some milestones
announced by the ICO teams.
© 2020, IRJET
|
Impact Factor value: 7.34
|
Linear Regression is a machine learning algorithm based on
supervised learning [8]. It performs a regression task.
Regression models a target prediction value based on
independent variables. It is mostly used for finding out the
relationship between variables and forecasting. Different
regression models differ based on – the kind of relationship
between dependent and independent variables, they are
considering and the number of independent variables being
used.
Random forests or random decision forests are an ensemble
learning method for classification, regression [9] and other
tasks that operates by constructing a multitude of decision
trees at training time and outputting the class that is the
mode of the classes (classification) or mean prediction
(regression) of the individual trees. Random decision forests
correct for decision trees' habit of over fitting to their
training set
The following are the advantages of Random Forest
algorithm over most of the other algorithms
It overcomes the problem of over fitting by
averaging or combining the results of different
decision trees.
Random forests work well for a large range of data
items than a single decision tree does.
Random forest has less variance then single decision
tree.
Random forests are very flexible and possess very
high accuracy.
Scaling of data does not require in random forest
algorithm. It maintains good accuracy even after
providing data without scaling.
A Random Forest algorithm maintains good accuracy
even a large proportion of the data is missing.
The K-fold Cross Validation (KCV) technique is one of the
most used approaches by practitioners for model selection
and error estimation of classifiers. The KCV consists in
splitting a dataset into k subsets; then, iteratively, some of
them are used to learn the model, while the others are
exploited to assess its performance. However, in spite of the
KCV [10] success, only practical rule-of-thumb methods exist
to choose the number and the cardinality of the subsets. We
propose here an approach, which allows to tune the number
of the subsets of the KCV in a data–dependent way, so to
obtain a reliable, tight and rigorous estimation of the
probability of misclassification of the chosen model.
3. Proposed System
Block Diagram
ISO 9001:2008 Certified Journal
|
Page 4589
International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
4. Hardware and Software Requirements
Software requirements:
Node.js
.Net(if used as an web service)
SQL/mongo dB database
Hardware requirements:
Laptop ( to access the web-app)
5. Methodology
Step 1:
Creating Input data:
We are using past data of the ICOs. Data is fetched by
using
API:https://api.nomics.com/v1/currencies/ticker
The dataset comprises of ICO_Name, Ian_ICO_Grade,
ICO_price, Number_of_ ICO_Tokens,ICO_Market_Cap,
Total_Supply_Ratio, Max_CMC_x indicators.
The ICO price is the most important feature for
predicting ICO returns.
Any Analyst’s grade is the second most important
feature, the analyst’s grade is correlated with the
project’s quality. The higher the grade is, the better
project is, and the higher chance for the price
growth.
There are also number of tokens sold in ICO, ICO
market cap, and total supply it is similar situation to
that of the ICO’s price.
Data files are stored in .csv format
Fig. 3.1 Overall system block diagram
Fig. 3.2 – Data received from api
Step 2:
In order to mitigate the risk involved in investing in ICO’s we
will use machine learning on the data collected by us by call
an API which returns data about the current ICO’s and stored
in the database. Once the model is trained we apply the
trained model to the real time data to predict the variance in
the cost of the individual ICO’s. In order to mitigate the risk
involved in investing in ICO’s we will use machine learning
on the data collected by us by call an API which returns data
about the current ICO’s and stored in the database. Once the
model is trained we apply the trained model to the real time
data to predict the variance in the cost of the individual ICO’s.
In this way the investors can easily see the real time data of
the ICO and also see what the ML model predicts about its
future cost. We also display the data in a graphical format
which makes it easier for the investors to decide for
themselves whether to invest in the ICO or not.
In this way the investors can easily see the real time data of
the ICO and also see what the ML model predicts about its
future cost. which makes it easier for the investors to decide
for themselves whether to invest in the ICO or not.
© 2020, IRJET
|
Impact Factor value: 7.34
|
Training the Data
Next step is to train the gathered data set by using following
machine learning algorithms
1. Regression (Random decision forest)
2. K-fold cross validation.
Random forest
Random forest is a supervised learning algorithm which is
used for both classification as well as regression. But
however, it is mainly used for classification problems [9]. As
we know that a forest is made up of trees and more trees
means more robust forest. Similarly, random forest algorithm
creates decision trees on data samples and then gets the
prediction from each of them and finally selects the best
solution by means of voting. It is an ensemble method which
is better than a single decision tree because it reduces the
over-fitting by averaging the result.
ISO 9001:2008 Certified Journal
|
Page 4590
International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
After training the models and finding the model which has
the most accuracy we will then forecast the cost of the
individual ICO and display it to the user
Checking whether the cost is going to rise in the future or not
one can make a simple statement that whether one must
invest in the ICO or not.
Step 3:
Creating User Interface:
We are using HTML, CSS, BOOTSTRAP, JavaScript and PHP
for designing front-end of website.
Fig 5.1
Working of Random Forest Algorithm
Step 1 − First, start with the selection of random
samples from a given dataset.
Step 2 − Next, this algorithm will construct a
decision tree for every sample. Then it will get the
prediction result from every decision tree.
Step 3 − In this step, voting will be performed for
every predicted result.
Step 4 − At last, select the most voted prediction
result as the final prediction result.
Through this user can interact with software and can see the
required predicted results.
Our front-end interfaces are going to be user-friendly for
both new and professional traders. Typical functions will be
presented with intuitive visual feedback that is common to
other prevailing platforms. Our platform is modular,
lightweight and extendable. This ensures that we offer topnotch services for all our users while retaining our usability
and productivity.
6. Result
k-Fold Cross-Validation
Cross-validation is a resampling procedure used to evaluate
machine learning models on a limited data sample.
The procedure has a single parameter called k that refers to
the number of groups that a given data sample is to be split
into. As such, the procedure is often called k-fold crossvalidation. When a specific value for k is chosen, it may be
used in place of k in the reference to the model, such as k=10
becoming 10-fold cross-validation.
Fig. 6.1 User Interface
Cross-validation is primarily used in applied machine
learning to estimate the skill of a machine learning model on
unseen data. That is, to use a limited sample in order to
estimate how the model is expected to perform in general
when used to make predictions on data not used during the
training of the model.
The purpose of cross validation is not to help select a
particular instance of the classifier (or decision tree, or
whatever automatic learning application) but rather to
qualify the model, i.e. to provide metrics such as the average
error ratio, the deviation relative to this average etc. which
can be useful in asserting the level of precision one can
expect from the application. One of the things cross
validation can help assert is whether the training data is big
enough.
© 2020, IRJET
|
Impact Factor value: 7.34
|
Fig. 6.2 List of ICO
ISO 9001:2008 Certified Journal
|
Page 4591
International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020
p-ISSN: 2395-0072
www.irjet.net
References
[1] Sabrina T. Howell, Marina Niessner ,David Yermack
“INITIAL COIN OFFERINGS: FINANCING GROWTH WITH
CRYPTOCURRENCY TOKEN SALES”
https://www.nber.org/papers/w24774.pdf
[2]
https://medium.com/@MLJARofficial/predicting-icoreturns-with-machine-learning-af6108ab9e39
[3]
Initial Coin Offering (ICOs) Risk, Value and Cost in Blockchain
Trustless crypto-markets Percy Venegas
https://scihub.ren/https://papers.ssrn.com/sol3/papers.cfm?abstract_i
d=3012238
[4]
http://economiaweb.unipv.it/wpcontent/uploads/2018/01/DEMWP0164.pdf
Paola Cerchiello (Università di Pavia) Anca Mirela Toma
(Università di Pavia), July 2018
[5]https://arxiv.org/pdf/1810.06696.pdf
Besarabov, Todor Kolev, 15 Oct 2018
Zvezdin
[6] http://trap.ncirl.ie/2496/McNally, Sean, (2016) .
[7]https://sciHub.ren/https://jai.iijournals.com/content/21
/4/13.abstract Dmitri Boreiko, Gioia Vidusso, October 2018
[8]https://www.geeksforgeeks.org/ml-linear-regression/
[9]https://www.tutorialspoint.com/machine_learning_with_
python/machine_learning_with_python_classification_algorit
hms_random_forest.htm
[10] “The ‘K’ in K-fold Cross Validation”
Davide Anguita, Luca Ghelardoni, Alessandro Ghio, Luca
Oneto and Sandro Ridella 2012
https://ww.elen.ucl.ac.be/Proceedings/esann/esannpdf/es2
012-62.pdf
[11]
https://towardsdatascience.com/icoomen-using-machinelearning-to-predict-ico-prices-29fa4cec6d86
© 2020, IRJET
|
Impact Factor value: 7.34
|
ISO 9001:2008 Certified Journal
|
Page 4592