Assignment SI Dr. Javed Iqbal Fall 21 New
Assignment SI Dr. Javed Iqbal Fall 21 New
Assignment SI Dr. Javed Iqbal Fall 21 New
1) Are different continents of the world equally good in sports? One way to answer this
question is to compare the average of the total number of (gold, silver, bronze) summer
Olympic medals won per capita by countries in each continent. You can consider the
record of last three Olympics only. Aggregate the number of total medals for each continent
(Asia, Europe, South America, North America, Africa) (ignore Australia) and investigate
this question using pairwise t –tests and ANOVA. Then consider the cross country analysis
of y = total number of medals in last three Olympics by each country and relate it to some
explanatory variables e.g. per capita GDP, population size etc. of that country. Perform the
relevant analysis and prepare a report. Use your multiple regression model predict the
number of total medals won for this country and compare this prediction with actual medals
of this country. Make 5 such predictions and present the information in a table. Prepare a
report of your analysis. Data can be searched using the relevant Olympics website or
Wikipedia. Select at least 15 countries from each continent.
3) Analysis of text book prices of various subjects on Amazon. Consider the Amazon Text
Book section (https://www.amazon.com/Best-Sellers-Books-Textbooks/). Compare the
average prices of new paperback books for sale across the subjects (e.g. Business and
Finance, Communication and Journalism, Computer Science, Engineering, Medical and
Health Sciences, Social Sciences etc.) using pairwise t tests and ANOVA. Then related the
y = price ($) to relevant characteristics e.g. number of total ratings, number of stars, years
since publication etc. via simple and multiple regression. Use your multiple regression
model predict the text book price for a text book and compare this prediction with actual
price of this book. Make 5 such predictions and present the information in a table. Perform
analysis and write a report of your analysis. Select at least 15 books from each group.
6) How are world university rankings related to student/faculty ratio? This question can be
answered by looking at latest QS TOPUNIVERSITIES
(https://www.topuniversities.com/universities) and other relevant sources. Compare the
average student faculty ratio across countries/regions of the world or across subjects via t-
tests (pairwise) and ANOVA using at least 15 observations. Then for individual
universities relate the y = student/faculty ratio to relevant explanatory variables e.g. total
QS score, international students etc. Use your multiple regression model predict student
faculty ratio for a university and compare this prediction with actual ratio of this university.
Make 5 such predictions and present the information in a table. Perform analysis and write
a report of your analysis.
8) Comparing box office revenues of different types (e.g. action, crime, animated, comedy,
documentary etc.) of Hollywood movies released over last 10 years. You can collect data
on at least 15 movies of each type and compare the average box office revenue across the
movie types via t-tests (pairwise) and ANOVA. Then you can relate the y = box office
revenue of an individual movie with the explanatory variables e.g. number of weeks run,
movie type (coded as dummy variables). The data can be collected from relevant industries
e.g. for the UK the Statistical Year Book for various years can be accessed from
https://www.bfi.org.uk/industry-data-insights. For revenue data. Other relevant data can
also be accessed from relevant Wikipedia articles. Use your multiple regression model
predict box-office revenue for a movie and compare this prediction with actual box-office
revenue of this movie. Make 5 such predictions and present the information in a table.
Perform analysis and write a report of your analysis. Select at least 15 movies from each
type.
9) Consider estimating risk and return and classifying riskiness of stocks listed on the US
stock market. The website https://finance.yahoo.com/lookup/ contains the list of symbols
from where you can select company names (select at least 10 companies). Select the desired
companies and go to historical data. In the ‘Time Period’ select at least 10 years starting
from Jan 2008 to present. In the ‘Frequency’ select monthly and click Apply. Thus your
analysis is based on monthly data of 10 years. Download closing price (5th column) data in
excel format. From prices calculate return as percent change. Also select the data on closing
monthly prices for the market index i.e. S&P500 index (symbols: ^GSPC).
𝑅𝑖𝑡 = 𝛽0 + 𝛽1 𝑅𝑚𝑡 + 𝜖𝑖𝑡 , i stands of ith company and t stands for tth month.
Here 𝛽1 provides estimate of systematic risk of security, where a value greater than 1
indicates an aggressive or risky security and a beta1 of less than 1 indicates a defensive
security or less risky than the market. For all the companies compute descriptive statistics
and present them in a table. Compare the companies with respect to their financial
soundness for investors. For each stock compute beta, R-Sq of regression,
𝑅̅ −𝑅
𝑓 𝑅̅ −𝑅
Sharpe Ratio = 𝑆𝑡.𝐷𝑒𝑣(𝑅) and Tryenor’s Ratio = 𝐵𝑒𝑡𝑎𝑓 , where beta is the systematic risk of
the security as estimated above and 𝑅𝑓 is risk free rate of return e.g. on 30 day treasury
bills. These are measures of risk adjusted returns thus measure investment attractiveness
of stocks. You can work on full sample period and some sub-periods e.g. before 9/11, after
9/11 or pre-post global financial crises of 2007 etc. to investigate what changes are brought
about in the risk profile of companies.
11) Risk and Returns of Pakistani Industries: Collect the historical monthly closing stock price
data of firms belonging to different sectors of Pakistan Stock Exchange. The sector and
constituent firms can be selected as follows:
https://markets.brecorder.com/market-data/information-by-sectors.html
Automobile Assembler: Atlas Honda, Dewan Farooq Motors, Pak Suzuki Motor Company, Indus
Motor Company, Millat Tractors, Hino Pak Motors, Al-Ghazi Tractors, Sazgar
Engineering Works
Cement: Attock Cement, Bestway Cement, D.G. Khan Cement, Dadhabhoy Cement, Dewan
Cement, Fauji Cement Company, Fecto Cement, Lucky Cement
Chemicals: Berger Paints Pakistan, Biafo Industries, ICI Pakistan, Pakistan PVC, Colgate
Palmolive, Engro Polymer and Chemicals, Ittehad Chemicals, Sitara Chemicals
Engineering: Ados Pakistan, Agha Steel Industries, Aisha Steel Mills, Bolan Castings, Hufffaz
Seamless Pipes Industries, Pakistan Engineering Corporation, Quality Steel Works, Ittefaq
Iron Industries
Oil and Gas: Mari Petroleum Company, Oil and Gas Development Company (OGDC), Pakistan
Oilfields, Pakistan Petroleum, Attock Petroleum, Shell Pakistan, Sui Northern, Sui
Southern
Pharmaceuticals: Abbot Laboratories, Citi Pharma, Ferozsons Laboratories, GlaxoSmithKline
Pakistan, The Searl Company, Wyeth Pakistan, Macter International, Sanofi-Anentis
Pakistan
Consider monthly closing price data from Jan 2015 to 2021 (latest available month).
(https://markets.brecorder.com/market-data/company-archives.html). Select terms as monthly
and choose date from Jan 2015 to current month. If complete data for a firm are not available,
you can select another firm from the same sector for which complete data are available. After
collecting price data, compute monthly returns (percentage change in price) for each firm. Then
form (equally weighted) portfolio returns for each sector (average of returns for firms within a
sector). You should get a time series variable of portfolio returns for each of the six sectors.
Also collect data on closing value of KSE-100 Index and compute returns (market returns).
Fixing a sector of your choice (e.g. Chemicals), perform t- tests of equality of mean returns for
sector for each pair of sectors. Then conduct an ANOVA for testing the equality of all sector
mean returns (assuming constant variance of returns for each sector portfolio). Next conduct
a regression analysis on individual firm returns on market returns and assess the riskiness of
each firm (beta risk). You can also use other variables e.g. inflation or exchange rate in the
regression for which you need to collect monthly data on these variables (e.g. from IFS). Write
a report of your analysis.
12) Consider the popular macroeconomic data set ‘Penn World Tables latest version 10’
(http://www.rug.nl/ggdc/productivity/pwt/). Download data in Excel. This database
contains macroeconomic data for about 180 countries from 1950 to 2019. Consider the
variable Human Capital index which measure the quality of human capital based on
education and returns to education. First sort the data with respect to year from largest to
smallest so that you should have data of 2019 for all countries on top. Collect data of human
capital (hc) for all countries for 2019 only. Compare and analyze human capital quality
across groups of countries. e.g. across the continents of world or across development level
i.e. with respect to per capital income. (you can use the following world bank site to classify
countries with respect to per capita income
https://datahelpdesk.worldbank.org/knowledgebase/articles/378833-how-are-the-income-
group-thresholds-determined. Check the worksheet ‘County Analytical History’. Use 2019
data to classify countries Low Income (L), Lower middle income (LM), Upper middle
income (UM), and high income (H). For each group of countries collect the data of human
capital from the PWT for year 2019 and compare the ‘hc’ across group of countries. Next
you can link this variable with other interesting variable i.e. size of government measured
by proportion of government expenditure in GDP (csh_g) to examine the relationship using
simple and multiple regression analysis. Use your multiple regression model predict hc for
a country and compare this prediction with actual hc of this country. Make 5 such
predictions and present the information in a table. Perform analysis and write a report of
your analysis. Write a report of your analysis. Select at least 15 countries from each group.
14) Estimate and compare total time in hours allocated or used in last week by students of IBA
on social media or watching electronic media across gender and across year of study. For
this case you have to design a questionnaire based survey asking students about the relevant
aspects. Input data in excel and analyze using descriptive statistics and regression analysis.
Get the survey form approved from me before conducting survey.
16) Consider comparing and investigating relationships between price and other characteristics
within a brand or across brands of products e.g. between mobile phones. The website
www.mega.pk contains such information. Select product e.g. laptops or mobile, and a
brand e.g. Dell (core i5 or any particular core). Note down prices (PKR) and some
important characteristics which can be related to price e.g. processing speed (GHz), RAM,
hard disk storage, for mobiles (Samsung, Apple, Huawei, Lenovo etc), select x variables
e.g. screen size (inches), battery life (mAh), camera quality (MP). Perform a descriptive
analysis of prices and comparisons across brands with similar characteristics. Within each
brand you can investigate relationship of dependent variable price with related
characteristics e.g. price vs size, battery life, camera quality etc. Investigate which
characteristic is a good predictor of price e.g. using R-Sq of regression and through scatter
plot. Then select any product e.g. a particular mobile from this website and note down its
features (x variables) and using your multiple regression model predict its price and then
compare this prediction with actual price given for the product. Make 5 more such
predictions and present information in a table.
17) The Quality of Government (QoG) Institute conducts surveys on different aspects of
quality of government intuitions e.g. transparency, level of corruption etc. for different
countries. Consider the Expert Survey data provided in the csv file on QoG Country-Level
Survey Data 2015. (https://qog.pol.gu.se/data/datadownloads/qogexpertsurveydata). The
data correspond to different questions on government quality. The questions are available
at (https://www.qogdata.pol.gu.se/data/qog_exp_15.pdf). Consider Q-9a which is related
to percentage of aid amount actually delivered to the needy persons. This percentage is
provided for all the countries and can be considered an estimate of level of transparency in
public sector. Provide a descriptive analysis including histogram, charts, summary
statistics, and box plots across the geographic regions (ht-region). Also conduct regression
analysis relating this y=’level of transparency’ to other variable you think might be related
to this e.g. an x variable in Q2a (to what extent merit and skills are considered for public
sector employment). Write a report of your findings
18) Analysis of financial ratios: Consider the SBP database: Analysis of Financial Statement
for Non-Financial Companies. The latest document (2014-2019) is available at:
(https://www.sbp.org.pk/departments/stats/FSA(Non).pdf). The document contains annual
data for each company and overall industry sector for all the non-financial firms listed on
Pakistan Stock Exchange. (There is a separate document for financial firms). The list of
sectors and constituent firms is on p-450 of the document. Consider any 6 sectors from
these sectors (1) Spinning, Weaving, Finishing, (2) Sugar (3) Chemicals &
Pharmaceuticals (4) Food Products (5) Manufacturing (6) Cement (7) Motor Vehicles,
Trailers & Auto parts (8) Fuel & Energy (9) Information, Comm. & Transport (10) Paper,
Paperboard & Products. Select any 8 firms of your choice from each of your selected
sectors. Consider the data for any latest year e.g. 2019. Consider any profitability ratio e.g.
Returns on Equity or Return on Assets. You should have profitability ratio data on 48 firms.
(6 sectors * 8 firms). Perform a graphical and descriptive comparative analysis of the
selected profitability ratio over the selected sectors. Compare the bivariate average
profitability ratio using t tests and test the average profitability of all the selected sectors
using the ANOVA. Then using individual firms’ data using y = profitability ratio, conduct
a regression analysis using appropriate x variables e.g. sales (F1), EBIT (F6), financial
leverage (P4) etc. Use your multiple regression model to make predictions for a firm that
is not in your selected sample. Predict profitability ratio for this firm and compare this
prediction with actual profitability ratio of this firm. Make 5 such predictions and present
the information in a table. Prepare a report of your analysis.