Pair T-test by Shaheer

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 20

NATIONAL UNIVERSITY OF MODERN

LANGUAGES

SRE Project Documentation


ROLL NO:
FSD-FL-248
Class:
BSSE(3st)
Department:
Software Engineering
Course name:
Software Requirement Engineering
SUBMITTED BY:
M. SHAHEER
SUBMITTED To:
Mr. Muhammad Umar
 Requirements in the LinkedIn Application
 Profile Customization Options
 Profile Visibility Control
 Career Advice Chatbot Powered with AI
 Custom Resume Download
 Video Introductions in the Profile
 Career Quizzes and Challenges
 Job Interview Preparation Tools
 Freelancer Profile Section
 Real-time Profile Feedback
 Portfolio Section in the Profile

 Applying Pair T test on these


Requirements

To apply the paired t-test for the listed requirements individually in


the context of Software Requirements Engineering (SRE), we will
simulate a step-by-step approach for each requirement. Since I do not
have access to real user data or direct results from implementing these
features, I'll outline a hypothetical scenario, explain how data would be
collected, and show what the resulting tables would look like if you
were to analyze these features with the paired t-test.

 Profile Customization Options


Hypothetical Data Collection (Ratings from 10 Users)

User Before Non- After Difference


Customizable Customizable (After -
Profiles) Profiles) Before)
1 6 8 2

2 7 8 1
3 5 7 2

4 4 8 4

5 6 9 3

6 7 9 2

7 6 7 1

8 5 8 3

9 8 9 1

1 7 10 3
0

 Calculations:
1. Mean of Differences (dˉ\bar{d}):
dˉ=Sum of DifferencesNumber of Users=(2+1+2+4+3+2+1+3+1+
3)10=2.2\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(2 + 1 + 2 + 4 + 3 + 2 + 1 + 3 + 1 + 3)}{10} =
2.2
Standard Deviation of Differences (sds_d):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(2−2.2)2+(1−2.2)2+…+(3−2.2)29=0.96\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(2-
2.2)^2 + (1-2.2)^2 + \ldots + (3-2.2)^2}{9} = 0.96
Standard deviation:
sd=Variance=0.96≈0.98s_d = \sqrt{\text{Variance}} = \sqrt{0.96}
\approx 0.98
2 t-Statistic:
Use the formula:
t=dˉsd/n=2.20.98/10=2.20.31≈7.10t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{2.2}{0.98 / \sqrt{10}} = \frac{2.2}{0.31} \
approx 7.10
3 Degrees of Freedom (df):
df=n−1=10−1=9df = n - 1 = 10 - 1 = 9
4 Critical t-value:
From the t-distribution table, for df=9df = 9 at a 95% confidence
level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=7.10>2.262t = 7.10 > 2.262, the result is statistically
significant. This means that enabling profile customization leads to a
significant improvement in the evaluated metric (e.g., user
satisfaction).

 Profile Visibility Control


Hypothetical Data Collection: User Satisfaction Ratings

Use Before After Difference


r (Default (Custom (After -
Visibility Visibility Before)
Settings) Settings)

1 6 8 2

2 5 7 2

3 4 7 3

4 6 9 3

5 7 9 2

6 6 8 2

7 5 8 3

8 6 9 3

9 7 9 2

10 6 8 2

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(2+2+3+3+2+2+3+3+2+
2)10=2.4\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(2 + 2 + 3 + 3 + 2 + 2 + 3 + 3 + 2 + 2)}{10} =
2.4dˉ=Number of UsersSum of Differences
=10(2+2+3+3+2+2+3+3+2+2)=2.4
2. Standard Deviation of Differences (sds_dsd):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(2−2.4)2+(2−2.4)2+…
+(2−2.4)29=0.267\text{Variance} = \frac{\sum (d_i - \bar{d})^2}
{n-1} = \frac{(2-2.4)^2 + (2-2.4)^2 + \ldots + (2-2.4)^2}{9} =
0.267Variance=n−1∑(di−dˉ)2=9(2−2.4)2+(2−2.4)2+…+(2−2.4)2
=0.267
Standard deviation:
sd=Variance=0.267≈0.52s_d = \sqrt{\text{Variance}} = \
sqrt{0.267} \approx 0.52sd=Variance=0.267≈0.52
3. t-Statistic:
Use the formula:
t=dˉsd/n=2.40.52/10=2.40.164≈14.63t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{2.4}{0.52 / \sqrt{10}} = \frac{2.4}{0.164} \
approx 14.63t=sd/ndˉ=0.52/102.4=0.1642.4≈14.63
4. Degrees of Freedom (df):
df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9
5. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=14.63>2.262t = 14.63 > 2.262t=14.63>2.262, the result is
statistically significant. This suggests that implementing the
visibility control feature significantly improves user satisfaction with
managing their profile's visibility.
 Career Advice Chatbot Powered with AI
Hypothetical Data Collection: User Satisfaction
Ratings

Use Before (No After (With Difference


r Chatbot) Chatbot) (After -
Before)

1 6 9 3

2 5 8 3

3 4 7 3

4 7 9 2

5 6 8 2

6 5 9 4

7 6 9 3

8 4 7 3

9 5 8 3

10 6 9 3

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(3+3+3+2+2+4+3+3+3+
3)10=3.0\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(3 + 3 + 3 + 2 + 2 + 4 + 3 + 3 + 3 + 3)}{10} =
3.0dˉ=Number of UsersSum of Differences
=10(3+3+3+2+2+4+3+3+3+3)=3.0
2. Standard Deviation of Differences (sds_dsd):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(3−3.0)2+(3−3.0)2+…+(3−3.0)29=0.67\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(3-
3.0)^2 + (3-3.0)^2 + \ldots + (3-3.0)^2}{9} =
0.67Variance=n−1∑(di−dˉ)2=9(3−3.0)2+(3−3.0)2+…+(3−3.0)2
=0.67
Standard deviation:
sd=Variance=0.67≈0.82s_d = \sqrt{\text{Variance}} = \sqrt{0.67}
\approx 0.82sd=Variance=0.67≈0.82
3. t-Statistic:
Use the formula:
t=dˉsd/n=3.00.82/10=3.00.26≈11.54t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{3.0}{0.82 / \sqrt{10}} = \frac{3.0}{0.26} \
approx 11.54t=sd/ndˉ=0.82/103.0=0.263.0≈11.54
5 Degrees of Freedom (df):
df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9
6 Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=11.54>2.262t = 11.54 > 2.262t=11.54>2.262, the result is
statistically significant. This means that the introduction of the AI-
powered chatbot significantly improves user satisfaction and
engagement, justifying its implementation as an enhancement to
LinkedIn’s user experience.

 Custom Resume Download


Hypothetical Data Collection: User Satisfaction
Ratings

Use Before After Difference


r (Default (Customizable (After -
Resume
Format) Templates) Before)

1 6 9 3

2 5 8 3

3 6 10 4

4 7 9 2

5 6 9 3

6 5 8 3

7 4 7 3

8 6 10 4

9 5 8 3

10 6 9 3

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(3+3+4+2+3+3+3+4+3+
3)10=3.1\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(3 + 3 + 4 + 2 + 3 + 3 + 3 + 4 + 3 + 3)}{10} =
3.1dˉ=Number of UsersSum of Differences
=10(3+3+4+2+3+3+3+4+3+3)=3.1
2. Standard Deviation of Differences (sds_dsd):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(3−3.1)2+(3−3.1)2+…+(3−3.1)29=0.3\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(3-
3.1)^2 + (3-3.1)^2 + \ldots + (3-3.1)^2}{9} =
0.3Variance=n−1∑(di−dˉ)2=9(3−3.1)2+(3−3.1)2+…+(3−3.1)2
=0.3
Standard deviation:
sd=Variance=0.3≈0.55s_d = \sqrt{\text{Variance}} = \sqrt{0.3} \
approx 0.55sd=Variance=0.3≈0.55
3. t-Statistic:
Use the formula:
t=dˉsd/n=3.10.55/10=3.10.17≈18.24t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{3.1}{0.55 / \sqrt{10}} = \frac{3.1}{0.17} \
approx 18.24t=sd/ndˉ=0.55/103.1=0.173.1≈18.24

4. Degrees of Freedom (df):


df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9
5. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=18.24>2.262t = 18.24 > 2.262t=18.24>2.262, the result is
statistically significant. This indicates that the customizable resume
download feature significantly improves user satisfaction, validating
the feature’s effectiveness and alignment with user needs.

 Video Introductions in the Profile


Hypothetical Data Collection: User Engagement Ratings

Use Before (No After (With Difference


r Video Video (After -
Introduction) Introduction) Before)

1 6 8 2

2 5 7 2

3 7 9 2

4 6 10 4

5 5 8 3

6 6 9 3
7 7 9 2

8 4 7 3

9 5 8 3

10 6 9 3

 Calculations:

1. Mean of Differences (dˉ\bar{d}dˉ):

dˉ=Sum of DifferencesNumber of Users=(2+2+2+4+3+3+2+3+3+


3)10=2.7\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(2 + 2 + 2 + 4 + 3 + 3 + 2 + 3 + 3 + 3)}{10} =
2.7dˉ=Number of UsersSum of Differences
=10(2+2+2+4+3+3+2+3+3+3)=2.7

2. Standard Deviation of Differences


(sds_dsd):
Calculate variance:

Variance=∑(di−dˉ)2n−1=(2−2.7)2+(2−2.7)2+…+(3−2.7)29=0.3\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(2-
2.7)^2 + (2-2.7)^2 + \ldots + (3-2.7)^2}{9} =
0.3Variance=n−1∑(di−dˉ)2=9(2−2.7)2+(2−2.7)2+…+(3−2.7)2
=0.3

Standard deviation:

sd=Variance=0.3≈0.55s_d = \sqrt{\text{Variance}} = \sqrt{0.3} \


approx 0.55sd=Variance=0.3≈0.55

3. t-Statistic:
Use the formula:
t=dˉsd/n=2.70.55/10=2.70.17≈15.88t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{2.7}{0.55 / \sqrt{10}} = \frac{2.7}{0.17} \
approx 15.88t=sd/ndˉ=0.55/102.7=0.172.7≈15.88

4. Degrees of Freedom (df):


df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9

5. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:

Since t=15.88>2.262t = 15.88 > 2.262t=15.88>2.262, the result is


statistically significant. This indicates that the video introduction
feature significantly improves user engagement and profile appeal,
confirming its value as a useful addition to LinkedIn’s functionality.

 Career Quizzes and Challenges


Hypothetical Data Collection: User Engagement
Ratings

Use Before (No After (With Difference


r Quiz-Based Quiz-Based (After -
Game) Game) Before)

1 5 8 3

2 6 9 3

3 7 10 3

4 5 9 4

5 6 10 4

6 5 8 3
7 6 9 3

8 4 7 3

9 5 8 3

10 6 9 3

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(3+3+3+4+4+3+3+3+3+3)1
0=3.2\bar{d} = \frac{\text{Sum of Differences}}{\text{Number of
Users}} = \frac{(3 + 3 + 3 + 4 + 4 + 3 + 3 + 3 + 3 + 3)}{10} =
3.2dˉ=Number of UsersSum of Differences
=10(3+3+3+4+4+3+3+3+3+3)=3.2
2. Standard Deviation of Differences (sds_dsd):
Calculate variance:
3. Variance=∑(di−dˉ)2n−1=(3−3.2)2+(3−3.2)2+…+(3−3.2)29=0.18\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(3-
3.2)^2 + (3-3.2)^2 + \ldots + (3-3.2)^2}{9} =
0.18Variance=n−1∑(di−dˉ)2=9(3−3.2)2+(3−3.2)2+…+(3−3.2)2
=0.18
4. Standard deviation:
5. sd=Variance=0.18≈0.42s_d = \sqrt{\text{Variance}} = \sqrt{0.18}
\approx 0.42sd=Variance=0.18≈0.42
6. t-Statistic:
Use the formula:
t=dˉsd/n=3.20.42/10=3.20.13≈24.62t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{3.2}{0.42 / \sqrt{10}} = \frac{3.2}{0.13} \approx
24.62t=sd/ndˉ=0.42/103.2=0.133.2≈24.62

7. Degrees of Freedom (df):


df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9
8. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=24.62>2.262t = 24.62 > 2.262t=24.62>2.262, the result is
statistically significant. This indicates that the quiz-based game feature
significantly improves user engagement and satisfaction. It
demonstrates the potential of gamified elements to enhance the
LinkedIn experience effectively.

 Job Interview Preparation Tools


Hypothetical Data Collection: Confidence Ratings

Use Before (No After (With Difference


r Preparation Preparation (After -
Tools) Tools) Before)

1 5 8 3

2 6 9 3

3 4 7 3

4 5 9 4

5 6 10 4

6 5 8 3

7 4 7 3

8 5 9 4

9 6 9 3

10 4 7 3
 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(3+3+3+4+4+3+3+4+3+
3)10=3.3\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(3 + 3 + 3 + 4 + 4 + 3 + 3 + 4 + 3 + 3)}{10} =
3.3dˉ=Number of UsersSum of Differences
=10(3+3+3+4+4+3+3+4+3+3)=3.3

2. Standard Deviation of Differences


(sds_dsd):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(3−3.3)2+(3−3.3)2+…+(3−3.3)29=0.21\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(3-
3.3)^2 + (3-3.3)^2 + \ldots + (3-3.3)^2}{9} =
0.21Variance=n−1∑(di−dˉ)2=9(3−3.3)2+(3−3.3)2+…+(3−3.3)2
=0.21
Standard deviation:
sd=Variance=0.21≈0.46s_d = \sqrt{\text{Variance}} = \sqrt{0.21}
\approx 0.46sd=Variance=0.21≈0.46
3. t-Statistic:
Use the formula:
t=dˉsd/n=3.30.46/10=3.30.15≈22.0t = \frac{\bar{d}}{s_d / \
sqrt{n}} = \frac{3.3}{0.46 / \sqrt{10}} = \frac{3.3}{0.15} \
approx 22.0t=sd/ndˉ=0.46/103.3=0.153.3≈22.0

4. Degrees of Freedom (df):


df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9
5. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.
 Conclusion:
Since t=22.0>2.262t = 22.0 > 2.262t=22.0>2.262, the result is
statistically significant. This indicates that the job interview preparation
tools significantly improve users' confidence in interview readiness,
validating the tools' effectiveness in providing value to users.

 Freelancer Profile Section


Hypothetical Data Collection: Engagement or Job
Acquisition Ratings

Use Before (No After (With Difference


r Freelancer Freelancer (After -
Profile) Profile) Before)

1 6 8 2

2 5 9 4

3 4 7 3

4 6 9 3

5 5 8 3

6 7 9 2

7 4 6 2

8 5 8 3

9 6 9 3

10 5 8 3

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(2+4+3+3+3+2+2+3+3+
3)10=2.8\bar{d} = \frac{\text{Sum of Differences}}{\text{Number
of Users}} = \frac{(2 + 4 + 3 + 3 + 3 + 2 + 2 + 3 + 3 + 3)}{10} =
2.8dˉ=Number of UsersSum of Differences
=10(2+4+3+3+3+2+2+3+3+3)=2.8
Standard Deviation of Differences (sds_dsd):
Calculate variance:
Variance=∑(di−dˉ)2n−1=(2−2.8)2+(4−2.8)2+…+(3−2.8)29=0.4\
text{Variance} = \frac{\sum (d_i - \bar{d})^2}{n-1} = \frac{(2-
2.8)^2 + (4-2.8)^2 + \ldots + (3-2.8)^2}{9} =
0.4Variance=n−1∑(di−dˉ)2=9(2−2.8)2+(4−2.8)2+…+(3−2.8)2
=0.4
Standard deviation:
sd=Variance=0.4≈0.63s_d = \sqrt{\text{Variance}} = \sqrt{0.4} \
approx 0.63sd=Variance=0.4≈0.63
2. t-Statistic:
Use the formula:
t=dˉsd/n=2.80.63/10=2.80.20≈14.0t = \frac{\bar{d}}{s_d / \sqrt{n}}
= \frac{2.8}{0.63 / \sqrt{10}} = \frac{2.8}{0.20} \approx 14.0t=sd/n
dˉ=0.63/102.8=0.202.8≈14.0

3. Degrees of Freedom (df):


df=n−1=10−1=9df = n - 1 = 10 - 1 = 9df=n−1=10−1=9

4. Critical t-value:
From the t-distribution table, for df=9df = 9df=9 at a 95%
confidence level, the critical t-value is approximately 2.262.

 Conclusion:
Since t=14.0>2.262t = 14.0 > 2.262t=14.0>2.262, the result is
statistically significant. This demonstrates that the freelancer profile
feature substantially improves user engagement and job acquisition
rates, validating its effectiveness and its value to users in expanding
their professional opportunities.
 Real-time Profile Feedback
Hypothetical Data Collection: Profile Optimization
Ratings

Use Before (No After (With Difference


r Real-Time Real-Time (After -
Feedback) Feedback) Before)

1 6 9 3

2 5 8 3

3 6 9 3

4 7 10 3

5 5 8 3

6 4 7 3

7 6 9 3

8 5 8 3

9 7 10 3

10 6 9 3

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(3+3+3+3+3+3+3+3+3+3)1
0=3.0\bar{d} = \frac{\text{Sum of Differences}}{\text{Number of
Users}} = \frac{(3 + 3 + 3 + 3 + 3 + 3 + 3 + 3 + 3 + 3)}{10} =
3.0dˉ=Number of UsersSum of Differences
=10(3+3+3+3+3+3+3+3+3+3)=3.0
Standard Deviation of Differences (sds_dsd):
Since all differences are the same (di=3d_i = 3di=3), the variance is
000, and thus:
sd=0s_d = 0sd=0
2. t-Statistic:
Using the formula:
t=dˉsd/nt = \frac{\bar{d}}{s_d / \sqrt{n}}t=sd/ndˉ
Since sd=0s_d = 0sd=0, the t-statistic is undefined because there is no
variability in the differences.

 Conclusion:
When all differences are constant, the data lacks variability, and a
paired t-test is not suitable. However, the consistent improvement
indicates that the real-time feedback feature has positively impacted
user profiles. A different statistical analysis (e.g., non-parametric tests)
may be more appropriate to confirm these results.

 Portfolio Section in the Profile


Hypothetical Data Collection: Engagement Rates

Use Before (No After (With Difference


r Portfolio Portfolio (After -
Section) Section) Before)

1 50 80 30

2 60 90 30

3 55 85 30

4 70 100 30

5 65 95 30

6 50 80 30
7 60 90 30

8 55 85 30

9 70 100 30

10 65 95 30

 Calculations:
1. Mean of Differences (dˉ\bar{d}dˉ):
dˉ=Sum of DifferencesNumber of Users=(30+30+30+30+30+30+30+
30+30+30)10=30\bar{d} = \frac{\text{Sum of Differences}}{\
text{Number of Users}} = \frac{(30 + 30 + 30 + 30 + 30 + 30 + 30 +
30 + 30 + 30)}{10} = 30dˉ=Number of UsersSum of Differences
=10(30+30+30+30+30+30+30+30+30+30)=30
Standard Deviation of Differences (sds_dsd):
Since all differences are the same (di=30d_i = 30di=30), the variance
is 000, and thus:
sd=0s_d = 0sd=0
2. t-Statistic:
Using the formula:
t=dˉsd/nt = \frac{\bar{d}}{s_d / \sqrt{n}}t=sd/ndˉ
Since sd=0s_d = 0sd=0, the t-statistic is undefined because there is
no variability in the differences.

 Conclusion:
The lack of variability in differences indicates that all users experienced
the same improvement in engagement rates. While a paired t-test is
not applicable in this scenario due to zero standard deviation, the
consistent improvement strongly suggests that the portfolio section
positively impacts profile engagement. Additional analysis using a
larger dataset or alternative statistical methods might further validate
these results.

You might also like