Court Cases Winning Stratergys
Court Cases Winning Stratergys
Court Cases Winning Stratergys
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
205
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
206
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
In the research by Waltl et al. (2017), they the decision class and the unanimity of deci-
have conducted their research fundamentally sions have been designed. They have achieved
on German tax law cases. The research is good accuracy for some of the many model
conducted on features extracted using mostly variations.
regular expressions and manual annotations.
A Naive Bayes classifier have been chosen as 3 Methodology
the best performing machine learning model.
In this section, the approach used for dataset
They have achieved 0.57 precision, 0.58 F1
preparation and the methodology for deriving
score and 0.60 recall for positive outcomes.
the architecture used in this research are be
Research done by Aletras et al. (2016) on discussed.
predicting the decision of the European court
of human rights, is identified as the first sys- 3.1 Dataset Preparation
tematic approach to predicting winning par- As observed by Kreutzer et al. (2022), the
ties by using NLP, as per the authors. They quality of the data sets used often play a vi-
have modeled the problem as a binary classi- tal role in research. This research was con-
fication problem, while using Support Vector ducted on a dataset extracted from the case
Machines and N-grams and topics as features law website1 ranging from the year 2000 to
for the model. year 2010 and belonging to the criminal cate-
Liu and Chen (2017) also proposes a classi- gory. The extracted cases were pre-processed
fication approach for identifying the winning by removing paragraphs at the beginning and
party of a court case. The process consists of the end. These paragraphs include the in-
two phases. In the 1st phase, an Article Classi- troductory paragraphs where the background
fication model extracts top k articles that are of the case is summarized and the last para-
cited in the case document. In the 2nd phase, graphs where the decision is stated. After-
the Judgement Classification model tries to wards several preprocessing steps were applied
predict the judgement of the court case. They to the remaining paragraphs to remove cita-
have considered domain specific aspects such tions and other notations, as they do not add
as punishment, cited statutes and features de- any semantic meaning to the case. In our data
rived using NLP such as sentiment, as features pair, these cleaned and remaining paragraphs
for their model. constitute the input. Since the decision of each
A tree based approach which uses new fea- case was found in the aforementioned removed
ture engineering techniques is proposed in the paragraphs with a retrievable convention in al-
research conducted by Katz et al. (2014). The most all the cases, the decision of the court
dataset used in this research consist of cases cases were extracted automatically. In our
from the United States Supreme Court. Re- data pair, this extracted verdict constitutes
searchers have considered the impact from po- the expected output.
litical biases for the decisions as well. They Stanza NLP Library (Qi et al., 2020) was
have used data ranging over multiple presi- used to split a court case document into a list
dential terms to generalize the model more. of sentences as for the representation purposes
Features already present with there chosen discussed in Section 3.2. Since Stanza is a
dataset have been used and some has been in- general purpose NLP library (not specifically
troduced by them. With the 7700 cases used, trained on legal context), there could be sen-
they have succeeded in getting 69.7% accuracy tences divided by the periods in between ab-
and individual judge votes with 70.9% accu- breviations (some of which are specific jargon
racy. of the legal domain) and the periods within
Lage-Freitas et al. (2019) have proposed a brackets. So, further pre-processing steps were
machine learning approach to develop a sys- needed to be taken to make the sentence split-
tem that predicts Brazilian court decisions. ting process accurate.
Researchers have suggested for it to be used
as a supporting tool or a benchmark for legal • Removed text within rounded brackets.
professionals. The approach to calculate both 1
https://caselaw.findlaw.com/
207
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
208
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
209
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
210
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
Critical
Sentence Sentence
Model Embedding Annotation Accuracy Macro F1
Sentence-BERT N 56.32 53.14
GRU DistilBERT N 65.71 57.14
DistilBERT Y 73.05 63.27
LSTM DistilBERT Y 72.04 65.52
GRU - Bidirectional DistilBERT Y 75.46 63.88
Sentence-BERT N 69.26 60.85
Transformer Encoder DistilBERT N 74.88 64.96
DistilBERT Y 75.75 66.54
obtain an accuracy of 75.75%, outperforming cal evaluation of gated recurrent neural net-
other models. The need for domain-specific works on sequence modeling. arXiv preprint
arXiv:1412.3555.
models can also be seen by the increase in ac-
curacy when the critical sentence annotation Cornell Law School. 2020a. Case law. https://www.
is used. This system can be horizontally ex- law.cornell.edu/wex/case_law. Accessed: 2022-
tended by adding more sub models to pro- 08-18.
vide features to the final model. While the Cornell Law School. 2020b. Legal party. https:
results obtained by DistilBERT (Sanh et al., //www.law.cornell.edu/wex/party. Accessed:
2019) sentence embeddings are impressive, ex- 2022-08-18.
tending the conclusions drawn by Sugathadasa Cornell Law School. 2020c. Petitioner. https:
et al. (2017) for word embeddings, it can be //www.law.cornell.edu/wex/petitioner. Ac-
postulated that legal-domain specific sentence cessed: 2022-08-18.
embeddings would potentially reap better re-
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and
sults. Also as future work, the impact of hav- Kristina Toutanova. 2018. Bert: Pre-training of
ing models trained with supervised approaches deep bidirectional transformers for language un-
and unsupervised approaches should be experi- derstanding. arXiv preprint arXiv:1810.04805.
mented, as legal domain has a deficit of labeled
Michael C Dorf. 1994. Prediction and the rule of
data compared to its large corpora. law. UCLA L. Rev., 42:651.
211
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
IEEE 16th International Conference on Indus- Chanika Ruchini Mudalige, Dilini Karunarathna,
trial and Information Systems (ICIIS), pages Isanka Rajapaksha, Nisansa de Silva, Gathika
146–151. IEEE. Ratnayaka, Amal Shehan Perera, and Ramesh
Pathirana. 2020. Sigmalaw-absa: Dataset for
V. Jayawardana, D. Lakmal, Nisansa de Silva, aspect-based sentiment analysis in legal opinion
A. S. Perera, K. Sugathadasa, B. Ayesha, and texts. arXiv preprint arXiv:2011.06326.
M. Perera. 2017a. Word Vector Embeddings
and Domain Specific Semantic based Semi- Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason
Supervised Ontology Instance Population. In- Bolton, and Christopher D Manning. 2020.
ternational Journal on Advances in ICT for Stanza: A python natural language process-
Emerging Regions, 10(1):1. ing toolkit for many human languages. arXiv
preprint arXiv:2003.07082.
Vindula Jayawardana, Dimuthu Lakmal, Nisansa
de Silva, Amal Shehan Perera, Keet Sugath- Isanka Rajapaksha, Chanika Ruchini Mudalige,
adasa, and Buddhi Ayesha. 2017b. Deriving a Dilini Karunarathna, Nisansa de Silva, Gathika
Representative Vector for Ontology Classes with Rathnayaka, and Amal Shehan Perera. 2020.
Instance Word Vector Embeddings. In 2017 Rule-based approach for party-based sentiment
Seventh International Conference on Innovative analysis in legal opinion texts. arXiv preprint
Computing Technology (INTECH), pages 79–84. arXiv:2011.05675.
IEEE.
Isanka Rajapaksha, Chanika Ruchini Mudalige,
Vindula Jayawardana, Dimuthu Lakmal, Nisansa Dilini Karunarathna, Nisansa de Silva,
de Silva, Amal Shehan Perera, Keet Sugath- Amal Shehan Perera, and Gathika Ratnayaka.
adasa, Buddhi Ayesha, and Madhavi Perera. 2021. Sigmalaw PBSA-A Deep Learning Model
2017c. Semi-Supervised Instance Population of for Aspect-Based Sentiment Analysis for the
an Ontology using Word Vector Embedding. In Legal Domain. In International Conference
Advances in ICT for Emerging Regions (ICTer), on Database and Expert Systems Applications,
2017 Seventeenth International Conference on, pages 125–137. Springer.
pages 1–7. IEEE.
G. Ratnayaka, T. Rupasinghe, Nisansa de Silva,
Daniel Martin Katz, Michael J Bommarito II, and M. Warushavithana, V. Gamage, M. Perera, and
Josh Blackman. 2014. Predicting the behavior A. S. Perera. 2019a. Classifying Sentences in
of the supreme court of the united states: A gen- Court Case Transcripts using Discourse and Ar-
eral approach. arXiv preprint arXiv:1407.6333. gumentative Properties. ICTer, 12(1).
Julia Kreutzer, Isaac Caswell, Lisa Wang, Ah- Gathika Ratnayaka, Thejan Rupasinghe,
san Wahab, Daan van Esch, Nasanbayar Ulzii- Nisansa de Silva, Viraj Gamage, Menuka
Orshikh, Allahsera Tapo, Nishant Subramani, Warushavithana, and Amal Shehan Perera.
Artem Sokolov, Claytone Sikasote, et al. 2022. 2019b. Shift-of-Perspective Identification
Quality at a Glance: An Audit of Web-Crawled Within Legal Cases. In Proceedings of the 3rd
Multilingual Datasets. Transactions of the Asso- Workshop on Automated Detection, Extraction
ciation for Computational Linguistics, 10:50–72. and Analysis of Semantic Information in Legal
Texts.
André Lage-Freitas, Héctor Allende-Cid, Orivaldo
Santana, and Lívia de Oliveira-Lage. 2019. Pre- Gathika Ratnayaka, Thejan Rupasinghe, Nisansa
dicting brazilian court decisions. arXiv preprint de Silva, Menuka Warushavithana, Viraj Gam-
arXiv:1905.10348. age, and Amal Shehan Perera. 2018. Identifying
Relationships Among Sentences in Court Case
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaim- Transcripts Using Discourse Relations. In 2018
ing He, and Piotr Dollár. 2017. Focal loss for 18th International Conference on Advances in
dense object detection. In Proceedings of the ICT for Emerging Regions (ICTer), pages 13–
IEEE international conference on computer vi- 20. IEEE.
sion, pages 2980–2988.
Gathika Ratnayaka, Nisansa de Silva, Amal She-
Yihung Liu and Yen-Liang Chen. 2017. A two- han Perera, and Ramesh Pathirana. 2020. Effec-
phase sentiment analysis approach for judge- tive approach to develop a sentiment annotator
ment prediction. Journal of Information Sci- for legal domain in a low resource setting. arXiv
ence, 44. preprint arXiv:2011.00318.
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Nils Reimers and Iryna Gurevych. 2019. Sentence-
Du, Mandar Joshi, Danqi Chen, Omer Levy, BERT: Sentence Embeddings using Siamese
Mike Lewis, Luke Zettlemoyer, and Veselin BERT-Networks. In Proceedings of the 2019
Stoyanov. 2019. Roberta: A robustly opti- Conference on Empirical Methods in Natural
mized bert pretraining approach. arXiv preprint Language Processing. Association for Computa-
arXiv:1907.11692. tional Linguistics.
212
The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Taipei, Taiwan, November 21-22, 2022. The Association for Computational Linguistics and Chinese Language Processing
213