
ACL-IJCNLP 2021
The 59th Annual Meeting of the

Association for Computational Linguistics
and the 11th International Joint Conference
on Natural Language Processing
Proceedings of the Conference, Vol. 1 (Long Papers)
August 1 - 6, 2021
Diamond Sponsors
Platinum Sponsors
Gold Sponsors
Silver Sponsors
Bronze Sponsors
©2021 The Association for Computational Linguistics
Order copies of this and other ACL proceedings from:
Association for Computational Linguistics (ACL)

209 N. Eighth Street
Stroudsburg, PA 18360
USA
Tel: +1-570-476-8006
Fax: +1-570-476-0860
[email protected]
ISBN 978-1-954085-52-7 (Volume 1)
Message from the General Chair
I am delighted to welcome you to the Joint Conference of the 59th Annual Meeting of the Association for
Computational Linguistics and the 11th International Joint Conference on Natural Language Processing
(ACL-IJCNLP 2021)!
We are very grateful to many people. Fei Xia, Wenjie Li (Maggie) and Roberto Navigli, as the
Program Chairs, have admirably guided the organization and management of the main conference.
The calm and experienced Priscilla Rasmussen did a great deal of work: signing the contract
with the virtual platform company, Underline.io, calculating registration fees, managing the entire
registration process, and communicating with sponsors and exhibitors. We also thank the amazing 68-person
organizing committee, whose members all contributed so much to making the conference successful: Local Chairs
(Priscilla Rasmussen, Thepchai Supnithi, Thanaruk Theeramunkong), Tutorial Chairs (David Chiang,
Min Zhang), Workshop Chairs (Kentaro Inui, Michael Strube), Student Research Workshop Chairs
(Jad Kabbara, Haitao Lin, Amandalynne Paullada, Jannis Vamvas), Faculty Advisors to the Student
Workshop (Jing Jiang, Rico Sennrich, Derek F. Wong, Nianwen Xue), Audio-Video Chairs (Suchathit
Boonnag, Rachasak Somyanonthanakul), Conference Handbook Chair (Krit Kosawat), Demonstration
Chairs (Heng Ji, Jong C. Park, Rui Xia), Diversity and Inclusion Committee Chairs (Academic Inclusion
Chairs: Avirup Sil, Kayathi Chandu, Lifu Huang, Sara Rosenthal; Accessibility Chairs: Minlie Huang,
Vivian Chen, Yang Feng; Financial Access Chairs: Martha Yifiru Tachbelie, Alexis Palmer, Ignatius
Eziani, Manuel Mager, Nafise Moosavi; Socio-cultural Inclusion Chairs: Alvin Grissom, Xanda
Schofield, Pedro Rodriguez), Local Sponsorship Chairs (Rachada Kongkrachantra, Jing Li, Kobkrit
Viriyayudhakorn, Zhongyu Wei), Publications Chairs (Yuki Arase, Jing-Shin Chang, Yvette Graham),
Publicity Chair (Kam-Fai Wong), Remote Presentation Chairs (Zhongjun He, Nattapol Kritsuthikul,
Yadollah Yaghoobzadeh), Sustainability Chairs (Angeliki Lazaridou, Qi Zhang), Reviewer Mentoring
Committee Chairs (Jing Huang, Antoine Bosselut, Christophe Gravier), Website and Conference App
Chairs (Chutima Beokhaimook, Witchaworn Mankhong), Student Volunteer Coordinator (Dongyan
Zhao), Ethics Advisory Committee Chairs (Malvina Nissim, Min-Yen Kan, Xanda Schofield), Social
Media Committee Chairs (Luciana Benotti, Lidong Bing, Zhumin Chen, Rachele Sprugnoli, Mark
Seligman), Virtual Infrastructure Committee Advisor (Hao Fang), Virtual Infrastructure Committee
Chairs (Wei Lu, Krich Nasingkun, Alessandro Raganato, Shaonan Wang, Liang-Chih Yu, Jianfei Yu).
The success of the conference is inseparable from the guidance and advice of the ACL Officers. Special
thanks to Hinrich Schütze, Rada Mihalcea, David Yarowsky, Shiqi Zhao and Yusuke Miyao. The general
chair of NAACL’2021, Dr. Kristina Toutanova, gave me much advice based on her experience with
organizing NAACL’2021. The friendly cooperation with the NAACL’2021 and EACL’2021 workshop
chairs and tutorial chairs was very important and mutually beneficial.
Sponsors and exhibitors are always very important. We are extremely grateful to all sponsors for their
continuing support, which helps make our conferences successful.
And finally, I would like to thank every one of you for making ACL-IJCNLP’2021 such a success by
submitting papers and demos, serving as area chairs and reviewers, session chairs, invited speakers and
volunteers, and by joining us in the virtual environment.
Welcome, and I hope you all enjoy the conference!
Chengqing Zong
ACL-IJCNLP’2021 General Chair
June 28, 2021
Message from the Program Chairs
Welcome to the Joint Conference of the 59th Annual Meeting of the Association for Computational
Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP
2021)! ACL-IJCNLP 2021 has a special historical significance as this is a particularly exciting period:
our field has grown dramatically, NLP research is now ubiquitous in products, and the barrier to entry to
the field has lowered considerably. Like ACL 2020, ACL-IJCNLP 2021 is held as a virtual conference
again due to the worldwide COVID-19 pandemic which has lasted for more than one year. We are very
grateful for all of your support and contributions during this difficult time, which make this conference
special and memorable.
Abstract and Full-paper Submissions: To synchronize with NAACL 2021, our conference’s review
cycle was about three weeks shorter than that of ACL 2020. To make the short review cycle work, we
introduced an abstract submission step, which required authors to submit an abstract by Jan 25, 2021,
one week before the full-paper submission deadline on Feb 1, 2021. This extra step gave NAACL 2021
authors an opportunity to withdraw their papers from NAACL 2021 and submit them to ACL-IJCNLP
2021 based on feedback from NAACL 2021’s rebuttal period. In total, we received 4,266 abstract
submissions and 3,350 full paper submissions.
Tracks: Each submission was assigned to one of 24 topic tracks. The tracks were similar to those used
in previous conferences but with a few changes:
1. Based on the number of submissions in previous conferences, we followed NAACL 2021 and
combined two tracks (“Semantics: Sentence Level” and “Semantics: Textual Inference and Other
Areas of Semantics”) into a single track “Semantics: Sentence-level Semantics, Textual Inference
and Other areas”.
2. To accommodate a wider and more diverse area, we changed the name of the “Computational
Social Science and Social Media” track to “Computational Social Science and Cultural Analytics”.
3. Following NAACL 2021, we combined the “Theory and Formalism” with the “Cognitive
Modeling and Psycholinguistics” areas into “Linguistic theories, Cognitive Modeling and
Psycholinguistics”. This track is designed to encourage submissions targeting the theoretical
underpinnings of NLP models, which have had little presence at past ACL conferences.
4. We introduced a new theme: “NLP for Social Good (NLP4SG)”. The application of AI to provide
positive social impact has been an important topic in recent years. However, to date, this has not
been a topic highlighted at the ACL main conference. This track is designed to invite submissions
that can provide insights for the ACL-IJCNLP community on the topic of NLP for Social Good as
well as how NLP could potentially cause or be used for social harm.
Program Committee: To meet the reviewer demands of a growing conference without compromising
review quality, we started recruiting Senior Area Chairs (SACs) and Area Chairs in early fall 2020. Then
we initiated a large-scale reviewer recruiting effort in Nov 2020. We compiled a large list of reviewers from
previous conferences, and sent out invitations to more than 9,000 candidates, asking the ones who were
willing to serve to fill out a Microsoft reviewer form. About 4,400 of the invitees filled out the form. We
then worked with SACs and ACs in selecting reviewers and assigning them to appropriate tracks. The
whole process of forming the program committee was very complex and took several months to complete.
In the end, we had the largest program committee in the history of ACL, with 60 SACs, 323
ACs, and 3,685 primary reviewers.
Reviewer Mentoring Program: Review quality is crucial for the success of a large conference like
ACL. Thus, it is of central importance for our community to mentor and train new reviewers in order to
keep up with the community’s rapid growth, both in terms of submissions and in terms of new members
of the community. Therefore, this year we continued the reviewer mentoring program launched with
ACL 2020. Ultimately, the goal of this program is to provide long-needed mentoring to new reviewers.
We formed a reviewer mentoring committee. Collaborating with them and SACs, we paired Area Chairs
(mentors) with first-time ACL reviewers (mentees, often Ph.D. students or junior researchers) during
the paper assignment process. The mentees would submit reviews early for the mentors to provide
feedback, and the mentees would then revise their reviews based on the feedback. In addition, to help
all the reviewers, the reviewer mentoring committee created several videos including the presentation
of the mentoring program, a general reviewing tutorial, information about the review form used for this
conference, and guidelines on how to consider ethical issues and reproducibility in submissions.
Ethical review: The ethical impact and potential applications of our research should be an important
consideration for research design, and as artificial intelligence is becoming more mainstream, these issues
are increasingly pertinent. To address the potential ethical concerns, we allowed authors to include
a broader impact statement or other discussion of ethics in the paper, which does not count towards
the page limit. We formed an Ethics Advisory Committee (EAC) with three co-chairs and 57 EAC
reviewers. During the review process, reviewers were asked to flag submissions with ethical concerns.
The EAC then reviewed all the flagged papers to determine whether the papers should be (a) accepted
as is, (b) conditionally accepted (with specification of what must be addressed in the camera-ready version
in order for the condition to be removed), or (c) rejected on ethical grounds (with explanation of the
reject decision). Based on their decisions and the SAC recommendations, we made the accept/reject
decisions and sent out acceptance notifications on May 6, 2021. The whole process was explained in a
blog post published on the conference website on May 10, 2021. The camera-ready versions of the conditionally
accepted papers were checked by the EAC again. The EAC informed us that all these papers had made
satisfactory revisions and thus we removed the condition on the papers. The whole process was very
complex, and we were grateful for the hard work of the EAC and the authors.
Acceptance to Main Conference: After the review process, out of the 3,350 full submissions, 710
papers (139 short, 571 long) were accepted into the main conference. With an acceptance rate of 21.2%,
ACL-IJCNLP 2021 continues to be a highly competitive conference. Based on the nominations from
Senior Area Chairs, we selected 28 papers as candidates for the Best Paper awards. We formed a Best
Paper Award Committee, who went over all the candidates and selected one best paper, one best theme
paper and six outstanding papers.
Findings: To continue the success of Findings at EMNLP 2020, we decided to introduce Findings
papers, which are papers that are not accepted for publication in the main conference, but nonetheless
have been assessed by the Program Committee as solid work with sufficient substance, quality and
novelty. Out of the 3,350 full submissions, 493 papers were invited to be included in the Findings.
Thirty-six papers declined the offer, leading to 457 papers (118 short and 339 long) to be published in the
Findings of ACL: ACL-IJCNLP 2021. To increase the visibility of the Findings papers, their authors
could choose to make a 3-minute video to be included on the virtual conference site. Our workshop
chairs also helped to pair Findings papers with ACL-IJCNLP 2021 workshops so that
Findings papers could be presented at those workshops.
TACL and CL papers: Continuing the tradition, ACL-IJCNLP 2021 will also feature 27 papers that
were published in Transactions of the Association for Computational Linguistics (TACL) and 5 papers
from the journal of Computational Linguistics (CL).
Keynote speakers: Another highlight of our program is three exciting keynote talks, given by Prof.
Christopher Potts (Stanford University), Prof. Helen Meng (Chinese University of Hong Kong), and Dr.
Alejandrina Cristia (École Normale Supérieure).
ACL-IJCNLP 2021 would not be possible without the support from the community. There are many
people we would like to thank for their significant contributions! First, we would like to thank our
Program Committee, whose names are included in the Program Committee pages in the proceedings:
• Our awesome 60 Senior Area Chairs, who were instrumental in every aspect of the review process
(e.g., AC/reviewer selection, paper assignment, recommendation for paper acceptance, nomination
of best papers and outstanding reviewers). For many of them, the scope of their responsibilities was
equivalent to chairing a small conference.
• The 323 Area Chairs, who led paper review discussions,
wrote meta-reviews, and mentored junior reviewers. In addition, they helped SACs with
reviewer selection, paper assignment, and many other tasks.
• Our 3,685 primary reviewers and 262 secondary reviewers who provided valuable feedback
to the authors. Special thanks to those who stepped in at the last minute to serve as emergency
reviewers.
Second, we would like to thank the many ACL-IJCNLP 2021 committees that we have worked with,
including:
• Our Best Paper Selection Committee, Bonnie Webber, Tim Baldwin and Ellen Riloff for selecting
best papers and outstanding papers under a very tight schedule.
• Our Ethics Advisory Committee, chaired by Min-Yen Kan, Malvina Nissim, and Xanda
Schofield, for their hard work to ensure that all the accepted papers have addressed the ethical
issues appropriately.
• Our Reviewer Mentoring Committee, Jing Huang, Antoine Bosselut and Christophe Gravier, for
preparing mentoring materials and providing review support to first-time reviewers.
• Our Publication Co-Chairs, Jing-Shin Chang, Yuki Arase, and Yvette Graham, for their
tremendous effort in making the proceedings.
• Our Social Media Committee, chaired by Luciana Benotti, Lidong Bing, Zhumin Chen, Mark
Seligman, and Rachele Sprugnoli, for effectively communicating conference updates and other
urgent information on social media platforms.
• The Workshop Chairs, Kentaro Inui and Michael Strube, for connecting Findings paper authors
with individual workshops for possible presentations.
• The Website & Conference App Chairs, Chutima Beokhaimook and Witchaworn Mankhong, for
making numerous updates to the conference website.
Third, we would like to thank the many people who helped us with the various software used for the conference:
• Rich Gerber at SoftConf, who is always quick to respond to our emails and resolve difficulties we
encountered with the START system.
• C. M. Downey at the University of Washington, who helped us to extend and run the external paper
assignment system developed by Graham Neubig.
• Caterina Lacerra and Rocco Tripodi at the Sapienza University of Rome, who helped us in the
creation of internal spreadsheets and processing scripts.
• The whole Underline team (Sol Rosenberg, Fun Lee, Jordan Young, Daniel Luise) who created a
virtual site for the conference.
As Program Chairs, we were in charge of several dozen tasks, many of which were new to us. We
would not have been able to complete them without the advice of our colleagues, including:
• Our General Chair Chengqing Zong, who has been very supportive throughout the whole process,
giving us the flexibility to innovate while providing an invaluable sounding board.
• The Program Co-Chairs of ACL 2020, Joyce Chai, Natalie Schluter and Joel Tetreault; the
Program Co-Chairs of EMNLP 2020, Trevor Cohn, Yulan He and Yang Liu; the Program
Co-Chairs of NAACL 2021, Anna Rumshisky, Luke Zettlemoyer and Dilek Hakkani-Tur, for
generously sharing their experience, documentation, and advice in organizing ACL conferences
and for answering our questions, often on short notice.
• The ACL Executive Committee, especially Rada Mihalcea (the ACL President), Hinrich Schütze
(the ACL Past President), Shiqi Zhao (Secretary), Priscilla Rasmussen (Business Manager), and
Nitin Madnani (Member-at-large), for helping us sort through various issues.
• TACL Editors-in-Chief Ani Nenkova and Brian Roark, TACL Editorial Assistant Cindy
Robinson, and CL Editor-in-Chief Hwee Tou Ng for coordinating TACL and CL presentations at
the conference.
We would also like to thank all the authors (8,757 in total) who submitted their work to the conference.
Although we were only able to accept a small percentage of the submissions, your hard work makes this
conference exciting and our community strong.
Last, but not least, we thank our students, interns, postdocs, colleagues, and families for being so
understanding and supportive when we were swamped by countless conference deadlines and meetings.
Our deepest gratitude is to all of you. We hope you will enjoy the conference.
Fei Xia, University of Washington

Wenjie Li, The Hong Kong Polytechnic University
Roberto Navigli, Sapienza University of Rome
ACL-IJCNLP 2021 Program Committee Co-Chairs
Organizing Committee
General Chair:
Chengqing Zong, Institute of Automation, Chinese Academy of Sciences
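Program Committee Co-Chairs:
Fei Xia, University of Washington
Wenjie Li, The Hong Kong Polytechnic University
Roberto Navigli, Sapienza University of Rome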
Local Organization Committee Co-Chairs:

Priscilla Rasmussen, Association for Computational Linguistics (ACL)
Thepchai Supnithi, National Electronics and Computer Technology Center (NECTEC)
Thanaruk Theeramunkong, The Artificial Intelligence Association of Thailand and Sirindhorn
International Institute of Technology (SIIT), Thammasat University
Tutorial Chairs:
David Chiang, University of Notre Dame
Min Zhang, Soochow University
Workshop Chairs:
Kentaro Inui, Tohoku University
Michael Strube, GmbH Heidelberg
Student Research Workshop Chairs:

Jad Kabbara, McGill University and the Montreal Institute for Learning Algorithms (MILA)
Haitao Lin, Institute of Automation, Chinese Academy of Sciences
Amandalynne Paullada, University of Washington
Jannis Vamvas, Universität Zürich
Faculty Advisors to the Student Research Workshop:

Jing Jiang, Singapore Management University
Rico Sennrich, University of Edinburgh
Derek F. Wong, University of Macau
Nianwen Xue, Brandeis University
Demo Chairs:
Heng Ji, University of Illinois at Urbana-Champaign
Jong C. Park, Korea Advanced Institute of Science and Technology
Rui Xia, Nanjing University of Science and Technology
Publications Chairs:
Yuki Arase, Osaka University
Jing-Shin Chang, National Chi-Nan University
Yvette Graham, Trinity College Dublin
Publicity Chair:
Kam-Fai Wong, The Chinese University of Hong Kong
Sponsorship Co-Chairs:
Rachada Kongkrachantra, Thammasat University
Jing Li, The Hong Kong Polytechnic University
Kobkrit Viriyayudhakorn, iApp Technology Co., Ltd.
Zhongyu Wei, Fudan University
Diversity & Inclusion (D&I) Chairs:
Sub-Committee of Childcare ++ Accessibility:

Leader: Minlie Huang, Tsinghua University
Member: Vivian Chen, National Taiwan University
Member: Yang Feng, Institute of Computing Technology, Chinese Academy of Sciences
Sub-Committee of Academic Inclusion:

Leader: Avirup Sil, IBM
Member: Kayathi Chandu, Carnegie Mellon University
Member: Lifu Huang, Virginia Tech
Member: Sara Rosenthal, IBM Research AI
Sub-Committee of Financial Access:

Leader: Alexis Palmer, University of Colorado Boulder
Leader: Martha Yifiru Tachbelie, Addis Ababa University
Member: Ignatius Eziani, Lancaster University
Member: Manuel Mager, University of Stuttgart
Member: Nafise Moosavi, TU Darmstadt
Sub-Committee of Socio-cultural Inclusion:

Leader: Alvin Grissom, Haverford College
Member: Pedro Rodriguez, University of Maryland, College Park
Member: Xanda Schofield, Harvey Mudd College
Ethics Advisory Committee (EAC):

Min-Yen Kan, National University of Singapore
Malvina Nissim, University of Groningen
Xanda Schofield, Harvey Mudd College
Sustainability Chairs:
Angeliki Lazaridou, DeepMind
Qi Zhang, Fudan University
Audio-Video Chairs:
Suchathit Boonnag, AIAT
Rachasak Somyanonthanakul, Rangsit University
Remote Presentation Chairs:
Zhongjun He, Baidu Co.
Nattapol Kritsuthikul, NECTEC, NSTDA
Yadollah Yaghoobzadeh, University of Tehran
Virtual Infrastructure Committee (VIC):
Advisor:
Hao Fang, Microsoft Semantic Machines
Co-Chairs:
Wei Lu, Singapore University of Technology and Design
Krich Nasingkun, National Electronics and Computer Technology Center
Alessandro Raganato, University of Helsinki
Shaonan Wang, Institute of Automation, Chinese Academy of Sciences
Jianfei Yu, Nanjing University of Science and Technology
Liang-Chih Yu, Yuan Ze University
Reviewer Mentoring Committee Chairs:

Antoine Bosselut, Stanford University
Christophe Gravier, Universite de Saint-Etienne/Lyon
Jing Huang, JD AI Research
Social Media Committee Co-Chairs:

Luciana Benotti, National University of Cordoba
Lidong Bing, DAMO Academy, Alibaba Group
Zhumin Chen, Shandong University
Mark Seligman, Speechmorphing, Inc.
Rachele Sprugnoli, Università Cattolica del Sacro Cuore
Handbook Chair:
Krit Kosawat, NECTEC, NSTDA
Website & Conference App Chairs:

Chutima Beokhaimook, Rangsit University
Witchaworn Mankhong, NECTEC, NSTDA
Student Volunteer Coordinator:

Dongyan Zhao, Peking University
Technical Support:
C. M. Downey, University of Washington
Caterina Lacerra, Sapienza University of Rome
Rocco Tripodi, University of Bologna
Naoki Okada, Osaka University
Masato Yoshinaka, Osaka University
Program Committee
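Program Chairs:
Fei Xia, University of Washington
Wenjie Li, The Hong Kong Polytechnic University
Roberto Navigli, Sapienza University of Rome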
Senior Area Chairs and Area Chairs:
(Senior area chairs are in bold.)
Computational Social Science and Cultural Analytics:
David Jurgens, Paolo Rosso, Noah Smith, Timothy Baldwin, Cristina Bosco,
Antoine Doucet, Manuel Montes, Alice Oh, Simone Paolo Ponzetto, Sara Rosenthal,
Thamar Solorio, Chenhao Tan, Oren Tsur, Leo Wanner, Diyi Yang
Dialogue and Interactive Systems:
Minlie Huang, Gina-Anne Levow, Jason Williams, Luciana Benotti, Y-Lan
Boureau, Yunbo Cao, Asli Celikyilmaz, Yun-Nung Chen, Heriberto Cuayahuitl,
Emily Dinan, Maryam Fazel-Zarandi, Kallirroi Georgila, Alborz Geramifard,
Matthew Henderson, Ryuichiro Higashinaka, Kentaro Inui, Casey Kennington,
Kazunori Komatani, Sungjin Lee, Rebecca J. Passonneau, Giuseppe Riccardi,
Ethan Selfridge, Gabriel Skantze, Ruihua Song, David Traum, Stefan Ultes,
Tsung-Hsien Wen, Wei Wu, Rui Yan, Kai Yu, Zhou Yu, Wei-Nan Zhang
Discourse and Pragmatics:
Vera Demberg, Michael Strube, Jacob Andreas, Chloé Braud, Sadao Kurohashi,
Sharid Loáiciga, Nafise Sadat Moosavi
Ethics in NLP:
Ryan Georgi, Dirk Hovy, Kai-Wei Chang, Karën Fort, Alvin Grissom II, Margot
Mieskes, Vinodkumar Prabhakaran
Information Extraction:
Yunyao Li, Hoifung Poon, Dan Roth, Alan Akbik, Christos Christodoulopoulos,
Leon Derczynski, Jacob Eisenstein, Luheng He, Parisa Kordjamshidi, Mausam,
Stephen Mayhew, Makoto Miwa, Lluís Màrquez, Thien Huu Nguyen, Qiang
Ning, Haoruo Peng, Roi Reichart, Xiang Ren, Alan Ritter, Alla Rozovskaya,
Kevin Small, Yangqiu Song, Vivek Srikumar, Shashank Srivastava, Elior Sulem,
Chen-Tse Tsai, William Yang Wang, Wenpeng Yin
Information Retrieval and Text Mining:
Hang Li, Gabriella Pasi, Sophia Ananiadou, Mohand Boughanem, Nicola Ferro,
Nazli Goharian, Seung-won Hwang, Jing Jiang, Jian-Yun Nie, Raffaele Perego,
Suzan Verberne, Quan Wang, Gerard de Melo
Interpretability and Analysis of Models for NLP:
Anna Rogers, Sameer Singh, Xu Sun, Afra Alishahi, Jasmijn Bastings, Yonatan
Belinkov, Danushka Bollegala, Grzegorz Chrupala, Bhuwan Dhingra, Sebastian
Gehrmann, Wei Lu, Marco Tulio Ribeiro, Anders Søgaard, Ian Tenney, Byron
Wallace
Language Generation:
Michel Galley, Michael White, Jiajun Zhang, Anya Belz, Giuseppe Carenini,
Nina Dethlefs, Mark Dras, Michael Elhadad, Angela Fan, Mary Ellen Foster,
Liang Huang, Shujian Huang, Yangfeng Ji, Ioannis Konstas, Sujian Li, Lili Mou,
Myle Ott, Ankur P. Parikh, Owen Rambow, Stephen Roller, Advaith Siddharthan,
Jinsong Su, Duyu Tang, Zhiguo Wang, Yizhe Zhang
Language Grounding to Vision, Robotics and Beyond:
Mohit Bansal, Hannaneh Hajishirzi, Yoav Artzi, Joyce Chai, Nancy Chen,
Desmond Elliott, Chuang Gan, Zhe Gan, Ani Kembhavi, Radu Soricut, Jesse
Thomason, Mark Yatskar
Linguistic Theories, Cognitive Modeling and Psycholinguistics:
Roger Levy, James Pustejovsky, Alexander Clark, Afsaneh Fazly, Naomi Feldman,
Tal Linzen, Kyle Mahowald
Machine Learning for NLP:
Ming-Wei Chang, Kevin Duh, Tie-Yan Liu, Sebastian Ruder, Waleed Ammar,
Yuki Arase, Niranjan Balasubramanian, Loïc Barrault, Daniel Beck, Yonatan
Bisk, Wray Buntine, Allyson Ettinger, Matthias Gallé, Marjan Ghazvininejad,
Mohit Iyyer, Shafiq Joty, Sarvnaz Karimi, Hideto Kazawa, Junyi Jessy Li, Zachary
Lipton, Yang Liu, Zhiyuan Liu, Daichi Mochihashi, Naoaki Okazaki, Jong Park,
Nanyun Peng, Tao Qin, Sujith Ravi, Mrinmaya Sachan, Natalie Schluter, Pontus
Stenetorp, Karl Stratos, Jun Suzuki, Lu Wang, Dani Yogatama, Koichiro Yoshino
Machine Translation and Multilinguality:
Philipp Koehn, Qun Liu, François Yvon, Wilker Aziz, Marine Carpuat, Boxing Chen,
Colin Cherry, Marta R. Costa-jussà, Marcello Federico, Yang Feng,
Andrew Finch, Mark Fishel, Jiatao Gu, Gholamreza Haffari, Zhongjun He, Mu
Li, Liangyou Li, Junhui Li, Kenton Murray, Jan Niehues, Maja Popović, Artem
Sokolov, Sara Stymne, Longyue Wang, Tong Xiao
Multidisciplinary and Area Chair COI:
Iryna Gurevych, Andreas Vlachos, Dan Goldwasser, Omer Levy, Diarmuid Ó
Séaghdha
NLP Applications:
Jimmy Lin, Vincent Ng, Min Zhang, Beata Beigman Klebanov, Luigi Di Caro,
Sanda Harabagiu, Mamoru Komachi, Juntao Li, Jing Li, Yang Liu, David Mimno,
Preslav Nakov, Tristan Naumann, Emily Prud’hommeaux, David Smith, Lijun
Wu, Jingjing Xu, Min Yang, Jing Yuan, Marcos Zampieri, Wei Zhang
Phonology, Morphology and Word Segmentation:
Yan Song, Nianwen Xue, Ryan Cotterell, Xipeng Qiu, Attapol Rutherford
Question Answering:
Jennifer Chu-Carroll, Alessandro Moschitti, Furu Wei, Roberto Basili, Jordan
Boyd-Graber, Weiwei Cheng, Eunsol Choi, Danilo Croce, Li Dong, Yansong
Feng, Simone Filice, Radu Florian, Zornitsa Kozareva, Jing Liu, Ramesh Nallapati,
Cicero Nogueira dos Santos, Siddharth Patwardhan, Matthias Petri, Oleg
Rokhlenko, Minjoon Seo, Avi Sil, Luca Soldaini, Anh Tuan Luu, Olga Uryupina,
Thuy Vu, Fabio Massimo Zanzotto
Resources and Evaluation:
Samuel Bowman, Nancy Ide, Johan Bos, Tommaso Caselli, Jesse Dodge, Kyle
Gorman, Daniel Khashabi, Jin-Dong Kim, Jonathan K. Kummerfeld, John P.
McCrae, Joakim Nivre, Massimo Poesio, Saku Sugawara, Adina Williams
Semantics: Lexical:
Mona Diab, Mohammad Taher Pilehvar, Marianna Apidianaki, Eduardo Blanco,
Jose Camacho-Collados, Manaal Faruqui, Tommaso Pasini, German Rigau, Vered
Shwartz, Veselin Stoyanov, Aline Villavicencio, Ivan Vulić, Yadollah Yaghoobzadeh,
Yi Zhang
Semantics: Sentence-level Semantics, Textual Inference and Other areas:
Doug Downey, Raymond Mooney, Xiaodan Zhu, Iz Beltagy, Jonathan Berant,
Chandra Bhagavatula, Chris Callison-Burch, Danqi Chen, Greg Durrett, Katrin
Erk, Francis Ferraro, Daniel Gildea, Edward Grefenstette, Robin Jia, Douwe Kiela,
Mike Lewis, Quan Liu, Christopher Potts, Rachel Rudinger, Mo Yu
Sentiment Analysis, Stylistic Analysis, and Argument Mining:
Bing Liu, Rada Mihalcea, Saif Mohammad, Alexandra Balahur, Lidong Bing,
Julian Brooke, Anna Feldman, Yulan He, Lun-Wei Ku, John Lawrence, Maria
Liakata, Smaranda Muresan, Soujanya Poria, Bing Qin, Serena Villata, Xiaojun
Wan
Speech and Multimodality:
Haizhou Li, Florian Metze, Julia Hockenmaier, Preethi Jyothi, Herman Kamper,
Dorothea Kolossa, Hung-yi Lee, Lei Xie
Summarization:
Mirella Lapata, Horacio Saggion, Florian Boudin, Jackie Chi Kit Cheung, Katja
Filippova, Peter Liu, Fei Liu, Shashi Narayan, Manabu Okumura, Laura Perez-Beltrachini,
Maxime Peyrard, Laura Plaza, Xingxing Zhang
Syntax: Tagging, Chunking and Parsing:
Slav Petrov, Emily Pitler, Carlos Gómez-Rodríguez, Daniel Hershcovich, Marco
Kuhlmann, Yuji Matsumoto, Reut Tsarfaty, Yannick Versley, Yue Zhang, Miryam
de Lhoneux
Theme:
Jinho Choi, Joel Tetreault, Tim Althoff, Isabelle Augenstein, Steven Bethard,
Courtney Napoles, Brendan O’Connor, Yulia Tsvetkov, Rob Voigt
Best Paper Selection Committee:
Timothy Baldwin, Ellen Riloff, Bonnie Webber
Primary Reviewers:
Asma Ben Abacha, Jade Abbott, Ahmed Abdelali, Muhammad Abdul-Mageed, Anne Abeille,
Omri Abend, Ahmed AbuRa’ed, Abdalghani Abujabal, Pablo Accuosto, Manoj Acharya,
Judit Ács, Heike Adel, Somak Aditya, Stergos Afantenos, Haithem Afli, Sachin Agarwal,
Sanchit Agarwal, Shubham Agarwal, Sumeet Agarwal, Rodrigo Agerri, Karan Aggarwal,
Piush Aggarwal, Manex Agirrezabal, Željko Agić, Ameeta Agrawal, Priyanka Agrawal,
Sweta Agrawal, Gustavo Aguilar, Roee Aharoni, Wasi Ahmad, Natalie Ahn, Lars Ahrenberg,
Aman Ahuja, Chaitanya Ahuja, Mohammad Ailannejadi, Akiko Aizawa, Reina Akama,
Mohammad Akbari, Alan Akbik, Ahmet Aker, Farhad Akhbardeh, Md. Shad Akhtar, Syed
Sarfaraz Akhtar, Adewale Akinfaderin, Nader Akoury, Arjun Akula, Hend Al-Khalifa, Rami
Al-Rfou, Nora Al-Twairesh, Fahad AlGhamdi, Firoj Alam, Mehwish Alam, Chris Alberti,
Laura Alonso Alemany, Nikolaos Aletras, Jan Alexandersson, Georgios Alexandridis, Mark
Alfano, Raquel G. Alhama, Tariq Alhindi, Hamed Alhoori, Malihe Alikhani, Ilseyar Alimova,
Afra Alishahi, Tamer Alkhouli, Emily Allaway, Carl Allen, Khalid Alnajjar, Héctor
Martínez Alonso, Miguel A. Alonso, Emily Alsentzer, Milad Alshomary, Christoph Alt,
Malik Altakrori, Sophia Althammer, Tim Althoff, Tanel Alumäe, Sandra Aluísio, Fernando
Alva-Manchego, David Alvarez-Melis, Rami Aly, Marcelo Amancio, Bharat Ram Ambati,
Maxime Amblard, Enrique Amigo, Aida Amini, Massih R Amini, Prithviraj Ammanabrolu,
Waleed Ammar, Aixiu An, Bo An, Guozhen An, Jisun An, Ashish Anand, Sophia Ananiadou,
Raviteja Anantha, Antonios Anastasopoulos, Mark Anderson, Jacob Andreas, Nicholas
Andrews, Anietie Andy, Gabor Angeli, Stefanos Angelidis, Luis Espinosa Anke, Diego
Antognini, Jean-Yves Antoine, Kaveri Anuranjana, Xiang Ao, Marianna Apidianaki, Emilia
Apostolova, Jun Araki, Rahul Aralikatte, Eiji Aramaki, Yuki Arase, Mozhdeh Ariannezhad,
Naveen Arivazhagan, Jacob Arkin, Stéphane Aroca-Ouellette, Kushal Arora, Simran Arora,
Leila Arras, Ekaterina Artemova, Mikel Artetxe, Philip Arthur, Yoav Artzi, Kristjan Arumae,
Ehsaneddin Asgari, Nabiha Asghar, Elliott Ash, Arian Askari, Zhenisbek Assylbekov, Ramón
Fernandez Astudillo, Duygu Ataman, Pepa Atanasova, Awais Athar, Giuseppe Attardi, Isabelle
Augenstein, Tal August, Eleftherios Avramidis, Ai Ti Aw, Parul Awasthy, Hosein
Azarbonyad, Erfan Sadeqi Azer, Wilker Aziz,
Nastaran Babanejad, Rohit Babbar, Bogdan Babych, Nguyen Bach, Ebrahim Bagheri, Parnia
Bahar, Ashutosh Baheti, Fan Bai, He Bai, Yu Bai, Yushi Bai, JinYeong Bak, Collin Baker,
Vidhisha Balachandran, Alexandra Balahur, Mithun Balakrishna, Anusha Balakrishnan, Oana
Balalau, Niranjan Balasubramanian, Ivana Balažević, Ioana Baldini, Timothy Baldwin, Kalika
Bali, Miguel Ballesteros, Ramy Baly, Juan Banda, Sivaji Bandyopadhyay, Siddhartha
Banerjee, Jeesoo Bang, Seojin Bang, Hritik Bansal, Mohit Bansal, Sameer Bansal, Trapit
Bansal, Forrest Sheng Bao, Junwei Bao, Siqi Bao, Yu Bao, Ankur Bapna, Roy Bar-Haim,
Mohamad Hardyman Barawi, Edoardo Barba, Adrien Barbaresi, Samuel Barham, Ken Barker,
Gianni Barlacchi, Jeremy Barnes, Antonio Valerio Miceli Barone, Loïc Barrault, Valentin
Barriere, Alberto Barrón-Cedeño, Max Bartolo, Marco Basaldella, Pierpaolo Basile, Roberto
Basili, Ali Basirat, Jasmijn Bastings, Jordi Atserias Batalla, Lisa Bauer, Timo Baumann,
William Baumgartner, Susana Bautista, Rachel Bawden, Kathy Baxter, Ian Beaver, Frederic
Bechet, Daniel Beck, Lee Becker, Steven Bedrick, Dorothee Beermann, Lisa Beinborn, Ahmad
Beirami, Giannis Bekoulis, Núria Bel, Yonatan Belinkov, Eric Bell, Jerome Bellegarda,
Meriem Beloucif, Iz Beltagy, Anya Belz, Eyal Ben-David, Luca Benedetto, Luciana Benotti,
Adrian Benton, Jonathan Berant, Alexandre Berard, Klaus Berberich, Gábor Berend, Leon
Bergen, Maria Berger, Sabine Bergler, Toms Bergmanis, Rafael Berlanga, Delphine
Bernhard, Dario Bertero, Robert Berwick, Laurent Besacier, Steven Bethard, Michele Bevilacqua,
Rahul Bhagat, Chandra Bhagavatula, Rasika Bhalerao, Rishabh Bhardwaj, Aditya Bhargava,
Archna Bhatia, Parminder Bhatia, Sumit Bhatia, Gantavya Bhatt, Suvrat Bhooshan, Rajarshi
Bhowmik, Bin Bi, Wei Bi, Federico Bianchi, Przemyslaw Biecek, Ann Bies, Laura Biester,
Yi Bin, Lidong Bing, Alexandra Birch, Steven Bird, Arianna Bisazza, Yonatan Bisk, Johannes
Bjerva, Henrik Björklund, Philippe Blache, Eduardo Blanco, Nate Blaylock, Terra Blevins,
Rexhina Blloshmi, Su Lin Blodgett, Jelke Bloem, Michael Bloodgood, Théodore Bluche,
Valts Blukis, Victoria Bobicev, Praveen Kumar Bodigutla, Ben Bogin, Danushka Bollegala,
Valeriia Bolotova-Baranova, Rishi Bommasani, Daniele Bonadiman, Claire Bonial, Francesca
Bonin, Ludovico Boratto, Georgeta Bordea, Claudia Borg, Johan Bos, Antal van den Bosch,
Cristina Bosco, Antoine Bosselut, Robert Bossy, Nadjet Bouayad-Agha, Florian Boudin,
Mohand Boughanem, Gosse Bouma, Zied Bouraoui, Y-Lan Boureau, Samuel R. Bowman,
Jordan Boyd-Graber, Johan Boye, Faeze Brahman, António Branco, Jamie Brandon, Kianté
Brantley, Pavel Braslavski, Chloé Braud, Felipe Bravo-Marquez, Arthur Bražinskas, Jonathan
Brennan, Chris Brew, Thomas Brochhagen, Chris Brockett, Julian Brooke, Samuel Broscheit,
Thomas Brovelli (Meyer), Caroline Brun, Dominique Brunato, Luna De Bruyne, Tomáš
Brychcín, Yi Bu, Paweł Budzianowski, Sven Buechel, Alberto Bugarín-Diz, Michael Bugert,
Trung Bui, Paul Buitelaar, Harry Bunt, Wray Buntine, Greg Burnham, Jill Burstein, Hendrik
Buschmeier, Jan Buys, Joan Byamugisha, Bill Byrne, Benjamin Börschinger,
Marco Antonio Sobrevilla Cabezudo, Elena Cabrio, Avi Caciularu, Samuel Cahyawijaya,
Deng Cai, Han Cai, Hengyi Cai, Jon Z. Cai, Yi Cai, Andrew Caines, Ruken Cakici, Agostina
Calabrese, Iacer Calixto, Chris Callison-Burch, Jesus Calvillo, Jose Camacho-Collados, Erik
Cambria, Oana-Maria Camburu, Giovanni Campagna, Leonardo Campillos-Llanos,
Niccolò Campolungo, Jon Ander Campos, Ricardo Campos, Burcu Can, Marie Candito, Erion
Çano, Guihong Cao, Jiannong Cao, Qingqing Cao, Qingxing Cao, Yanan Cao, Yixin Cao,
Yu Cao, Yuan Cao, Yunbo Cao, Ziqiang Cao, Annalina Caputo, Cornelia Caragea, Doina
Caragea, Dallas Card, Giuseppe Carenini, Vicente Ivan Sanchez Carmona, Luigi Di Caro,
Marine Carpuat, Lucien Carroll, Paula Carvalho, Francisco Casacuberta, Iñigo Casanueva,
Helena Caseli, Tommaso Caselli, Vittorio Castelli, Giuseppe Castellucci, Richard Eckart de
Castilho, Sheila Castilho, Chundra Cathcart, Andrew Cattle, Paulo Cavalin, Asli Celikyilmaz,
Alessandra Cervone, Suchet Chachra, Haixia Chai, Joyce Chai, Abhisek Chakrabarty, Tuhin
Chakrabarty, Aishik Chakraborty, Tanmoy Chakraborty, Bharathi Raja Chakravarthi, Gaël
de Chalendar, Yllias Chali, Ilias Chalkidis, Nathanael Chambers, Alvin Chan, Hou Pong
Chan, Zhangming Chan, Senthil Chandramohan, Muthu Kumar Chandrasekaran, Tai Chang-
You, Angel Chang, Baobao Chang, Ernie Chang, Haw-Shiuan Chang, Jing-Shin Chang,
Kai-Wei Chang, Ming-Wei Chang, Serina Chang, Yu-Yun Chang, Yung-Chun Chang, Soravit
Changpinyo, Guan-Lin Chao, Rajen Chatterjee, Akshay Chaturvedi, Iti Chaturvedi, Stergios
Chatzikyriakidis, Aditi Chaudhary, Vishrav Chaudhary, Geeticka Chauhan, Kushal Chawla,
Emmanuel Chemla, Bo Chen, Boxing Chen, Chacha Chen, Chung-Chi Chen, Danqi Chen,
Daoyuan Chen, Guanyi Chen, Hanjie Chen, Hong-You Chen, Hongshen Chen, Hsin-Hsi
Chen, Huimin Chen, Jiaao Chen, Jifan Chen, John Chen, Jun Chen, Kehai Chen, Kezhen
Chen, Kuan-Yu Chen, Lei Chen, Lei Chen, Lin Chen, Long Chen, Long Chen, Lu Chen,
Luoxin Chen, MeiHua Chen, Meng Chen, Mingda Chen, Muhao Chen, Nancy Chen, Penghe
Chen, Qi Chen, Qian Chen, Qianglong Chen, Qingcai Chen, Sanxing Chen, Shizhe Chen,
Sihao Chen, Tao Chen, Tongfei Chen, Wenhu Chen, Wenqing Chen, Xilun Chen, Xinchi
Chen, Xiuyi Chen, Xiuying Chen, Yang Chen, Yen-Chun Chen, Yi-Chen Chen, Yihong Chen,
Yu Chen, Yubo Chen, Yue Chen, Yun Chen, Yun-Nung Chen, Zhenfang Chen, Zhi Chen,
Zhiyu Chen, Zhuang Chen, Zhumin Chen, Ziliang Chen, Hao Cheng, Liying Cheng, Lu
Cheng, Pengxiang Cheng, Pengyu Cheng, Pu-Jen Cheng, Weiwei Cheng, Xingyi Cheng,
Yong Cheng, Yu Cheng, Vijil Chenthamarakshan, Joe Cheri, Colin Cherry, Emmanuele
Chersoni, Jackie Chi Kit Cheung, Jonathan Chevelu, Ethan A. Chi, Zewen Chi, Christian
Chiarcos, Jen-Tzung Chien, Hai Leong Chieu, Patricia Chiril, Luis Chiruzzo, Jaemin Cho,
Sangwoo Cho, Won Ik Cho, Daejin Choi, Eunsol Choi, Jaesik Choi, Jihun Choi, Jinho D.
Choi, Seungtaek Choi, Yejin Choi, Shamil Chollampatt, Jaegul Choo, Leshem Choshen,
Prafulla Kumar Choubey, Monojit Choudhury, Khalid Choukri, Jishnu Ray Chowdhury,
Koel Dutta Chowdhury, Md Faisal Mahbub Chowdhury, Christos Christodoulopoulos, Fenia
Christopoulou, Grzegorz Chrupała, Jennifer Chu-Carroll, Chenhui Chu, Christopher Chu,
Zewei Chu, Shun-Po Chuang, Aleksandr Chuklin, Hyung Won Chung, Jin-Woo Chung,
Tagyoung Chung, Yi-Ling Chung, Kenneth Church, Abu Nowshed Chy, Manuel Ciosici,
Alexander Clark, Christopher Clark, Elizabeth Clark, Kevin Clark, Stephen Clark, Aaron
Clauset, Vincent Claveau, Orphee De Clercq, Éric de la Clergerie, Ann Clifton, Miruna-
Adriana Clinciu, Maximin Coavoux, Oana Cocarascu, Anne Cocos, Arman Cohan, Edo
Cohen-Karlik, Daniel Cohen, Kevin Cohen, Philip Cohen, Trevor Cohn, Marcus Collins,
Costanza Conforti, Simone Conia, John Conroy, Danish Contractor, Paul Cook, Bonaventura
Coppola, Anna Corazza, Francesco Corcoglioniti, Gonçalo Correia, Caio Corro, Luciano
Del Corro, Marta R. Costa-jussà, Ryan Cotterell, Andreas van Cranenburgh, Josep Crego,
Alina Maria Cristea, Dan Cristea, Alejandrina Cristia, Danilo Croce, Fabien Cromieres, Paul
Crook, James Cross, Tim Van de Cruys, Berthold Crysmann, Montse Cuadros, Heriberto
Cuayahuitl, Baiyun Cui, Lei Cui, Leyang Cui, Shaobo Cui, Yiming Cui, Aron Culotta, Iria
da Cunha, Washington Cunha, Anna Currey, Tonya Custis,
Wisdom d’Almeida, Jennifer D’Souza, Raj Dabre, Deborah Dahl, Daniel Dahlmeier, Falcon
Dai, Xiang Dai, Xinyu Dai, Zeyu Dai, Beatrice Daille, Daniel Dakota, Hercules Dalianis,
Siddharth Dalmia, Fahim Dalvi, Marco Damonte, Sandipan Dandapat, Ankit Dangi, Dana
Dannells, Abhishek Das, Dipanjan Das, Shouman Das, Pradeep Dasigi, Hal Daumé III,
Aida Mostafazadeh Davani, Sam Davidson, Brian Davis, Forrest Davis, Joe Davison, Heidar
Davoudi, Johannes Daxenberger, Steve DeNeefe, Jay DeYoung, Alok Debnath, Francien
Dechesne, Thierry Declerck, Mathieu Dehouck, Herve Dejean, Sebastien Delecraz, Felice
Dell’Orletta, Rodolfo Delmonte, Louise Deléger, Vera Demberg, David Demeter, Seniz
Demir, Cagatay Demiralp, Dorottya Demszky, Lingjia Deng, Shumin Deng, Yang Deng, Yue
Deng, Yuntian Deng, Zhi-Hong Deng, Pascal Denis, Michael Denkowski, Leon Derczynski,
Tyler Derr, Shrey Desai, Nina Dethlefs, Tim Dettmers, Daniel Deutsch, Sunipa Dev, Murthy
Devarakonda, Chris Develder, Ann Devitt, Joseph P. Dexter, Sameer Dharur, Paramveer
Dhillon, Bhuwan Dhingra, Mona Diab, Shizhe Diao, Gaël Dias, Aniket Didolkar, Emily
Dinan, Caiwen Ding, Chenchen Ding, Haibo Ding, Kaize Ding, Liang Ding, Ruixue Ding,
Shuoyang Ding, Weicong Ding, Xiao Ding, Zixiang Ding, Liviu P. Dinu, Stefanie Dipper,
Anne Dirkson, Nemanja Djuric, Dmitriy Dligach, Simon Dobnik, Jesse Dodge, Charles
Dognin, Bill Dolan, Elham Dolatabadi, Miguel Domingo, Lucia Donatelli, Li Dong,
MeiXing Dong, Ruihai Dong, Xin Dong, Xin Dong, Yue Dong, Longxu Dou, Zi-Yi Dou, Antoine
Doucet, C. Downey, Doug Downey, A. Seza Doğruöz, Eduard Dragut, Mark Dras, Markus
Dreyer, Rotem Dror, Aleksandr Drozd, Chunning Du, Jiaju Du, Jingfei Du, Jinhua Du, Lan
Du, Mengnan Du, Pan Du, Wanyu Du, Yupei Du, Junwen Duan, Nan Duan, Xiangyu Duan,
Kumar Dubey, Pablo Duboue, Philipp Dufter, Liam Dugan, Kevin Duh, Ambedkar Dukkipati,
Jonathan Dunn, Yoann Dupont, Benjamin Van Durme, Esin Durmus, Nadir Durrani, Greg
Durrett, Rory Duthie, Ritam Dutt, Pratik Dutta, Ondřej Dušek, Melody Dye, Chris Dyer,
William Dyer, Marc Dymetman, Nouha Dziri,
Haihong E, Kurt Eberle, Sebastian Ebert, Javid Ebrahimi, Daniel Edmiston, Sergey Edunov,
Aleksandra Edwards, Steffen Eger, Markus Egg, Koji Eguchi, Yo Ehara, Maud Ehrmann,
Vladimir Eidelman, Liat Ein-Dor, Jacob Eisenstein, Asif Ekbal, Asif Ekbal, Wassim El-Hajj,
Yanai Elazar, Maha Elbayad, Heba Elfardy, Ahmed Elgohary, Michael Elhadad, Desmond
Elliott, Micha Elsner, Ali Emami, Guy Emerson, Messina Enza, Aykut Erdem, Erkut Erdem,
Alexander Erdmann, Akiko Eriguchi, Tomaž Erjavec, Katrin Erk, Liana Ermakova, Patrick
Ernst, Marieke van Erp, Carla Parra Escartín, Ramy Eskander, Cristina España-Bonet, Diego
Esteves, Dominique Estival, Thierry Etchegoyhen, Allyson Ettinger, Barbara Di Eugenio,
Kilian Evang, Richard Evans,
Alexander Fabbri, Guglielmo Faggioli, Farzane Fakhrian, Agnieszka Falenska, Tobias Falke,
Angela Fan, Chuang Fan, James Fan, Kai Fan, Yixing Fan, Hao Fang, Hui Fang, Licheng
Fang, Rui Fang, Wei Fang, Yimai Fang, Farhood Farahnak, M. Amin Farajian, Oladimeji
Farri, Mireia Farrús, Manaal Faruqui, Delia Irazú Hernández Farías, Jean-Philippe Fauconnier,
Adam Faulkner, Benoit Favre, Maryam Fazel-Zarandi, Afsaneh Fazly, Amir Feder, Marcello
Federico, Guy Feigenblat, Anna Feldman, Naomi Feldman, Sergey Feldman, Junlan Feng,
Rui Feng, Shi Feng, Song Feng, Yang Feng, Yansong Feng, Zhangyin Feng, Paulo Fernandes,
Daniel Fernández-González, Raquel Fernández, Elisa Ferracane, Francis Ferraro, Thiago
Castro Ferreira, Olivier Ferret, Nicola Ferro, Elisabetta Fersini, Oluwaseyi Feyisetan, Anjalie
Field, Alejandro Figueroa, Elena Filatova, Simone Filice, Katja Filippova, Andrew Finch,
Catherine Finegan-Dollak, Orhan Firat, Mauajama Firdaus, Mark Fishel, Margaret Fleck,
Lucie Flek, Dan Flickinger, Michael Flor, Radu Florian, Fabian Flöck, Marina Fomicheva,
José A. R. Fonollosa, Erick Fonseca, Marco Aurelio Fonseca, Maxwell Forbes, Tommaso
Fornaciari, Karën Fort, Paula Fortuna, George Foster, Mary Ellen Foster, Anette Frank,
Robert Frank, Stella Frank, Thomas François, Alexander Fraser, Kathleen C. Fraser, Diego
Frassinelli, Dayne Freitag, Markus Freitag, Lea Frermann, Daniel Fried, Annemarie Friedrich,
Jason Fries, Guohong Fu, Liye Fu, Tsu-Jui Fu, Zhenxin Fu, Zihao Fu, Zuohui Fu, Akinori
Fujino, Yoshinari Fujinuma, Atsushi Fujita, Fumiyo Fukumoto, Nancy Fulda, Adam Funk,
Richard Futrell, Michael Färber, Hagen Fürstenau,
Matteo Gabburo, Saadia Gabriel, David Gaddy, Marco Gaido, Núria Gala, Andrea Galassi,
Boris Galitsky, Michel Galley, Matthias Gallé, Pablo Gamallo, Michael Gamon, Chuang
Gan, Leilei Gan, Yujian Gan, Zhe Gan, Kuzman Ganchev, Sudeep Gandhe, Balaji Ganesan,
Devi Ganesan, Suryakanth V Gangashetty, Debasis Ganguly, Cuiyun Gao, Ge Gao, Hanning
Gao, Jun Gao, Qiaozi Gao, Shen Gao, Tianyu Gao, Wei Gao, Xiang Gao, Yang Gao, Yang
Gao, Yifan Gao, Yingbo Gao, Cristina Garbacea, Diego Garcia-Olano, Eva Martínez Garcia,
Marcos Garcia, Matt Gardner, Sarthak Garg, Saurabh Garg, Siddhant Garg, Aparna Garimella,
Ekaterina Garmash, Dan Garrette, Milica Gasic, Albert Gatt, Lorenzo Gatti, Manas Gaur,
Eric Gaussier, Dipesh Gautam, Vasundhara Gautam, Jidong Ge, Tao Ge, Sebastian Gehrmann,
Michaela Geierhos, Alexander Gelbukh, Josef van Genabith, Xinwei Geng, Xiubo Geng,
Ryan Georgi, Kallirroi Georgila, Alborz Geramifard, Kim Gerdes, Ulrich Germann, Felix
Gervits, Mor Geva, Hamidreza Ghader, Raji Ghawi, Sarik Ghazarian, Marjan Ghazvininejad,
Mozhdeh Gheini, Nadia Ghobadipasha, Deepanway Ghosal, Debanjan Ghosh, Sayan Ghosh,
Shaona Ghosh, Sourav Ghosh, Sucheta Ghosh, Daniela Gifu, Daniel Gildea, C Lee Giles,
Salvatore Giorgi, Voula Giouli, Marco Di Giovanni, Adrià de Gispert, Dimitra Gkatzia,
George Gkotsis, Goran Glavaš, Martin Gleize, Kristina Gligoric, Pranav Goel, Rahul Goel,
Vaibhava Goel, Nazli Goharian, Seraphina Goldfarb-Tarrant, Anna Goldie, Dan Goldwasser,
Sharon Goldwater, Sujatha Das Gollapalli, Marcos Goncalves, Lovedeep Gondara, Heng
Gong, Jingjing Gong, Linyuan Gong, Ming Gong, Yeyun Gong, Zhengxian Gong, Ana
Valeria González, Jeff Good, Michael Wayne Goodman, Rob van der Goot, Karthik
Gopalakrishnan, Jonathan Gordon, Philip John Gorinski, Kyle Gorman, Koustava Goswami, Sourabh
Gothe, Cyril Goutte, Amit Goyal, Anuj Goyal, Kartik Goyal, Naman Goyal, Pawan Goyal,
Tanya Goyal, Natalia Grabar, Jorge Gracia, Mario Graff, Yvette Graham, Christophe Gravier,
Edward Grefenstette, Andrej Zukov Gregoric, David Griol, Yulia Grishina, Ralph Grishman,
Alvin Grissom II, Adam Grycner, Stig-Arne Grönroos, Jia-Chen Gu, Jiatao Gu, Jing Gu,
Qing Gu, Shuhao Gu, Yue Gu, Jian Guan, Saiping Guan, Yi Guan, Imane Guellil, Lin Gui,
Vincent Guigue, Bruno Guillaume, Liane Guillou, Camille Guinaudeau, Kristina Gulordava,
Kalpa Gunaratna, Beliz Gunel, Daya Guo, Han Guo, Honglei Guo, Hongyu Guo, Jiang Guo,
Junliang Guo, Qipeng Guo, Quan Guo, Ruocheng Guo, Yinpeng Guo, Yinuo Guo, Zhijiang
Guo, Abhinav Gupta, Ankit Gupta, Arpit Gupta, Arshit Gupta, Raghav Gupta, Sonal Gupta,
Sparsh Gupta, Vivek Gupta, Iryna Gurevych, Suchin Gururangan, Joakim Gustafson, Ximena
Gutierrez-Vasques, Francisco Guzmán, Markus Gärtner, Carlos Gómez-Rodríguez, Jana
Götze, Tunga Güngör,
Jung-Woo Ha, Le An Ha, Thanh-Le Ha, Ivan Habernal, Hatem Haddad, Kais Haddar,
Asmelash Teka Hadgu, Christian Hadiwinoto, Gholamreza Haffari, Michael Hahn, Udo
Hahn, Zhen Hai, Thomas Haider, Jan Hajic, Eva Hajicova, Hannaneh Hajishirzi, Hazem
Hajj, Sherzod Hakimov, Kishaloy Halder, Felix Hamborg, William L. Hamilton, Michael
Hammond, Thierry Hamon, Jialong Han, Kyu Han, Namgi Han, Ting Han, Wenjuan Han,
Xianpei Han, Xiaochuang Han, Xu Han, Abram Handler, Chung-Wei Hang, Viktor Hangya,
Tianyong Hao, Rejwanul Haque, Syed Haque, Sanda Harabagiu, Momchil Hardalov, Randy
Harris, Mareike Hartmann, Matthias Hartung, Thomas Hartvigsen, Sadid A. Hasan, Peter
Hase, Chikara Hashimoto, Saeed-Ul Hassan, Nabil Hathout, Annette Hautli-Janisz, Serhii
Havrylov, Hiroaki Hayashi, Katsuhiko Hayashi, Yoshihiko Hayashi, Devamanyu Hazarika,
Amir Hazem, Ben He, Hangfeng He, Hao He, Hua He, Jiangen He, Junxian He, Luheng
He, Shizhu He, Tianxing He, Xuanli He, Yifan He, Yulan He, Zhengqiu He, Zhongjun He,
Kenneth Heafield, Marti A. Hearst, Michael Heck, Behnam Hedayatnia, Johannes Heinecke,
Benjamin Heinzerling, Jindřich Helcl, James Henderson, Matthew Henderson, Lisa Anne
Hendricks, Simon Hengchen, Leonhard Hennig, Nico Herbig, Christian Herold, Teresa
Herrmann, Daniel Hershcovich, Jonathan Herzig, Jack Hessel, Gerhard Heyer, Remu Hida,
Christopher Hidey, Djoerd Hiemstra, Ryuichiro Higashinaka, Bertrand Higy, Tsutomu Hirao,
Tatsuya Hiraoka, Graeme Hirst, Sorami Hisamoto, Kasia Hitczenko, Lydia-Mai Ho-Dac,
Tin Kam Ho, Cong Duy Vu Hoang, Cuong Hoang, Julia Hockenmaier, Johannes Hoffart,
Chris Hokamp, Eben Holderness, Nora Hollenstein, Kristy Hollingshead, Laura Hollink,
Ari Holtzman, Christopher Homan, Takeshi Homma, Dezhi Hong, Kai Hong, Yu Hong,
Mark Hopkins, Enamul Hoque, Helmut Horacek, Ales Horak, Mohammad Javad Hosseini,
Saghar Hosseini, Veronique Hoste, Feng Hou, Lei Hou, Yufang Hou, Yutai Hou, Dirk Hovy,
David M. Howcroft, Christine Howes, Estevam Hruschka, Chao-Chun Hsu, I-Hung Hsu,
Wei-Ning Hsu, Phu Mon Htut, Baotian Hu, Bojie Hu, Changjian Hu, Changwei Hu, Chi
Hu, Guangneng Hu, Hai Hu, Huang Hu, Jennifer Hu, Jinyi Hu, Mengting Hu, Minghao Hu,
Pengwei Hu, Po Hu, Renfen Hu, Wenpeng Hu, Zhe Hu, Zhiting Hu, Ziniu Hu, Xinyu Hua,
Yiqing Hua, Chenyang Huang, Chieh-Yang Huang, Chung-Chi Huang, Fei Huang, Guoping
Huang, Haoran Huang, Hen-Hsen Huang, Heyan Huang, Jimmy Xiangji Huang, Jing Huang,
Jizhou Huang, Kuan-Hao Huang, Liang Huang, Lifu Huang, Luyao Huang, Minlie Huang,
Po-Yao Huang, Qingbao Huang, Ruihong Huang, Shujian Huang, Siyu Huang, Xiaolei
Huang, Xinting Huang, Xuanjing Huang, Yi-Ting Huang, Yongfeng Huang, Yufang Huang,
Zhongqiang Huang, Ziming Huang, Luwen (Vivian) Huangfu, Patrick Huber, Matthias Huck,
Kai Hui, Zhen Hui, Ben Hutchinson, Jena D. Hwang, Seung-won Hwang, Sung Ju Hwang,
Mika Hämäläinen, Ali Hürriyetoğlu,
Ignacio Iacobacci, Nancy Ide, Adrian Iftene, Oana Ignat, Ryu Iida, Gabriel Ilharco, Filip
Ilievski, Dmitry Ilvovsky, Kenji Imamura, Muhammad Imran, Oana Inel, Diana Inkpen, Koji
Inoue, Naoya Inoue, Kentaro Inui, Radu Tudor Ionescu, Maxim Ionov, Daphne Ippolito,
Tatsuya Ishigaki, Aminul Islam, Tunazzina Islam, Hayate Iso, Dan Iter, Takumi Ito, Lubomir
Ivanov, Julia Ive, Tomoya Iwakura, Kenichi Iwatsuki, Srinivasan Iyer, Mohit Iyyer,
Cassandra L. Jacobs, Gilles Jacobs, Jeff Jacobs, Alon Jacovi, Aaron Jaech, Abhyuday
Jagannatha, Labiba Jahan, Kokil Jaidka, Prachi Jain, Sarthak Jain, Mimansa Jaiswal, Shoaib
Jameel, Abhik Jana, Hyeju Jang, Maciej Janicki, David Janiszek, Sujay Kumar Jauhar, Tommi
Jauhiainen, Arun Kumar Jayapal, Sébastien Jean, Hwisang Jeon, Sungho Jeon, Minwoo Jeong,
Yacine Jernite, Kevin Jesse, Rahul Jha, Donghong Ji, Feng Ji, Yangfeng Ji, Zongcheng Ji,
Chen Jia, Robin Jia, Ruipeng Jia, Shengbin Jia, Yuxiang Jia, Zixia Jia, Sittichai Jiampojamarn,
Ping Jian, Daxin Jiang, Jing Jiang, Jyun-Yu Jiang, Meng Jiang, Nanjiang Jiang, Zhengbao
Jiang, Zhuoren Jiang, Zhuoxuan Jiang, Pengfei Jiao, Wenxiang Jiao, Zhanming Jie, Di Jin,
Lifeng Jin, Lisa Jin, Peng Jin, Qin Jin, Xiaolong Jin, Zhijing Jin, Ishan Jindal, Baoyu Jing,
Liping Jing, Anna Jobin, Charles Jochim, Anders Johannsen, Richard Johansson, Melvin
Johnson, Nebojsa Jojic, Kristiina Jokinen, Erik Jones, Gareth Jones, Siddhartha Reddy
Jonnalagadda, Arne Jonsson, Aditya Joshi, Mandar Joshi, Dhanya Jothimani, Shafiq Joty, Meizhi
Ju, Xincheng Ju, Yingnan Ju, Jaap Jumelet, Heewoo Jun, Kyomin Jung, Taehee Jung, Zhu
Junguo, David Jurgens, Prathyusha Jwalapuram, Preethi Jyothi, Lena Jäger,
Besim Kabashi, Alexandre Kabbach, Jad Kabbara, Sushant Kafle, Sylvain Kahane, Ivana
Kajic, Tomoyuki Kajiwara, Mihir Kale, Oren Kalinsky, Aikaterini-Lida Kalouli, Ehsan
Kamalloo, Herman Kamper, Jaap Kamps, Min-Yen Kan, Hiroshi Kanayama, Masahiro Kaneko,
Jenna Kanerva, Jaewoo Kang, Xiaomian Kang, Katharina Kann, Ryuji Kano, Yoshinobu
Kano, Evangelos Kanoulas, Pavan Kapanipathi, Micaela Kaplan, Pinar Karagoz, Alina
Karakanta, Svebor Karaman, Giannis Karamanolakis, Siddharth Karamcheti, Mladen Karan,
Sarvnaz Karimi, Younes Karimi, Börje Karlsson, Saurav Karmakar, Shubhra Kanti
Karmaker, Sanjeev Kumar Karn, Jungo Kasai, Omid Kashefi, Zdeněk Kasner, Nora Kassner,
Denys Katerenchuk, Anoop Katti, David Kauchak, Divyansh Kaushik, Pride Kavumba,
Daisuke Kawahara, Efsun Sarioglu Kayi, Hideto Kazawa, Ashkan Kazemi, Pei Ke, Katherine
Keith, Simon Keizer, Aniruddha Kembhavi, Brendan Kennedy, Casey Kennington, Tom
Kenter, Daniel Kershaw, Santosh Kesiraju, Vaibhav Kesri, Madian Khabsa, Shahram Khadivi,
Salam Khalifa, Sammy Khalife, Maxim Khalilov, Dinesh Khandelwal, Aparna Khare, Daniel
Khashabi, Khalid Al Khatib, Alizishaan Khatri, Chandra Khatri, Tushar Khot, Ashiqur
KhudaBukhsh, Douwe Kiela, Halil Kilicoglu, Byeongchang Kim, Donghwan Kim, Gunhee
Kim, Hansaem Kim, Hyounghun Kim, Hyunwoo Kim, Jihyuk Kim, Jin-Dong Kim, Joo-
Kyung Kim, Jung-Jae Kim, Juyong Kim, Najoung Kim, Seokhwan Kim, Sun Kim, Sundong
Kim, Taeuk Kim, Daniel King, Tracy Holloway King, Christo Kirov, Nikita Kitaev, Beata
Beigman Klebanov, Ayal Klein, Bennett Kleinberg, Jan-Christoph Klie, Roman Klinger,
Julien Kloetzer, Kevin Knight, Alistair Knott, Rebecca Knowles, Miyoung Ko, Hayato
Kobayashi, Sosuke Kobayashi, Thomas Kober, Elena Kochkina, Ekaterina Kochmar, Vid
Kocijan, Jordan Kodner, Philipp Koehn, Rob Koeling, Svetla Koeva, Mare Koit, Noriyuki
Kojima, Dimitrios Kokkinakis, Dorothea Kolossa, Mamoru Komachi, Kazunori Komatani,
Rik Koncel-Kedziorski, Grzegorz Kondrak, Fang Kong, Lingkai Kong, Miloslav Konopik,
Ioannis Konstas, Parisa Kordjamshidi, Valia Kordoni, Yuta Koreeda, Mandy Korpusik,
Katsunori Kotani, Bhushan Kotnis, Fajri Koto, Neema Kotonya, Alexander Kotov, George Kour,
Olga Kovaleva, Venelin Kovatchev, Zornitsa Kozareva, Jared Kramer, Bernhard Kratzwald,
Sebastian Krause, Elisa Kreiss, Simon Krek, Ralf Krestel, Julia Kreutzer, Amrith Krishna,
Kalpesh Krishna, Jayant Krishnamurthy, Rajasekar Krishnamurthy, Nikhil Krishnaswamy,
Reno Kriz, Canasai Kruengkrai, Udo Kruschwitz, Anna Kruspe, Germán Kruszewski,
Wojciech Kryscinski, Alexander Ku, Lun-Wei Ku, Da Kuang, Marco Kuhlmann, Roland Kuhn,
Seth Kulick, Ilia Kulikov, Malhar Kulkarni, Mayank Kulkarni, Artur Kulmizev, Saurabh
Kulshreshtha, Abhay Kumar, Abhishek Kumar, Adarsh Kumar, Ashutosh Kumar, Sachin
Kumar, Sawan Kumar, Shankar Kumar, Sumeet Kumar, Varun Kumar, Vishwajeet Kumar,
Jonathan K. Kummerfeld, Anoop Kunchukuttan, Adhiguna Kuncoro, Souvik Kundu, Florian
Kunneman, Tsung-Ting Kuo, Murathan Kurfalı, Tatsuki Kuribayashi, Mikko Kurimo, Shuhei
Kurita, Sadao Kurohashi, Ugur Kursuncu, Aditya Kusupati, Kordula De Kuthy, Mucahid
Kutlu, Andrey Kutuzov, Haewoon Kwak, Tom Kwiatkowski, Hongseok Kwon, Arne Köhn,
Caterina Lacerra, Cheng-I Lai, Yuxuan Lai, Chiraag Lala, Divesh Lala, John P. Lalor,
Tsz Kin Lam, Wai Lam, Hemank Lamba, Vasileios Lampos, Gerasimos Lampouras, Wuwei
Lan, Yunshi Lan, Frédéric Landragin, Phillippe Langlais, Ni Lao, Mirella Lapata, Gabriella
Lapesa, Ekaterina Lapshinova-Koltunski, François Lareau, Brian Larson, Stefan Larson,
Kornel Laskowski, Mark Last, Luis Lastras, Jey Han Lau, Michael A. Laurenzano, Anne
Lauscher, Hady Lauw, Alberto Lavelli, Carolin Lawrence, John Lawrence, Dawn Lawrie,
Angeliki Lazaridou, Hung Le, Phong Le, Kevin Leach, Chong Min Lee, Dongkyu Lee,
Dongyub Lee, Fei-Tzin Lee, Grandee Lee, Hung-yi Lee, Hwaran Lee, I-Ta Lee, Jay Yoon
Lee, Jeong Min Lee, Ji-Ung Lee, Jihwan Lee, Jinhyuk Lee, John Lee, Jongwuk Lee, Kyung-
jae Lee, Lung-Hao Lee, Mina Lee, Minwoo Lee, Moontae Lee, Nayeon Lee, Roy Ka-Wei
Lee, Sungjin Lee, Yoonhyung Lee, Young-Suk Lee, Els Lefever, Fabrice Lefèvre, Jie Lei,
Wenqiang Lei, Jochen L. Leidner, Alessandro Lenci, Yichong Leng, Ben Lengerich, Chee
Wee Leong, Yves Lepage, Haley Lepp, Piyawat Lertvittayakumjorn, Gregor Leusch, Jake
Lever, Lori Levin, Tomer Levinboim, Rivka Levitan, Sarah Ita Levitan, Gina-Anne Levow,
Omer Levy, Ran Levy, Roger Levy, Mike Lewis, Patrick Lewis, Miryam de Lhoneux, Baoli
Li, Bei Li, Bryan Li, Chang Li, Chen Li, Cheng-Te Li, Chenliang Li, Dianqi Li, Dongfang
Li, Fangtao Li, Fei Li, Feng-Lin Li, Haizhou Li, Hang Li, Hao Li, Haoran Li, Haoran Li,
Hongzheng Li, Huayang Li, Irene Li, Jinchao Li, Jing Li, Jiyi Li, Juncheng Li, Junhui Li,
Juntao Li, Junyi Jessy Li, Kun Li, Lei Li, Lei Li, Liangyou Li, Manling Li, Maoxi Li, Mu Li,
Pan Li, Peifeng Li, Peng Li, Piji Li, Qi Li, Quanzhi Li, Raymond Li, Ruijiang Li, Ruizhe Li,
Runnan Li, Shaohua Li, Sheng Li, Shuangyin Li, Si Li, Sujian Li, Tao Li, Tianrui Li, Toby
Jia-Jun Li, Wei Li, Wenjie Li, Xiang Li, Xiang Lisa Li, Xiang Lorraine Li, Xiao Li, Xiaoya Li,
Xin Li, Xintong Li, Xiujun Li, Xue Li, Yang Li, Yang Li, Yanzeng Li, Yaoyiran Li, Yingjie
Li, Yingya Li, Yinqiao Li, Yitong Li, Yuliang Li, Yunyao Li, Zhenghua Li, Zhongyang Li,
Zichao Li, Zongxi Li, Maria Liakata, Bin Liang, Chao-Chun Liang, Chen Liang, Davis Liang,
Paul Pu Liang, Xiaobo Liang, Xiaodan Liang, Yunlong Liang, Zhicheng Liang, Lizi Liao,
Jindřich Libovický, Mohamed Lichouri, Chaya Liebeskind, Luca Di Liello, Constantine
Lignos, Anne-Laure Ligozat, Gilbert Lim, Kwan Hui Lim, Nut Limsopatham, Angela Lin,
Bill Yuchen Lin, Chenghua Lin, Chin-Yew Lin, Chu-Cheng Lin, Chuan-Jie Lin, Hongfei Lin,
Hongyu Lin, Jimmy Lin, Kevin Lin, Kevin Lin, Lucy Lin, Peiqin Lin, Xiang Lin, Yankai Lin,
Ying Lin, Zehao Lin, Zhouhan Lin, Zi Lin, Tal Linzen, Marco Lippi, Thomas Lippincott,
Zachary Lipton, Pierre Lison, Robert Litschko, Marina Litvak, Bin Liu, Bing Liu, Bing
Liu, ChangJian Liu, Chi-Liang Liu, Dayiheng Liu, Dexi Liu, Fangyu Liu, Fei Liu, Fei Liu,
Feifan Liu, Haochen Liu, Haokun Liu, Haoyan Liu, Jiachang Liu, Jiahua Liu, Jiangming Liu,
Jing Liu, Jingzhou Liu, Kang Liu, Lemao Liu, Ling Liu, Linqing Liu, Maofu Liu, Nelson
F. Liu, Peng Liu, Pengfei Liu, Pengfei Liu, Peter Liu, Qian Liu, Qian Liu, Qianchu Liu,
Quan Liu, Qun Liu, Tianyi Liu, Tianyu Liu, Tie-Yan Liu, Ting Liu, Weijie Liu, Weiyang Liu,
Xianggen Liu, Xiao Liu, Xiaodong Liu, Xuebo Liu, Xueqing Liu, Yan Liu, Yang Liu, Yang
Liu, Yang Liu, Ye Liu, Ye Liu, Yijia Liu, Yong Liu, Zemin Liu, Zhenghao Liu, Zhengyuan
Liu, Zhengzhong Liu, Zhiyuan Liu, Zhiyuan Liu, Zhuang Liu, Zihan Liu, Zitao Liu, Zoey
Liu, Nikola Ljubešić, Kyle Lo, Damien Lolive, Guodong Long, Lucelene Lopes, Marcos
Lopes, Jaime Lorenzo-Trueba, Annie Louis, Daniel Loureiro, Ismini Lourentzou, Pablo
Loyola, Sharid Loáiciga, Jiasen Lu, Jing Lu, Junyu Lu, Qin Lu, Wei Lu, Yanbin Lu, Yao Lu,
Yaojie Lu, Yu Lu, Yi Luan, Nurul Lubis, Alexandra Luccioni, Li Lucy, Cheng Luo, Jiebo
Luo, Ling Luo, Ping Luo, Renqian Luo, Robin Luo, Ruotian Luo, Wencan Luo, Yuan Luo,
Zhunchen Luo, Anh Tuan Luu, Kelvin Luu, Shangwen Lv, Chunchuan Lyu, Samuel Läubli,
Danni Ma, Jianqiang Ma, Lianbo Ma, Martin Ma, Mingbo Ma, Nianzu Ma, Qianli Ma,
Qianwen Ma, Shuming Ma, Tengfei Ma, Wei-Yun Ma, Xiaofei Ma, Xinyin Ma, Xuezhe Ma,
Yun Ma, Ismail El Maarouf, Sean MacAvaney, Wolfgang Macherey, Aman Madaan, Avinash
Madasu, Mounica Maddela, Nitin Madnani, Andrea Madotto, Walid Magdy, Manuel Mager,
Pierre Magistry, Måns Magnusson, Diwakar Mahajan, Suchismit Mahapatra, Adyasha
Maharana, Debanjan Mahata, Ayush Maheshwari, Kyle Mahowald, Jean Maillard, Bodhisattwa
Prasad Majumder, Navonil Majumder, Peter Makarov, Márton Makrai, Prodromos
Malakasiotis, Chaitanya Malaviya, Andreas Maletti, Ankur Mali, Igor Malioutov, Itzik Malkiel,
Eric Malmi, Christopher Malon, Rob Malouf, Valentin Malykh, Radhika Mamidi, Emma
Manning, Irene Manotas, Elman Mansimov, Saab Mansour, Ramesh Manuvinakurike, Emaad
Manzoor, Jiaxin Mao, Runze Mao, Wenji Mao, Yuning Mao, Yuren Mao, Zhendong Mao,
Vladislav Maraev, Ana Marasović, Piotr Mardziel, Katerina Margatina, Alda Mari, Benjamin
Marie, Alex Marin, Vukosi Marivate, David Martinez, Giovanni Da San Martino, Bruno
Martins, Pedro Henrique Martins, Eugenio Martínez-Cámara, Marco Maru, Sameen Maruf,
Fiammetta Marulli, Claudia Marzi, Aleksandre Maskharashvili, Maraim Masoud, Matthew
Matero, Lambert Mathias, Sandeep Mathias, Nitika Mathur, Prashant Mathur, David Martins
de Matos, Sérgio Matos, Yuji Matsumoto, Takuya Matsuzaki, Yevgen Matusevych, Evgeny
Matusov, Rowan Hall Maudslay, Mausam, Jonathan May, Stephen Mayhew, Joshua Maynez,
Karen Mazidi, Sahisnu Mazumder, Alessandro Mazzei, Diana McCarthy, David McClosky,
John P. McCrae, Kate McCurdy, Matthew McDermott, David McDonald, Clifton McFate,
Jered McInerney, Bridget McInnes, Kathleen McKeown, Michael McTear, Sara Meftah,
Yashar Mehdad, Alexander Mehler, Shikib Mehri, Nikhil Mehta, Sachin Mehta, Sneha Mehta,
Clara Meister, Dheeraj Mekala, Gerard de Melo, Julia Mendelsohn, Arul Menezes, Telmo
Menezes, Fandong Meng, Rui Meng, Tao Meng, Yu Meng, Zhao Meng, Xue Mengge,
Rakesh Radhakrishnan Menon, Amil Merchant, Danny Merkx, Paola Merlo, William Merrill,
Mohsen Mesgar, Angeliki Metallinou, Florian Metze, Donald Metzler, Marie-Jean Meurs,
Lars Meyer, Adam Meyers, Haitao Mi, Yishu Miao, Yisong Miao, Julian Michael, Lesly
Miculicich, Sabrina Mielke, Margot Mieskes, Rada Mihalcea, Todor Mihaylov, Tsvetomila
Mihaylova, Nandana Mihindukulasooriya, Claudiu Mihăilă, Martina Miliani, Evangelos
Milios, Simon Mille, Corey Miller, Tristan Miller, Alice Millour, Gregory Mills, Emiel van
Miltenburg, Eleni Miltsakaki, Farjana Sultana Mim, David Mimno, Bonan Min, Sewon Min,
Koji Mineshima, SeyedAbolghasem Mirroshandel, Paramita Mirza, Abhijit Mishra, Pushkar
Mishra, Rohan Mishra, Swaroop Mishra, Abhinav Misra, Jeff Mitchell, Verginica Barbu
Mititelu, Jelena Mitrović, Sudip Mittal, Vibhu Mittal, Makoto Miwa, Yusuke Miyao, Daichi
Mochihashi, Ashutosh Modi, Sarah Moeller, Hans Moen, Aditya Mogadala, Nikita Moghe,
Abdelrahman Mohamed, Saif Mohammad, Mahmoud Mohammadi, Alireza
Mohammadshahi, Mrinal Mohit, Tasnim Mohiuddin, Michael Mohler, Diego Molla, Francis Mollica,
Monica Monachini, Nicholas Monath, Joel Ruben Antony Moniz, Manuel Montes, Emilio
Monti, Johanna Monti, Il-Chul Moon, Seungwhan Moon, Raymond Mooney, Andrew Moore,
Nafise Sadat Moosavi, Richard Moot, Steven Moran, Erwan Moreau, Antonio Moreno-Ortiz,
Jose G. Moreno, Junichiro Mori, Renato De Mori, Véronique Moriceau, Emmanuel Morin,
Makoto Morishita, Hajime Morita, John Morris, David R. Mortensen, Ahmadreza
Mosallanezhad, Marius Mosbach, Alessandro Moschitti, Masud Moshtaghi, Larry Moss, Lili Mou,
Diego Moussallem, Khalil Mrini, Jesse Mu, Jiaqi Mu, Hamdy Mubarak, Pramod Kaushik
Mudrakarta, David Mueller, Matteo Muffo, Aldrian Obaja Muis, Animesh Mukherjee, Phoebe
Mulcaire, Matthew Mulholland, Benjamin Muller, Philippe Muller, Varish Mulwad, Koji
Murakami, Yugo Murawaki, Jamie Murdoch, Smaranda Muresan, Kenton Murray, Rudra
Murthy, Shikhar Murty, Tomáš Musil, Rafael Muñoz-Guillena, Agnieszka Mykowiecka,
Sheshera Mysore, Lluís Màrquez, Luisa März, Mark-Christoph Müller, Mathias Müller,
Thomas Müller,
Anandhavelu N, Farah Nadeem, Nona Naderi, Ryo Nagata, Ajay Nagesh, Aakanksha Naik,
Saeed Najafi, Tetsuji Nakagawa, Satoshi Nakamura, Mikio Nakano, Yukiko Nakano, Preslav
Nakov, Ramesh Nallapati, Udhyakumar Nallasamy, Feng Nan, Guoshun Nan, Nikita Nangia,
Courtney Napoles, Diane Napolitano, Jason Naradowsky, Shashi Narayan, Franco Maria
Nardini, Tahira Naseem, Jamal Abdul Nasir, Sudip Naskar, Alexis Nasr, Tristan Naumann, Borja
Navarro-Colorado, Roberto Navigli, Mark-Jan Nederhof, Matteo Negri, Isar Nejadgholi,
Preksha Nema, Aida Nematzadeh, Ani Nenkova, Guenter Neumann, Mariana Neves, Hwee
Tou Ng, Jun-Ping Ng, Vincent Ng, Minh-Quoc Nghiem, Axel-Cyrille Ngonga Ngomo, Dang
Tuan Nguyen, Dat Quoc Nguyen, Dong Nguyen, Huyen Nguyen, Kim Anh Nguyen, Thanh
Nguyen, Thanh-Tung Nguyen, Thien Huu Nguyen, Toan Q. Nguyen, Truc-Vien T. Nguyen,
Trung Hieu Nguyen, Viet-An Nguyen, Jianmo Ni, Eric Nichols, Garrett Nicolai, Massimo
Nicosia, Vlad Niculae, Feng Nie, Jian-Yun Nie, Yixin Nie, Jan Niehues, Christina Niklaus,
Giannis Nikolentzos, Nikola I. Nikolov, Vassilina Nikoulina, Qiang Ning, Lasguido Nio,
Nobal B. Niraula, Kosuke Nishida, Kyosuke Nishida, Noriki Nishida, Masaaki Nishino,
Sergiu Nisioi, Malvina Nissim, Tong Niu, Xing Niu, Zheng-Yu Niu, Timothy Niven, Joakim
Nivre, Hiroshi Noji, Tadashi Nomoto, Rik van Noord, Damien Nouvel, Jekaterina Novikova,
Debora Nozza, Pierre Nugues, Claire Nédellec, Aurélie Névéol,
Alexander O’Connor, Brendan O’Connor, Tim O’Gorman, Daniel Oberski, Jose Ochoa-
Luna, Yusuke Oda, Kemal Oflazer, Maciej Ogrodniczuk, Barlas Oguz, Alice Oh, Yoo Rhee
Oh, Tomoko Ohkuma, Kiyonori Ohtake, Naoaki Okazaki, Manabu Okumura, Oleg Okun,
Hugo Gonçalo Oliveira, Ethel Ong, Yasumasa Onoe, Juri Opitz, Shereen Oraby, Constantin
Orasan, Matan Orbach, John Ortega, Petya Osenova, Robert Östling, Naoki Otani, Myle Ott,
Zhijian Ou, Hiroki Ouchi, Nedjma Ousidhoum, Jessica Ouyang, Lilja Øvrelid,
Avinesh P.V.S, Deepak P, Maria Leonor Pacheco, Inkit Padhi, Aishwarya Padmakumar,
Gustavo Henrique Paetzold, Patrizia Paggio, Arindam Pal, Santanu Pal, Alexis Palmer,
Martha Palmer, Endang Pamungkas, Liangming Pan, Xiaoman Pan, Yi-Cheng Pan, Vivek
Pandit, Vinay Pandramish, Liang Pang, Richard Yuanzhe Pang, Ludovica Pannitto, Haris
Papageorgiou, Pinelopi Papalampidi, Alexandros Papangelis, Nikos Papasarantopoulos,
Nikolaos Pappas, Emerson Paraiso, Bhargavi Paranjape, Georgios Paraskevopoulos,
Letitia Parcalabescu, Natalie Parde, Antonio Pareja-Lora, Ankur P. Parikh, Haeju Park, Ji Ho
Park, Jong Park, Joonsuk Park, Jungsoo Park, Kunwoo Park, Lucy Park, Seong-Bae Park,
Serim Park, Sungjoon Park, Youngja Park, Yannick Parmentier, Patrick Paroubek, Ioannis
Partalas, Prasanna Parthasarathi, Gabriella Pasi, Tommaso Pasini, Peyman Passban, Rebecca
J. Passonneau, Ramakanth Pasunuru, Panupong Pasupat, Raj Patel, Roma Patel, Siddharth
Patki, Barun Patra, Braja Gopal Patra, Jasabanta Patro, Viviana Patti, Siddharth Patwardhan,
Matthias Paulik, Adam Pauls, Silviu Paun, Ellie Pavlick, John Pavlopoulos, Adam Pease,
Pavel Pecina, Ted Pedersen, Jiaxin Pei, Stephan Peitz, Viktor Pekar, Baolin Peng, Hao Peng,
Haoruo Peng, Nanyun Peng, Siyao Peng, Wei Peng, Xi Peng, Xutan Peng, Yifan Peng, Gerald
Penn, Raffaele Perego, Martin Pereira-Fariña, Lis Kanashiro Pereira, Vittorio Perera, Laura
Perez-Beltrachini, Olatz Perez-de-Viñaspre, Gabriele Pergola, Denis Peskov, Ben Peters,
Matthew Peters, Matthias Petri, Fabio Petroni, Slav Petrov, Miriam R L Petruck, Maxime
Peyrard, Jonas Pfeiffer, Quang Nhat Minh Pham, Maciej Piasecki, Giulio Ermanno Pibiri,
Massimo Piccardi, Karl Pichotta, Mohammad Taher Pilehvar, Ildikó Pilán, Tiago Pimentel,
Mārcis Pinnis, Juan Pino, Yuval Pinter, Irina Piontkovskaya, Dhivya Piraviperumal, Telmo
Pires, Flammie Pirinen, Vito Pirrelli, Miruna Pislar, Emily Pitler, Lidia Pivovarova, Benjamin
Piwowarski, Barbara Plank, Lonneke van der Plas, Laura Plaza, Bryan Plummer, Brian Plüss,
Lahari Poddar, Nikolaus Poechhacker, Massimo Poesio, Thierry Poibeau, Adam Poliak,
Senja Pollak, Lucie Poláková, Girishkumar Ponkiya, Maria Pontiki, Simone Paolo Ponzetto,
Hoifung Poon, Kashyap Popat, Maja Popović, Fred Popowich, Soujanya Poria, François
Portet, Christopher Potts, Nima Pourdamghani, Sandhya Prabhakaran, Vinodkumar Prabhakaran,
Sameer Pradhan, Animesh Prasad, Judita Preiss, Daniel Preotiuc-Pietro, Ofir Press,
Emily Prud’hommeaux, Danish Pruthi, Piotr Przybyła, Michal Ptaszynski, Ratish Puduppully,
Rajkumar Pujari, Hemant Purohit, Matthew Purver, James Pustejovsky, Valentina Pyatkin,
Juan Antonio Pérez-Ortiz,
Ashequl Qadir, Fanchao Qi, Jianzhong Qi, Dong Qian, Tieyun Qian, Yujie Qian, Chao
Qiao, Bing Qin, Guanghui Qin, Lianhui Qin, Libo Qin, Qi Qin, Tao Qin, Liang Qiu, Likun
Qiu, Long Qiu, Minghui Qiu, Xipeng Qiu, Yunqi Qiu, Zimeng Qiu, Chen Qu, Yanru Qu,
Xiaojun Quan, Martí Quixal,
Ella Rabinovich, Alexandre Rademaker, Gorjan Radevski, Will Radford, Bardia Rafieian,
Alessandro Raganato, Preethi Raghavan, Dinesh Raghu, Afshin Rahimi, Zahra Rahimi, Altaf
Rahman, Muhammad Rahman, Dheeraj Rajagopal, Shahab Raji, Nitendra Rajput, Taraka
Rama, Deepak Ramachandran, Anil Ramakrishna, Ganesh Ramakrishnan, Rohan Ramanath,
Owen Rambow, Diego Ramirez-Echavarria, Gabriela Ramirez-de-la-Rosa, Carlos Ramisch,
Alan Ramponi, Surangika Ranathunga, Priya Rani, Jinfeng Rao, Yanghui Rao, Ari Rappoport,
Ahmad Rashid, Hannah Rashkin, Abhinav Rastogi, Sadaf Abdul Rauf, Vikas Raunak, Shauli
Ravfogel, Sujith Ravi, Abhilasha Ravichander, Manikandan Ravikiran, Vinit Ravishankar,
Avik Ray, Soumya Ray, Manny Rayner, Paul Rayson, Julia Rayz, Simon Razniewski, Livy
Real, Traian Rebedea, Clement Rebuffel, Marta Recasens, Florence Reeder, Ines Rehbein,
Georg Rehm, Marek Rei, Roi Reichart, Emily Reif, Paul Reisert, Nils Reiter, Norbert Reithinger,
David Reitter, Navid Rekabsaz, Da Ren, Feiliang Ren, Pengjie Ren, Shuhuai Ren,
Shuo Ren, Xiang Ren, Yafeng Ren, Yuanhang Ren, Zhaochun Ren, Adithya Renduchintala,
Philip Resnik, Luis Reyes-Galindo, Martin Reynaert, Robert Reynolds, Kiamehr Rezaee,
Eugénio Ribeiro, Leonardo F. R. Ribeiro, Manuel Sam Ribeiro, Marco Tulio Ribeiro, Corentin
Ribeyre, Giuseppe Riccardi, Kyle Richardson, Matthew Richardson, Caitlin Richter,
Sebastian Riedel, Martin Riedl, Jason Riesa, German Rigau, Shruti Rijhwani, Matīss Rikters,
Laura Rimell, Fabio Rinaldi, Annette Rios, Anthony Rios, Julian Risch, Alan Ritter, Molly
Roberts, Gil Rocha, Pedro Rodriguez, Melissa Roemmele, Anna Rogers, Omid Rohanian,
Oleg Rokhlenko, Roland Roller, Stephen Roller, Alexey Romanov, Laurent Romary,
Salvatore Romeo, Srikanth Ronanki, Wenge Rong, Subendhu Rongali, Francesco Ronzano,
Rudolf Rosa, Andrew Rosenberg, Sara Rosenthal, Candace Ross, Sophie Rosset, Paolo Rosso,
Aiala Rosá, Dan Roth, Michael Roth, Hossein Rouhizadeh, Masoud Rouhizadeh, Adam
Roussel, Joseph Le Roux, Aurko Roy, Subhro Roy, Jos Rozen, Alla Rozovskaya, Raphael
Rubino, Sebastian Ruder, Rachel Rudinger, Koustav Rudra, Frank Rudzicz, Jack Rueter,
Ivan Vladimir Meza Ruiz, Josef Ruppenhofer, Vasile Rus, Irene Russo, Attapol Rutherford,
Tatyana Ruzsics, Max Ryabinin, Maria Ryskina, Hee Jung Ryu, Andreas Rücklé,
Masoud Jalili Sabet, Mrinmaya Sachan, Fatiha Sadat, Arka Sadhu, Mehrnoosh Sadrzadeh,
Marzieh Saeidi, Tara Safavi, Sylvie Saget, Horacio Saggion, Benoît Sagot, Koustuv Saha,
Monjoy Saha, Punyajoy Saha, Sriparna Saha, Tanay Kumar Saha, Saurav Sahay, Gözde
Şahin, Gaurav Sahu, Sunil Kumar Sahu, Keisuke Sakaguchi, Mohammad Salameh, Elizabeth
Salesky, Avneesh Saluja, Tanja Samardzic, Rajhans Samdani, Niloofar Safi Samghabadi,
Younes Samih, Ramon Sanabria, George Sanchez, Germán Sanchis-Trilles, Victor Sanh,
Chinnadhurai Sankar, Sashank Santhanam, Marina Santini, Cicero Nogueira dos Santos,
T.Y.S.S Santosh, Bishal Santra, Sebastin Santy, Maarten Sap, Naomi Saphra, Maya Sappelli,
Murat Saraclar, Anoop Sarkar, Kamal Sarkar, Prathusha K Sarma, Felix Sasaki, Shota Sasaki,
Ryohei Sasano, Danielle Saunders, Agata Savary, Denis Savenkov, Aleksandar Savkov, Ramit
Sawhney, Apoorv Saxena, Asad Sayeed, Kevin Scannell, Bianca Scarlini, Carolina Scarton,
Thomas Schaaf, Shigehiko Schamoni, Thomas Schatz, Tatjana Scheffler, Yves Scherrer, Timo
Schick, David Schlangen, Dominik Schlechtweg, Viktor Schlegel, Natalie Schluter, Helmut
Schmid, Martin Schmitt, Tyler Schnoebelen, Steven Schockaert, Annika Marie Schoene,
Mirco Schoenfeld, Alexandra Schofield, Marc Schulder, William Schuler, Claudia Schulz,
Hannes Schulz, Elliot Schumacher, Sebastian Schuster, Tal Schuster, Ineke Schuurman, H.
Andrew Schwartz, Lane Schwartz, Roy Schwartz, Robert Schwarzenberg, Djamé Seddah,
João Sedoc, Abigail See, Elad Segal, Satoshi Sekine, Ethan Selfridge, Thibault Sellam, David
Semedo, Olga Seminck, Nasredine Semmar, Cansu Sen, Prithviraj Sen, Shubhashis Sengupta,
Rico Sennrich, Minjoon Seo, Yeon Seonwoo, Gwenaelle Cunha Sergio, Abhishek Sethi, Lei
Sha, Mahsa Shafaei, Pararth Shah, Samira Shaikh, Igor Shalyminov, Chao Shang, Jingbo
Shang, Mingyue Shang, Nan Shao, Yingxia Shao, Yutong Shao, Ori Shapira, Naomi Shapiro,
Amr Sharaf, Matthew Shardlow, Abhishek Sharma, Arpit Sharma, Ashish Sharma, Piyush
Sharma, Soumya Sharma, Serge Sharoff, Peter Shaw, Lanbo She, Kim Cheng Sheang, Artem
Shelmanov, Aili Shen, Dinghan Shen, Gehui Shen, Hua Shen, Jiaming Shen, Qinlan Shen,
Sheng Shen, Shiqi Shen, Siqi Shen, Tao Shen, Weizhou Shen, Xiaoyu Shen, Yatian Shen,
Yilin Shen, Emily Sheng, Bei Shi, Chuan Shi, Haoyue Shi, Peng Shi, Shuming Shi, Tianze Shi,
Weijia Shi, Weiyan Shi, Xiaodong Shi, Xing Shi, Yangyang Shi, Zhan Shi, Zhouxing Shi,
Chihiro Shibata, Tomohide Shibata, Anastasia Shimorina, Jamin Shin, Prashant Shiralkar, Boaz
Shmueli, Abu Awal Md Shoeb, Linjun Shou, Mohit Shridhar, Manish Shrivastava, Ritvik
Shrivastava, Dimitar Shterionov, Kai Shu, Lei Shu, Raphael Shu, Kurt Shuster, Alexander
Shvets, Vered Shwartz, Chenglei Si, Mei Si, Aditya Siddhant, Advaith Siddharthan, Georgios
Sidiropoulos, Candy Sidner, Melanie Siegel, Avi Sil, Max Silberztein,
Miikka Silfverberg, Eliezer de Souza da Silva, Fabrizio Silvestri, Michel Simard, Patrick
Simianer, Kathleen Siminyu, Goncalo Simoes, Dan Simonson, Matthew Sims, Abhishek
Singh, Loitongbam Gyanendro Singh, Sameer Singh, Karan Singla, Priyanka Sinha, Valentina
Sintsova, Sunayana Sitaram, Gabriel Skantze, Steve Skiena, Blaž Škrlj, Kevin Small,
Koenraad De Smedt, David Smith, Noah A. Smith, Eriks Sneiders, Felipe Soares, Livio Baldini
Soares, Artem Sokolov, Luca Soldaini, Aina Garí Soler, Katira Soleymanzadeh, Thamar
Solorio, Youngseo Son, Dezhao Song, Haoyu Song, Hyun-Je Song, Kai Song, Kaiqiang
Song, Linfeng Song, Ruihua Song, Sanghoun Song, Wei Song, Yan Song, Yangqiu Song,
Yiping Song, Rishi Sonthalia, Claudia Soria, Radu Soricut, Aitor Soroa, Alexey Sorokin,
Daniil Sorokin, José G. C. de Souza, Marlo Souza, Irena Spasic, Manuela Speranza, Matthias
Sperber, Evangelia Spiliopoulou, Andreas Spitz, Rachele Sprugnoli, Mukund Sridhar, Rohini
Srihari, Vivek Srikumar, Tejas Srinivasan, Ankit Srivastava, Shashank Srivastava, Edward
Stabler, Felix Stahlberg, Sanja Stajner, Ieva Staliūnaitė, Efstathios Stamatatos, Marija Stanojevic,
Gabriel Stanovsky, Katherine Stasaski, Shane Steinert-Threlkeld, Georg Stemmer,
Pontus Stenetorp, Elias Stengel-Eskin, Evgeny Stepanov, Ian Stewart, Giovanni Stilo, George
Stoica, Dario Stojanovski, Kevin Stowe, Veselin Stoyanov, Karl Stratos, Kristina Striegnitz,
Michael Strube, Jannik Strötgen, Will Styler, Sara Stymne, Dan Su, Jinsong Su, Keh-Yih
Su, Ming-Hsiang Su, Pei-Hao Su, Qinliang Su, Yixuan Su, Yu Su, Nishant Subramani,
Aparna Subramanian, Sandeep Subramanian, Sanjay Subramanian, Saku Sugawara, Hiroaki
Sugiyama, Alessandro Suglia, Yoshihiko Suhara, Alane Suhr, Dianbo Sui, Zhifang Sui,
Octavia-Maria Şulea, Elior Sulem, Md Arafat Sultan, Aixin Sun, Changzhi Sun, Fei Sun,
Haitian Sun, Jian Sun, Kai Sun, Le Sun, Ming Sun, Mingming Sun, Si Sun, Simeng Sun,
Siqi Sun, Weiwei Sun, Xiaobing Sun, Xu Sun, Yajing Sun, Yawei Sun, Yibo Sun, Yifan
Sun, Zequn Sun, Zhiqing Sun, Mujeen Sung, Monica Sunkara, Hanna Suominen, Anshuman
Suri, Mirac Suzgun, Hisami Suzuki, Jun Suzuki, Pedro Javier Ortiz Suárez, Sandesh Swamy,
Swabha Swayamdipta, Stan Szpakowicz, Ida Szubert, Felipe Sánchez-Martínez, Joan Andreu
Sánchez, Diarmuid Ó Séaghdha, Anders Søgaard,
Jeniya Tabassum, Ryuki Tachibana, Marie Tahon, Dima Taji, Ryuichi Takanobu, Sho Takase,
David Talbot, Aarne Talman, Ronen Tamari, George Tambouratzis, Aleš Tamchyna, Akihiro
Tamura, Chenhao Tan, Chuanqi Tan, Fei Tan, Jinghua Tan, Jiwei Tan, Liling Tan, Samson
Tan, Xu Tan, Buzhou Tang, Duyu Tang, Gongbo Tang, Hao Tang, Jiliang Tang, Jintao Tang,
Pingjie Tang, Qingming Tang, Shuai Tang, Siliang Tang, Xiangru Tang, Yi-Kun Tang, Zhiwen
Tang, Ludovic Tanguy, Xavier Tannier, Chongyang Tao, Fei Tao, Shiva Taslimipoor, Sandeep
Tata, Yuka Tateisi, Rachael Tatman, Michiaki Tatsubori, Marta Tatu, Andon Tchechmedjiev,
Christoph Teichmann, Selma Tekir, Serra Sinem Tekiroğlu, Eric Tellez, Ian Tenney, Silvia
Terragni, Joel Tetreault, Kapil Thadani, Khushboo Thaker, Urmish Thakker, Kilian Theil,
Ashok Thillaisundaram, Krishnaprasad Thirunarayan, Jesse Thomason, Brian Thompson,
Laure Thompson, Craig Thomson, Camilo Thorne, Yuanhe Tian, Zhiliang Tian, Jörg Tiedemann,
Christoph Tillmann, Swati Tiwari, Amalia Todirascu, Takenobu Tokunaga, Gabriele
Tolomei, Gaurav Singh Tomar, Nadi Tomeh, Nicholas Tomlin, Marc Tomlinson, Mariya
Toneva, Kentaro Torisawa, Marwan Torki, Tiago Timponi Torrent, Juan-Manuel Torres-
Moreno, María Inés Torres, Paolo Torroni, Shubham Toshniwal, Samia Touileb, Masashi
Toyoda, Amine Trabelsi, Quan Hung Tran, Trang Tran, David Traum, Dietrich Trautmann,
Marcos Treviso, Alina Trifan, Rocco Tripodi, Bayu Distiawan Trisedya, Harsh Trivedi,
Enrica Troiano, Chen-Tse Tsai, Adam Tsakalidis, Reut Tsarfaty, Bo-Hsiang Tseng, Masaaki
Tsuchida, Oren Tsur, Yoshimasa Tsuruoka, Yulia Tsvetkov, Kewei Tu, Lifu Tu, Zhaopeng
Tu, Dan Tufis, Iulia Turc, Marco Turchi, Ferhan Ture, Rory Turnbull, Martin Tutek, Elena
Tutubalina,
Rutuja Ubale, Ana Sabina Uban, Takuma Udagawa, Stefan Ultes, Bhargav Upadhyay, Zdenka
Uresova, Alfonso Ureña-López, Olga Uryupina, Dmitry Ustalov, Masao Utiyama,
Ravi Vadlapudi, Keyon Vafa, Ashwini Vaidya, Vincent Vandeghinste, Keith VanderLinden,
Lucy Vanderwende, David Vandyke, Natalia Vanetik, Eva Vanmassenhove, Andrea Vanzo,
Shikhar Vashishth, Siddharth Vashishtha, Oleg Vasilyev, Lucy Vasserman, Olga Vechtomova,
Luis Gerardo Mojica de la Vega, Julien Velcin, Erik Velldal, Giulia Venturi, Subhashini
Venugopalan, Suzan Verberne, Gaurav Verma, Rakesh Verma, Giorgos Vernikos, Yannick
Versley, Amir Pouran Ben Veyseh, Marta Vicente, Prashanth Vijayaraghavan, Anvesh Rao
Vijjini, David Vilar, David Vilares, Serena Villata, Esau Villatoro-Tello, Aline Villavicencio,
Anne Vilnat, Veronika Vincze, Sami Virpioja, Krishnapriya Vishnubhotla, Marco Viviani,
Andreas Vlachos, Duy Tin Vo, Ngoc Phuoc An Vo, Tatiana Vodolazova, Nikolai Vogler, Rob
Voigt, Soroush Vosoughi, Thuy Vu, Thuy-Trang Vu, Tu Vu, Ivan Vulić, Yogarshi Vyas,
Akifumi Wachi, Henning Wachsmuth, Takashi Wada, Joachim Wagner, Sabine Schulte
im Walde, Byron Wallace, Eric Wallace, Mengting Wan, Shengxian Wan, Xiaojun Wan,
Yao Wan, Yu Wan, Alex Wang, Bailin Wang, Baoxun Wang, Bin Wang, Bingqing Wang,
Boxin Wang, Chang Wang, Changhan Wang, Chao Wang, Cunxiang Wang, Daling Wang,
Danqing Wang, Di Wang, Fei Wang, Guangrun Wang, Guoyin Wang, Hai Wang, Han Wang,
Han Wang, Hanrui Wang, Hao Wang, Haohan Wang, Haoyu Wang, Heyuan Wang, Hong
Wang, Hongfei Wang, Hsin-Min Wang, Hua Wang, Jiaqi Wang, Jin Wang, Jingang Wang,
Jingjing Wang, Jingkang Wang, Jingwen Wang, Ke Wang, Kexiang Wang, Liang Wang, Lidan
Wang, Longyue Wang, Lu Wang, Lucy Lu Wang, Mengxiang Wang, Mingxuan Wang, Nan
Wang, Peifeng Wang, Pidong Wang, Ping Wang, Qiang Wang, Qin Wang, Qingyun Wang,
Quan Wang, Rui Wang, Rui Wang, Runze Wang, Shaojun Wang, Shi Wang, Shuai Wang,
Shuohang Wang, Tong Wang, Wei Wang, Wei Wang, Wen Wang, Wenbo Wang, Wenhui
Wang, Wenqi Wang, Wenxuan Wang, Wenya Wang, William Yang Wang, Xiaozhi Wang,
Xin Wang, Xinglong Wang, Xuezhi Wang, Yan Wang, Yaqing Wang, Yequan Wang, Yifei
Wang, Yizhong Wang, Yong Wang, Yue Wang, Yujing Wang, Zhen Wang, Zhenyi Wang,
Zhichun Wang, Zhiguang Wang, Zhiguo Wang, Zhiqiang Wang, Zhongqing Wang, Zijian
Wang, Ziqi Wang, Zirui Wang, Artit Wangperawong, Leo Wanner, Nigel Ward, Alex Warstadt,
Christian Wartena, Zeerak Waseem, Koki Washio, Moshe Wasserblat, Shinji Watanabe, Taro
Watanabe, Bonnie Webber, Ingmar Weber, Leon Weber, Noah Weber, Kellie Webster, Jürgen
Wedekind, Furu Wei, Jason Wei, Junqiu Wei, Penghui Wei, Wei Wei, Xiaochi Wei, Wang
Weiran, Gail Weiss, Charles Welch, Orion Weller, Simon Wells, Haoyang Wen, Lijie Wen,
Tsung-Hsien Wen, Peter West, Matthijs Westera, Michael White, Richard Wicentowski,
Michael Wiegand, John Wieting, Gijs Wijnholds, Ethan Wilcox, Rodrigo Wilkens, Adina
Williams, Jake Williams, Jason D Williams, Jennifer Williams, Steven Wilson, Shuly Wintner,
Sam Wiseman, Dawid Wisniewski, Guillaume Wisniewski, Tomer Wolfson, Marcin Woliński,
Derek F. Wong, Ka Ho Wong, Tak-Lam Wong, Dina Wonsever, Zach Wood-Doughty, Alina
Wróblewska, Bowen Wu, Changxing Wu, Chien-Sheng Wu, Fangzhao Wu, Junshuang Wu,
Ledell Wu, Lijun Wu, Lingfei Wu, Shih-Hung Wu, Shijie Wu, Tongshuang Wu, Wei Wu,
Xianchao Wu, Xixin Wu, Yen-Chen Wu, Youzheng Wu, Yu Wu, Yuanbin Wu, Yuexin Wu,
Yuting Wu, Yuxiang Wu, Zeqiu Wu, Zhanghao Wu, Zhen Wu, Zhiyong Wu, Joern Wuebker,
Christian Wurm,
Congying Xia, Fei Xia, Jingbo Xia, Mengzhou Xia, Patrick Xia, Qingrong Xia, Rui Xia,
Yingce Xia, Yikun Xian, Chaojun Xiao, Huiru Xiao, Lin Xiao, Tong Xiao, Wen Xiao, Xinyan
Xiao, Yanghua Xiao, Boyi Xie, Jun Xie, Lei Xie, Qianqian Xie, Ruobing Xie, Bowen Xing,
Chen Xing, Frank Xing, Chao Xiong, Hao Xiong, Hongyu Xiong, Wenhan Xiong, Benfeng
Xu, Boyan Xu, Can Xu, Chang Xu, Chen Xu, Chenchen Xu, Frank F. Xu, Guandong Xu,
Hongfei Xu, Jiacheng Xu, Jinan Xu, Jingjing Xu, Jun Xu, Lei Xu, Lu Xu, Mingzhou Xu,
Peng Xu, Qiongkai Xu, Wei Xu, Weiran Xu, Wenduan Xu, Xinnuo Xu, Yan Xu, Yang Xu,
Yumo Xu, Yunqiu Xu, Zenglin Xu, Zhen Xu, Huichao Xue, Nianwen Xue,
Mohit Yadav, Shweta Yadav, Yadollah Yaghoobzadeh, Mohamed Yahya, Ikuya Yamada,
Ivan Yamshchikov, Jun Yan, Lingyong Yan, Ming Yan, Rui Yan, Yu Yan, Zhao Yan, Baosong
Yang, Bishan Yang, Chenghao Yang, Diyi Yang, Haiqin Yang, Jaewon Yang, Jie Yang, Jun
Yang, Junjie Yang, Liner Yang, Linyi Yang, Liu Yang, Min Yang, Muyun Yang, Nan Yang,
Qian Yang, Sen Yang, Tsung-Yen Yang, Wei Yang, Weiwei Yang, Wenmian Yang, Yaqin
Yang, Yazheng Yang, Yiben Yang, Yilin Yang, Zhichao Yang, Zixiaofan Yang, Ziyi Yang,
Tae Yano, He Yanqing, Huaxiu Yao, Jin-Ge Yao, Liang Yao, Wenlin Yao, Yiqun Yao, Mark
Yatskar, Semih Yavuz, Deming Ye, Hai Ye, Qinyuan Ye, Xiaoyuan Yi, Wen-wai Yim, Seid
Muhie Yimam, Da Yin, Haiyan Yin, Qingyu Yin, Wenpeng Yin, Xuwang Yin, Yichun Yin,
Anssi Yli-Jyrä, Michael Yoder, Dani Yogatama, Sho Yokoi, Zheng Xin Yong, Seunghyun
Yoon, Masashi Yoshikawa, Naoki Yoshinaga, Koichiro Yoshino, Steve Young, Bei Yu, Bowen
Yu, Changlong Yu, Chen Yu, Dian Yu, Dian Yu, Dong Yu, Heng Yu, Hong Yu, Jianfei Yu,
Jifan Yu, Juntao Yu, Kai Yu, Licheng Yu, Mo Yu, Ping Yu, Seunghak Yu, Tao Yu, Wenhao
Yu, Wenmeng Yu, Xiaodong Yu, Zhou Yu, Caixia Yuan, Jianhua Yuan, Nicholas Jing Yuan,
Xingdi Yuan, Zheng Yuan, François Yvon,
Menno van Zaanen, Wajdi Zaghouani, Farooq Zaman, Mohammadzaman Zamani,
Marcos Zampieri, Yuan Zang, Fabio Massimo Zanzotto, Alessandra Zarcone, Gian Piero Zarri,
Sina Zarrieß, Vicky Zayats, Omnia Zayed, Rabih Zbib, Albin Zehe, Amir Zeldes, Rowan
Zellers, Yury Zemlyanskiy, Daojian Zeng, Jiali Zeng, Weixin Zeng, Xiangrong Zeng,
Xingshan Zeng, Zhaohao Zeng, Deniz Zeyrek, Hanwen Zha, Sheng Zha, Fangzhou Zhai, Shuang
(Sophie) Zhai, Yuming Zhai, Biao Zhang, Boliang Zhang, Bowen Zhang, Bowen Zhang,
Chao Zhang, Chenbin Zhang, Chenwei Zhang, Chuheng Zhang, Dong Zhang, Dongxu Zhang,
Dongyu Zhang, Hainan Zhang, Hao Zhang, Haoyu Zhang, Hongming Zhang, Huijun Zhang,
Jiajun Zhang, Jianguo Zhang, Jinchao Zhang, Jingqing Zhang, Jipeng Zhang, Ke Zhang, Kun
Zhang, Kunpeng Zhang, Lei Zhang, Licheng Zhang, Longtu Zhang, Meishan Zhang, Meng
Zhang, Michael Zhang, Min Zhang, Ningyu Zhang, Qi Zhang, Richong Zhang, Rui Zhang,
Ruiyi Zhang, Ruqing Zhang, Shaohua Zhang, Sheng Zhang, Shujian Zhang, Shuo Zhang,
Tongtao Zhang, Wei Emma Zhang, Wei Zhang, Wei-Nan Zhang, Weiwei Zhang, Wen Zhang,
Xiang Zhang, Xiang Zhang, Xiangliang Zhang, Xiao Zhang, Xiaotong Zhang, Xiaoying
Zhang, Xingxing Zhang, Xinsong Zhang, Xinyuan Zhang, Xuanwei Zhang, Xuanyu Zhang,
Xuchao Zhang, Yi Zhang, Yi Zhang, Yi Zhang, Yichi Zhang, Yifan Zhang, Yizhe Zhang, Yu
Zhang, Yuan Zhang, Yuan Zhang, Yue Zhang, Yunyi Zhang, Yuqi Zhang, Yuyu Zhang, Zequn
Zhang, Zeyu Zhang, Zhe Zhang, Zheng Zhang, Zhirui Zhang, Zhisong Zhang, Zhuosheng
Zhang, Chao Zhao, Chen Zhao, Dongyan Zhao, Fei Zhao, Guangxiang Zhao, Jie Zhao,
Jieyu Zhao, Jieyu Zhao, Jun Zhao, Kai Zhao, Lujun Zhao, Mengjie Zhao, Sanqiang Zhao,
Tiancheng Zhao, Tianyu Zhao, Tiejun Zhao, Wei Zhao, Yang Zhao, Yanpeng Zhao, Yanyan
Zhao, Yao Zhao, Yinggong Zhao, Zhou Zhao, Baigong Zheng, Bo Zheng, Changmeng Zheng,
Lin Zheng, Renjie Zheng, Xin Zheng, Yinhe Zheng, Ming Zhong, Peixiang Zhong, Victor
Zhong, Zexuan Zhong, Ben Zhou, Chunting Zhou, Dong Zhou, Ganbin Zhou, Giulio Zhou,
Guangyou Zhou, Hao Zhou, Jiawei Zhou, Jie Zhou, Jingbo Zhou, Junpei Zhou, Junru Zhou,
Junsheng Zhou, Junwei Zhou, Li Zhou, Long Zhou, Mantong Zhou, Pei Zhou, Qiji Zhou,
Qingyu Zhou, Shuchang Zhou, Shuyan Zhou, Wangchunshu Zhou, Wenxuan Zhou, Xiang
Zhou, Xiangyang Zhou, Xuhui Zhou, Yichao Zhou, Yilun Zhou, Zhengyu Zhou, Zhihan
Zhou, Zhong Zhou, Dawei Zhu, Haichao Zhu, Henghui Zhu, Jia Zhu, Jinhua Zhu, Junnan
Zhu, Kenny Zhu, Ligeng Zhu, Muhua Zhu, Pengfei Zhu, Su Zhu, Wanzheng Zhu, Wei Zhu,
Xiaodan Zhu, Xiaofeng Zhu, Zining Zhu, Fuzhen Zhuang, Honglei Zhuang, Yimeng Zhuang,
Yuan Zhuang, Leonardo Zilio, Roger Zimmermann, Heike Zinsmeister, Ayah Zirikly, Imed
Zitouni, Ran Zmigrod, Michael Zock, Shi Zong, Markus Zopf, Bowei Zou, Yanyan Zou,
Amal Zouaq, Arkaitz Zubiaga, Frederike Zufall.
Secondary Reviewers:
Salah Ait-Mokthar, Eunice Akani, Zainab Albujasim, Nada Aldarrab, Sherlon Almeida,
Chantal Amrhein, Nikolay Arefyev, Siddhant Arora,
Pablo Badilla, Jorge Balazs, Hubert Baniecki, Hongchang Bao, Liao Baohao, Loïc Barrault,
Anton Belyy, Nathan Berger, Aditya Bhargava, Shaily Bhatt, Nikita Bhutani, Yonatan
Bitton, Rexhina Blloshmi, Janos Borst, Fabienne Braune, Max Bryan, Ana-Maria Bucur,
Wray Buntine, Kim Bürgl,
Hongjie Cai, Jiangxia Cao, Rémi Cardon, Steffen Castle, Sophia Chan, Piyush Chawla,
Siva Uday Sampreeth Chebolu, Fumian Chen, Zitong Cheng, Donghee Choi, Eric Corlett,
Jamell Dacon, Leonard Dahlmann, Yinpei Dai, Dhairya Dalal, Maxime D. Armstrong,
Souvik Das, Loic De Langhe, Johannes Deleu, Marco Del Treidici, Lorenzo De Mattei,
Maureen de Seyssel, Anurag Deshmukh, Nina Dethlefs, Hannah Devinney, Juglar Diaz, Bayu
Distiawan Trisedya, Suman Dowlagar, Rotem Dror, Andrew Drozov, Nan Duan,
Liana Ermakova,
Marzieh Fadaee, Joachim Fainberg, Nils Feldhus, Katy Felkner, Andrew Finch, Clémentine
Fourrier,
Xiubo Geng, Efthymios Georgiou, Iacopo Ghinassi, Behrooz Ghorbani, Christian Gollan,
Ming Gong, Alicja Gosiewska, Tamas Grosz, Yu Gu, Shu Guo, Ashim Gupta,
Marius Hamacher, Kijong Han, Bradley Hauer, Hangfeng He, Michael Heck, Felix Helfer,
Nils Holzenberger, Weiwei Hou, Weronika Hryniewska, Zechuan Hu, Xinting Huang, Yeh
Hui-Syuan, Yongkeun Hwang,
Radu Iacob, Nikolai Ilinykh,
Gilles Jacobs, Aman Jaiswal, Anubhav Jangra, Minbyul Jeong, Ryan J. Hubbard,
Jianshu Ji, Qi Jia, Hao Jiang, Bernal Jimenez Gutierrez, Arne Jönsson,
Tai-lin Karidi, Hemant Kathania, Divyansh Kaushik, Gangwoo Kim, Guillaume Klein,
Mateusz Klimaszewski, Xenia Klinge, Ryosuke Kohita, Michael Kozielski, Akshay Krishna
Sheshadri, Shachi H. Kumar, Yaman Kumar, Nicholas Kuo, Kemal Kurniawan, Heeyoung
Kwak,
Philippe Laban, Samuel Larkin, Hung-yi Lee, Juho Leinonen, Gael Lejeune, Bai Li, Jinggui
Liang, Yaqing Liao, Ruogu Lin, Alisa Liu, Kaiji Lu,
Danni Ma, Avinash Madasu, Arnob Mallik, Ramesh Manuvinakurike, Chengsheng Mao,
Mounika Marreddy, Federico Martelli, Taha Masood, Diego Maupomé, Matt McNeill, Laiba
Mehnaz, Alessio Miaschi, Alice Millour, Flor Miriam Plaza del Arco, Ishani Mondol, Víctor
M. Sánchez-Cartagena, Philipp Müller, Deepak Muralidharan, Toshiki Muromachi,
Kouta Nakayama, Yatin Nandwani, Sara Ng, Dan Nguyen,
Mayumi Ohta, Eda Okur, Siru Ouyang, Nadav Oved, Nanami Ozawa,
Vardaan Pahuja, Margherita Pallottino, Jiaxin Pan, Subhadarshi Panda, Jianhui Pang, Andrea
Papaluca, Nivranshu Pasricha, Archita Pathak, Chen (Patrick) Pei, Jiahuan Pei, Qianqian
Peng, MinhQuang Pham, Joan Plepi, Luigi Procopio,
Weizhen Qi, Yi Qin,
Dheeraj Rajagopal, Alan Ramponi, Fanny Rancourt, Danial Raza, Evelina Rennes, Matías
Rojas, Alexis Ross, Aku Rouhe, Hossein Rouhizadeh, Cao Rui,
Sougata Saha, Naveen Saini, Flora Sakketou, Tanja Samardzic, Brenda Santana, Twisampati
Sarkar, Shiki Sato, Shigehiko Schamoni, Lena Schiffer, Elad Segal, Sina Semnani, Sandaru
Seneviratne, Hendra Setiawan, Kyle Shaffer, Sanket Shah, Jiawei Sheng, Jiatong Shi, Linjun
Shou, Keshav Singh, Gabriella Skitalinskaya, Nikita Soni, Anna Sotnikova, Olga Sozinova,
Anirudh Srinivasan, Tomasz Stanislawek, Kevin Stier, Peng Su, Shivashankar Subramanian,
Yanming Sun, Shahbaz Syed,
Mohsen Tabasy, Ryo Takasu, Duyu Tang, Marc Tanti, Maksym Taranukhin, Xanh Thi
Ho, Evgeniia Tokarchuk, Thanh Tran, Yang Trista Cao, Henry Tsai, An Tuan Dao,
Clara Vania, Benjamin van Niekerk, Suzan Verberne, Huy Vu,
Manya Wadhwa, AbdelRahman Wael, Cheng Wang, Sabine Weber, Cyril Weerasoriya,
Andreas Weise, Zhihua Wen, Taesun Whang, Katarzyna Woźnica, Liangqing Wu,
Xiaolin Xia, Yuqing Xie, Benfeng Xu,
Brian Yan, Jenny Yang, Xinzhi Yao, Yongjing Yin, Zheng-Xin Yong, Jaehyo Yoo, Ori
Yoran, Bowen Yu, Weizhe Yuan,
Frank D. Zamora-Reina, Najam Zaidi, Klim Zaporojets, Shuxi Zeng, Thomas Zenkel,
Runzhe Zhan, Chen Zhang, Jinman Zhao, Houquan Zhou, Zining Zhu, Franziska Zimmermann,
Elaine Zosa, Jie Zou, Xinxing Zu.
We would like to recognize the following Outstanding Reviewers:
Rami Al-Rfou, Carl Allen, Mark Anderson, Stefanos Angelidis, Jean-Yves Antoine, Leila
Arras,
Rohit Babar, Hritik Bansal, Su Lin Blodgett, Valts Blukis, Nadjet Bouayad-Agha, Arthur
Bražinskas, Michael Bugert,
Vittorio Castelli, Hou Pong Chan, Fenia Christopoulou, Elizabeth Clark, Kevin Clark, Vincent
Claveau, Anna Currey,
Hal Daume III, Forrest Davis, Steve DeNeefe, Daniel Deutsch, Sunipa Dev, Joseph P. Dexter,
Pablo Duboue, Philip Dufter, Ondřej Dušek, Rory Duthie, Nouha Dziri,
Alexander Fabbri, Agnieszka Falenska, Sergey Feldman, Daniel Fernandez-Gonzalez,
Anjalie Field, Margret Fleck, Michael Flor, Maxwell Forbes, Thomas Francois, Daniel Fried,
Zhenxin Fu,
Matteo Gabburo, Yang Gao, Siddhant Garg, Aina Garí Soler, Marcos Goncalves, Jana
Götze, Bruno Guillaume,
Xiaochuang Han, Peter Hase, Hiroaki Hayashi, Devamanyu Hazarika, Jack Hessel, Tsutomu
Hirao, Ari Holtzman, Xuanjing Huang,
Gabriel Ilharco,
Gilles Jacobs, Alon Jacovi, Sarthak Jain, Nanjiang Jiang, Anders Johanssen,
Jaap Kamps, Siddharth Karamcheti, Brendan Kennedy, Jihyuk Kim, Byeongchang Kim,
Nikita Kitaev, Hayato Kobayashi, Noriyuki Kojima, Seth Kulick, Sawan Kumar, Adhiguna
Kuncoro,
Jake Lever, Yaoyiran Li, Jindřich Libovický, Fangyu Liu,
Wei-Yun Ma, Adyasha Maharana, Alexander Mehler, Sabrina J. Mielke, Evangelios Milios,
Sewon Min, Jeff Mitchell,
Matan Orbach, Jessica Ouyang,
Aishwarya Padmakumar, Bhargavi Paranjape, Letitia Parcalabescu, Carla Parra Escartín,
Viviana Patti, Karl Pichotta, Tiago Pimentel, Lahari Poddar, Rajkumar Pujari,
Xiaojun Quan,
Shuhuai Ren, Philip Resnik, Gil Rocha,
Sylvie Saget, Victor Sanh, Timo Schick, Tyler Schnoebelen, Roy Schwartz, Abigail See, Rico
Sennrich, Peter Shaw, Qinlan Shen, Tianze Shi, Valentina Sintsova, Wei Song, Youngseo
Song, Andreas Spitz, Yoshihiko Suhara, Alane Suhr,
Ronen Tamari, Yuanhe Tian,
Rob van der Goot, Emiel van Miltenburg, Rik van Noord, Lucy Vanderwende, David Vilares,
Alex Wang, Zijian Wang, Zhen Wang, Alex Warstadt, Gail Weiss, Alina Wróblewska,
Jorn Wuebker,
Jiacheng Xu,
Michael Yoder, Naoki Yoshinaga, Steve Young, Dian Yu,
Wei Zhang, Zeyu Zhang, Dong Zhou, Ran Zmigrod, Markus Zopf.
Ethics Advisory Committee Reviewers:
Jade Abbott, Adewale Akinfaderin, Nora Al-Twairesh, Laura Alonso Alemany, David
Alvarez-Melis, Maxime Amblard, Jean-Yves Antoine,
Timothy Baldwin, Kathy Baxter, Steven Bedrick, Luciana Benotti, Steven Bird, Claudia Borg,
Jamie Brandon,
Kai-Wei Chang, Luis Chiruzzo, Marta R. Costa-jussà,
Guy Emerson,
Albert Gatt, Vasundhara Gautam, Dimitra Gkatzia, Sharon Goldwater, Alvin Grissom II,
Jack Hessel,
Shafiq Joty,
Anne Lauscher, Haley Lepp,
Nitin Madnani, Emiel van Miltenburg,
Aurélie Névéol, Nguyen Thi Minh Huyen,
José Ochoa-Luna,
Viviana Patti, Ted Pedersen,
Gabriela Ramírez-de-la-Rosa, Marta Recasens,
Tatjana Scheffler, Kathleen Siminyu,
Samson Tan, Rachael Tatman,
Esaú Villatoro Tello, Aline Villavicencio,
Kellie Webster, Richard Wicentowski,
Jingbo Xia.
Keynote Talk: Advancing Technological Equity in Speech and
Language Processing
Helen Meng
The Chinese University of Hong Kong (CUHK)
Abstract: Accelerating advances in AI and deep neural networks have powered the proliferation of
speech and language technologies in applications such as virtual assistants, smart speakers, reading
machines, etc. The technologies have performed impressively well, achieving human parity in speech
recognition accuracies and speech synthesis naturalness. As these technologies continue to permeate
our daily lives, they need to support diverse users and usage contexts with inputs that deviate from the
mainstream. Examples include non-native speakers, code-switching, speech carrying myriad emotions
and styles, and speakers with impairments and disorders. Under such contexts, existing technologies
often suffer performance degradations and fail to fulfill the needs of the users. The crux of the problem
lies in data scarcity and data sparsity, which are exacerbated by high data variability.
This talk presents an overview of some of the approaches we have used to address the challenges of data
shortage, positioned at various stages along the processing pipeline. They include: data augmentation
based on speech signal perturbations, use of pre-trained representations, learning speech representation
disentanglement, knowledge distillation architectures, meta-learned model re-initialization, as well as
adversarially trained models. The effectiveness of these approaches is demonstrated through a variety
of applications, including accented speech recognition, dysarthric speech recognition, code-switched
speech synthesis, disordered speech reconstruction, one-shot voice conversion and exemplar-based
emotive speech synthesis. These efforts strive to develop speech and language technologies that can
gracefully adapt and accommodate a diversity of user needs and usage contexts, in order to achieve
technological equity in our society.
Bio: Helen Meng is Patrick Huen Wing Ming Professor of Systems Engineering and Engineering
Management at The Chinese University of Hong Kong (CUHK). Her research interests include
speech and language technologies to support multilingual and multimodal human-computer interactions,
eLearning and assistive technologies, as well as big data decision analytics using AI. She leads the
interdisciplinary research team that received the first Theme-based Research Scheme Project in Artificial
Intelligence in 2019 from the Hong Kong SAR Government’s Research Grants Council. She is Chair of
the Curriculum Development in the CUHK-JC AI4Future Project, which has developed the courseware
for pre-tertiary AI education being taught in a growing number of participating secondary schools across
Hong Kong.
Helen received all her degrees from MIT. She is the Founding Director of the CUHK Ministry of
Education (MoE)-Microsoft Key Laboratory for Human-Centric Computing and Interface Technologies
(since 2005), Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems
(since 2006), and Stanley Ho Big Data Decision Analytics Research Center (since 2013). Previously, she
served as CUHK Faculty of Engineering’s Associate Dean (Research), Chairman of the Department
of Systems Engineering and Engineering Management, Editor-in-Chief of the IEEE Transactions on
Audio, Speech and Language Processing, Member of the IEEE Signal Processing Society Board of
Governors, and ISCA Board Member. She is presently a member of the IEEE SPS Awards Board and the ISCA
International Advisory Council. She was elected APSIPA’s inaugural Distinguished Lecturer 2012-2013
and ISCA Distinguished Lecturer 2015-2016. Her awards include the Ministry of Education
Higher Education Outstanding Scientific Research Output Award 2009, Microsoft Research Outstanding
Collaborator Award 2016 (1 in 32 worldwide), IBM Faculty Award 2016, HKPWE Outstanding Women
Professionals and Entrepreneurs Award 2017 (1 in 20 since 1999), Hong Kong ICT Silver Award 2018
in Smart Inclusion, 2019 IEEE SPS Leo L. Beranek Meritorious Service Award and various best paper
awards. Helen has served in a number of government appointments, which include memberships in the
Steering Committee of Hong Kong’s Electronic Health Record Sharing, Social Welfare Department’s
Joint Committee on Information Technology for the Social Welfare Sector and Advisory Committee on
financing social welfare services. She is also a member of the AI4SDGs AI for Children Working Group.
Helen is a Fellow of IEEE, ISCA, HKIE and HKCS.
Keynote Talk: Learning and Processing Language from Wearables:
Opportunities and Challenges
Alejandrina Cristia
Laboratoire de Sciences Cognitives et de Psycholinguistique,
Département d’études cognitives, ENS, EHESS, CNRS, PSL University
Abstract: Recent years have seen tremendous improvement in the ease with which we can collect
naturalistic language samples via devices worn over long periods of time. These allow unprecedented
access to ego-centered experiences in language perceived and produced, including by young children.
For example, in a newly-formed consortium, we pulled together over 40k hours of audio, collected from
1,001 children growing up in industrialized or hunter-horticulturalist populations, located in one of 12
countries. Such data are interesting for many purposes, including as 1. fodder for unsupervised language
learning models aimed at mimicking what the child does; 2. indices of early language development
that can be used to assess the impact of behavioral and pharmacological interventions; and 3. samples
of the natural use of language(s) in low-resource and multilingual settings. The technology for carving
out interesting information from these large datasets, however, is lagging behind – but this may
not be such a bad thing after all, since the ethical, technical, and legal handling of such data also needs
some work to increase the chances that the net impact of research based on this technique is positive.
In this talk, I draw from cutting-edge research building on long-form recordings from wearables and a
framework for doing the most good we can (effective altruism) to highlight surprising findings in early
language acquisition, and delineate key priorities for future work.
Bio: Alejandrina Cristia is a senior researcher at the Centre National de la Recherche Scientifique
(CNRS), leader of the Language Acquisition Across Cultures team, and director of the Laboratoire
de Sciences Cognitives et Psycholinguistique (LSCP) cohosted by the Ecole Normale Supérieure,
EHESS, and PSL. In 2021, she is an invited researcher in the Foundations of Learning Program
of the Abdul Latif Jameel Poverty Action Lab (J-PAL), and a guest researcher at the Max Planck
Institute for Evolutionary Anthropology. Her long-term aim is to answer the following questions:
What are the linguistic representations that infants and adults have? Why and how are they formed?
How may learnability biases shape the world’s languages? To answer these questions, she combines
multiple methodologies including spoken corpora analyses, behavioral studies, neuroimaging (NIRS),
and computational modeling. This interdisciplinary approach has resulted in over 100 publications in
psychology, linguistics, and development journals as well as IEEE and similar conferences. With an
interest in cumulative, collaborative, and transparent science, she contributed to the creation of the
first meta-meta-analysis platform (metalab.stanford.edu) and several international networks, including
saliently the LangVIEW consortium that is leading /L+/, the first truly global summer/winter school
on language acquisition.1 She received the 2017 John S. McDonnell Scholar Award in Understanding
Human Cognition, the 2020 Médaille de Bronze CNRS Section Linguistique, and an ERC Consolidator
Award (2021-2026) for the ExELang2 project.
1 https://www.dpss.unipd.it/summer-school-2021/home
2 exelang.fr
Keynote Talk: Reliable Characterizations of NLP Systems
as a Social Responsibility
Christopher Potts
Stanford University
Abstract: This is an incredible moment for NLP. We all routinely work with models whose capabilities
would have seemed like science fiction just two decades ago, powerful organizations eagerly await our
latest results, and NLP technologies are playing an increasingly large role in shaping our society. As
a result, all of us in the NLP community are likely to participate in research that will contribute (to
varying degrees and perhaps only indirectly) to technologies that will impact many people’s lives, with
both positive and negative consequences – for example, technologies that broaden accessibility, enhance
creative self-expression, heighten surveillance, and create propaganda. What can we do to fulfill the
social responsibility that this brings? As a (very) partial answer to this question, I will review a number
of important recent developments, spanning many research groups, concerning dataset creation, model
introspection, and system assessment. Taken together, these ideas can help us more reliably characterize
how NLP systems will behave, and more reliably communicate this information to a wider range of
potential users. In this way, they can help us meet our obligations to the people whose lives are impacted
by the results of our research.
Bio: Christopher Potts is Professor and Chair of Linguistics and Professor (by courtesy) of Computer
Science at Stanford, and a faculty member in the Stanford NLP Group and the Stanford AI Lab. His
group uses computational methods to explore how emotion is expressed in language and how linguistic
production and interpretation are influenced by the context of utterance. This research combines methods
from linguistics, cognitive psychology, and computer science, in the service of both scientific discovery
and technology development. He was previously Chief Scientist at Roam Analytics, a start-up focused
on applying NLP in healthcare and the life sciences (now Parexel AI Labs). He is a long-time Action
Editor at TACL, a frequent Area Chair at ACL conferences, and currently an Ethics Committee co-chair
for EMNLP 2021.
Table of Contents
Investigating label suggestions for opinion mining in German Covid-19 social media
Tilman Beck, Ji-Ung Lee, Christina Viehmann, Marcus Maurer, Oliver Quiring and Iryna Gurevych . . . . . . . . . . . . . . . . . . . . 1
How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements
Chen Shani, Nadav Borenstein and Dafna Shahaf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Engage the Public: Poll Question Generation for Social Media Posts
Zexin Lu, Keyang Ding, Yuji Zhang, Jing Li, Baolin Peng and Lemao Liu . . . . . . . . . . . . . . . . . . . . 29
HateCheck: Functional Tests for Hate Speech Detection Models

Paul Röttger, Bertie Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts and Janet Pierrehumbert . . . . . . . . . . . . . . . . . . . . 41
Unified Dual-view Cognitive Model for Interpretable Claim Verification

Lianwei Wu, Yuan Rao, Yuqian Lan, Ling Sun and Zhaoyin Qi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling

Lanqing Xue, Kaitao Song, Duocai Wu, Xu Tan, Nevin L. Zhang, Tao Qin, Wei-Qiang Zhang and
Tie-Yan Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
PENS: A Dataset and Generic Framework for Personalized News Headline Generation
Xiang Ao, Xiting Wang, Ling Luo, Ying Qiao, Qing He and Xing Xie . . . . . . . . . . . . . . . . . . . . . . . 82
Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer
Normalization
Dongkyu Lee, Zhiliang Tian, Lanqing Xue and Nevin L. Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
Mention Flags (MF): Constraining Transformer-based Text Generators

Yufei Wang, Ian Wood, Stephen Wan, Mark Dras and Mark Johnson . . . . . . . . . . . . . . . . . . . . . . . . 103
Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation

Giulio Zhou and Gerasimos Lampouras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances
Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng and Jie Zhou . . . . . . . . . . . . . . . . . . . . . . . . 128
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo, Kai Shuang, Jijie Li and Zihan Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
Transferable Dialogue Systems and User Simulators

Bo-Hsiang Tseng, Yinpei Dai, Florian Kreyssig and Bill Byrne . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data
Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang and Ting Liu . . . . . . . . . . . . . . . . . . . . . . 167
GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling
Libo Qin, Fuxuan Wei, Tianbao Xie, Xiao Xu, Wanxiang Che and Ting Liu . . . . . . . . . . . . . . . . . 178
Accelerating BERT Inference for Sequence Labeling via Early-Exit

Xiaonan Li, Yunfan Shao, Tianxiang Sun, Hang Yan, Xipeng Qiu and Xuanjing Huang . . . . . . . 189
Modularized Interaction Network for Named Entity Recognition
Fei Li, Zheng Wang, Siu Cheung Hui, Lejian Liao, Dandan Song, Jing Xu, Guoxiu He and Meihuizi Jia . . . . . . . . . . . . . . . . . . . . 200
Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder

Xi Xiangyu, Wei Ye, Shikun Zhang, Quanxiu Wang, Huixing Jiang and Wei Wu . . . . . . . . . . . . . 210
UniRE: A Unified Label Space for Entity Relation Extraction

Yijun Wang, Changzhi Sun, Yuanbin Wu, Hao Zhou, Lei Li and Junchi Yan . . . . . . . . . . . . . . . . . 220
Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction
Li Cui, Deqing Yang, Jiaxin Yu, Chengwei Hu, Jiayang Cheng, Jingjie Yi and Yanghua Xiao . . 232
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Xiao Pan, Mingxuan Wang, Liwei Wu and Lei Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 244
Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation
Mathias Müller and Rico Sennrich . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259
Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation

Hongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong and Meng Zhang . . . . . . . . . . . . . . . . . . 273
A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment

Jingyi Zhang and Josef van Genabith . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283
Learning Language Specific Sub-network for Multilingual Machine Translation

Zehui Lin, Liwei Wu, Mingxuan Wang and Lei Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 293
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis

Linyi Yang, Jiazheng Li, Padraig Cunningham, Yue Zhang, Barry Smyth and Ruihai Dong . . . . 306
Bridge-Based Active Domain Adaptation for Aspect Term Extraction

Zhuang Chen and Tieyun Qian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317
Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks

Xiaocui Yang, Shi Feng, Yifei Zhang and Daling Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 328
Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions

Hongjie Cai, Rui Xia and Jianfei Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
PASS: Perturb-and-Select Summarizer for Product Reviews

Nadav Oved and Ran Levy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351
Deep Differential Amplifier for Extractive Summarization

Ruipeng Jia, Yanan Cao, Fang Fang, Yuchen Zhou, Zheng Fang, Yanbing Liu and Shi Wang . . 366
Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by Generating Multiple
Summaries
Yi Yu, Adam Jatowt, Antoine Doucet, Kazunari Sugiyama and Masatoshi Yoshikawa . . . . . . . . . 377
Self-Supervised Multimodal Opinion Summarization

Jinbae Im, Moonki Kim, Hoyeop Lee, Hyunsouk Cho and Sehee Chung . . . . . . . . . . . . . . . . . . . . 388
A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance
and Self-referenced Redundancy
Wang Chen, Piji Li and Irwin King . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 404
DESCGEN: A Distantly Supervised Dataset for Generating Entity Descriptions
Weijia Shi, Mandar Joshi and Luke Zettlemoyer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Introducing Orthogonal Constraint in Structural Probes

Tomasz Limisiewicz and David Mareček . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 428
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger

Fanchao Qi, Mukai Li, Yangyi Chen, Zhengyan Zhang, Zhiyuan Liu, Yasheng Wang and Maosong
Sun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 443
Examining the Inductive Bias of Neural Language Models with Artificial Languages
Jennifer C. White and Ryan Cotterell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 454
Explaining Contextualization in Language Models using Visual Analytics

Rita Sevastjanova, Aikaterini-Lida Kalouli, Christin Beck, Hanna Schäfer and Mennatallah El-Assady . . . . . . . . . . . . . . . . . . . . 464
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification
George Chrysostomou and Nikolaos Aletras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem

Raphael Schumann and Stefan Riezler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 489
E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao and Fei Huang . . . . . . . . . . . . . . . . . . . . 503
Learning Relation Alignment for Calibrated Cross-modal Retrieval

Shuhuai Ren, Junyang Lin, Guangxiang Zhao, Rui Men, An Yang, Jingren Zhou, Xu Sun and
Hongxia Yang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 514
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation

Yiran Xing, Zai Shi, Zhao Meng, Gerhard Lakemeyer, Yunpu Ma and Roger Wattenhofer . . . . . 525
Cascaded Head-colliding Attention

Lin Zheng, Zhiyong Wu and Lingpeng Kong . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 536
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor

Xinyu Wang, Yong Jiang, Zhaohui Yan, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang,
Fei Huang and Kewei Tu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 550
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani and James Henderson . . . . . . . . 565
COSY: COunterfactual SYntax for Cross-Lingual Understanding

Sicheng Yu, Hao Zhang, Yulei Niu, Qianru Sun and Jing Jiang . . . . . . . . . . . . . . . . . . . . . . . . . . 577
OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification

Seonghyeon Lee, Dongha Lee and Hwanjo Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 590
Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Kathleen C. Fraser, Isar Nejadgholi and Svetlana Kiritchenko . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 600
Structurizing Misinformation Stories via Rationalizing Fact-Checks

Shan Jiang and Christo Wilson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 617
Modeling Language Usage and Listener Engagement in Podcasts
Sravana Reddy, Mariya Lazarova, Yongze Yu and Rosie Jones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 632
Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions

Saumya Sahai, Oana Balalau and Roxana Horincar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 644
SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues

Liang Qiu, Yuan Liang, Yizhou Zhao, Pan Lu, Baolin Peng, Zhou Yu, Ying Nian Wu and Song-Chun Zhu . . . . . . . . . . . . . . . . . . . . 658
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems

Bill Byrne, Karthik Krishnamoorthi, Saravanan Ganesh and Mihir Kale . . . . . . . . . . . . . . . . . . . . . 671
Improving Dialog Systems for Negotiation with Personality Modeling

Runzhe Yang, Jingxiao Chen and Karthik Narasimhan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 681
Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial
Training
Wangchunshu Zhou, Qifei Li and Chenle Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 694
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features

Hannah Rashkin, David Reitter, Gaurav Singh Tomar and Dipanjan Das . . . . . . . . . . . . . . . . . . . . . 704
CitationIE: Leveraging the Citation Graph for Scientific Information Extraction

Vijay Viswanathan, Graham Neubig and Pengfei Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 719
From Discourse to Narrative: Knowledge Projection for Event Relation Extraction

Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun, Weijian Xie and Jin Xu . . . . . . . . . . . . . . . . . . . . 732
AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual
NER
Weile Chen, Huiqiang Jiang, Qianhui Wu, Börje Karlsson and Yi Guan . . . . . . . . . . . . . . . . . . . . . 743
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge
Linmei Hu, Tianchi Yang, Luhao Zhang, Wanjun Zhong, Duyu Tang, Chuan Shi, Nan Duan and
Ming Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 754
Discontinuous Named Entity Recognition as Maximal Clique Discovery

Yucheng Wang, Bowen Yu, Hongsong Zhu, Tingwen Liu, Nan Yu and Limin Sun . . . . . . . . . . . . 764
LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking

Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj Sen, Yunyao Li
and Alexander Gray . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 775
Do Context-Aware Translation Models Pay the Right Attention?

Kayo Yin, Patrick Fernandes, Danish Pruthi, Aditi Chaudhary, André F. T. Martins and Graham
Neubig . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 788
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel
Data
Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn and Mona Diab . . . . . . . . . . . . . . . . . . . . 802
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment
Haoyue Shi, Luke Zettlemoyer and Sida I. Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 813
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models

Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Pino, Alexei Baevski, Alexis
Conneau and Michael Auli . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 827
Learning Faithful Representations of Causal Graphs

Ananth Balashankar and Lakshminarayanan Subramanian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 839
What Context Features Can Transformer Language Models Use?

Joe O’Connor and Jacob Andreas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 851
Integrated Directional Gradients: Feature Interaction Attribution for Neural NLP Models
Sandipan Sikdar, Parantapa Bhattacharya and Kieran Heese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 865
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

John Giorgi, Osvald Nitski, Bo Wang and Gary Bader . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 879
XLPT-AMR: Cross-Lingual Pre-Training via Multi-Task Learning for Zero-Shot AMR Parsing and Text
Generation
Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang and Guodong Zhou . . . . . . . . . . . . . . . . . . . . . . . 896
Span-based Semantic Parsing for Compositional Generalization

Jonathan Herzig and Jonathan Berant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 908
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Peter Shaw, Ming-Wei Chang, Panupong Pasupat and Kristina Toutanova . . . . . . . . . . . . . . . . . . . 922
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans

Ethan Wilcox, Pranali Vani and Roger Levy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 939
The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing
Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner and Reut Tsarfaty . . . . . . . . . . 953
To POS Tag or Not to POS Tag: The Impact of POS Tags on Morphological Learning in Low-Resource
Settings
Sarah Moeller, Ling Liu and Mans Hulden . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 966
Prosodic segmentation for parsing spoken dialogue

Elizabeth Nielsen, Mark Steedman and Sharon Goldwater . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 979
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised
Learning and Interpretation
Changhan Wang, Morgane Riviere, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary
Williamson, Juan Pino and Emmanuel Dupoux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 993
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets

Su Lin Blodgett, Gilsinia Lopez, Alexandra Olteanu, Robert Sim and Hanna Wallach . . . . . . . 1004
Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network
Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman and Carolyn Rosé . . . . . . . . . . . . . . . . . . . . 1016
A DQN-based Approach to Finding Precise Evidences for Fact Verification
Hai Wan, Haicheng Chen, Jianfeng Du, Weilin Luo and Rongzhen Ye . . . . . . . . . . . . . . . . . . . . . 1030
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing
Ji Xin, Raphael Tang, Yaoliang Yu and Jimmy Lin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1040
Unsupervised Out-of-Domain Detection via Pre-trained Transformers

Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng and Caiming Xiong . . . . . . . . . . . . . . . 1052
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation

Ahmad Rashid, Vasileios Lioutas and Mehdi Rezagholizadeh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1062
Selecting Informative Contexts Improves Language Model Fine-tuning

Richard Antonello, Nicole Beckage, Javier Turek and Alexander Huth . . . . . . . . . . . . . . . . . . . . . 1072
Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification
Cristina Garbacea, Mengtian Guo, Samuel Carton and Qiaozhu Mei . . . . . . . . . . . . . . . . . . . . . . . 1086
Multi-Task Retrieval for Knowledge-Intensive Tasks

Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oguz, Veselin Stoyanov
and Gargi Ghosh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1098
When Do You Need Billions of Words of Pretraining Data?

Yian Zhang, Alex Warstadt, Xiaocheng Li and Samuel R. Bowman . . . . . . . . . . . . . . . . . . . . . . . . 1112
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation
Elena Voita, Rico Sennrich and Ivan Titov . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1126
Comparing Test Sets with Item Response Theory

Clara Vania, Phu Mon Htut, William Huang, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang,
Haokun Liu, Kyunghyun Cho and Samuel R. Bowman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1141
Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning

Forrest Davis and Marten van Schijndel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1159
More Identifiable yet Equally Performant Transformers for Text Classification

Rishabh Bhardwaj, Navonil Majumder, Soujanya Poria and Eduard Hovy . . . . . . . . . . . . . . . . . . 1172
AugNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation

Xinnuo Xu, Guoyin Wang, Young-Bum Kim and Sungjin Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1183
Can vectors read minds better than experts? Comparing data augmentation strategies for the automated
scoring of children’s mindreading ability
Venelin Kovatchev, Phillip Smith, Mark Lee and Rory Devine . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1196
A Dataset and Baselines for Multilingual Reply Suggestion

Mozhi Zhang, Wei Wang, Budhaditya Deb, Guoqing Zheng, Milad Shokouhi and Ahmed Hassan
Awadallah . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1207
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection
Tasks?
Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania and Samuel R. Bowman . . . . . . . . . . . . . . . . . . . . 1221
Align Voting Behavior with Public Statements for Legislator Representation Learning
Xinyi Mou, Zhongyu Wei, Lei Chen, Shangyi Ning, Yancheng He, Changjian Jiang and Xuanjing
Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1236
Measure and Evaluation of Semantic Divergence across Two Languages

Syrielle Montariol and Alexandre Allauzen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1247
Improving Zero-Shot Translation by Disentangling Positional Information

Danni Liu, Jan Niehues, James Cross, Francisco Guzmán and Xian Li . . . . . . . . . . . . . . . . . . . . . 1259
Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning
Bill Yuchen Lin, Seyeon Lee, Xiaoyang Qiao and Xiang Ren . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1274
Attention Calibration for Transformer in Neural Machine Translation

Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu and Mu Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1288
Diverse Pretrained Context Encodings Improve Document Translation

Domenic Donato, Lei Yu and Chris Dyer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1299
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study
Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar and Sunita
Sarawagi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1312
On Finding the K-best Non-projective Dependency Trees

Ran Zmigrod, Tim Vieira and Ryan Cotterell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1324
Towards Argument Mining for Social Good: A Survey

Eva Maria Vecchi, Neele Falk, Iman Jundi and Gabriella Lapesa . . . . . . . . . . . . . . . . . . . . . . . . . . 1338
Automated Generation of Storytelling Vocabulary from Photographs for use in AAC

Mauricio Fontana de Vargas and Karyn Moffatt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1353
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes
James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, Greg McKelvey, Hui Dai, Yi Yang and David Sontag . . . . . . . . . . . . . . . . . . . . 1365
Assessing Emoji Use in Modern Text Processing Tools

Abu Awal Md Shoeb and Gerard de Melo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1379
Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention
Wasi Ahmad, Xiao Bai, Soomin Lee and Kai-Wei Chang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1389
Factorising Meaning and Form for Intent-Preserving Paraphrasing

Tom Hosking and Mirella Lapata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1405
AggGen: Ordering and Aggregating while Generating

Xinnuo Xu, Ondřej Dušek, Verena Rieser and Ioannis Konstas . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1419
Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models

Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena D. Hwang and Yejin Choi . . . . . . . . . . . . . . . . . . . . 1435
Towards Table-to-Text Generation with Numerical Reasoning

Lya Hulliyyatus Suadaa, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura and Hiroya
Takamura . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1451
BACO: A Background Knowledge- and Content-Based Framework for Citing Sentence Generation
Yubin Ge, Ly Dinh, Xiaofeng Liu, Jinsong Su, Ziyao Lu, Ante Wang and Jana Diesner . . . . . . 1466
Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization

Xiachong Feng, Xiaocheng Feng, Libo Qin, Bing Qin and Ting Liu . . . . . . . . . . . . . . . . . . . . . . . 1479
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval

Akari Asai and Eunsol Choi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1492
A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding

Khalil Mrini, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter Chang, Emilia Farcas and
Ndapa Nakashole . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1505
Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification
Rami Aly, Andreas Vlachos and Ryan McDonald . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1516
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition
Shuang Wu, Xiaoning Song and Zhenhua Feng . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1529
Factuality Assessment as Modal Dependency Parsing

Jiarui Yao, Haoling Qiu, Jin Zhao, Bonan Min and Nianwen Xue . . . . . . . . . . . . . . . . . . . . . . . . . . 1540
Directed Acyclic Graph Network for Conversational Emotion Recognition

Weizhou Shen, Siyue Wu, Yunyi Yang and Xiaojun Quan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1551
Improving Formality Style Transfer with Context-Aware Rule Injection

Zonghai Yao and Hong Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1561
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection

Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou and Yulan He . . . . . . . . . . . . . . . . . . . . . . . . . 1571
Syntopical Graphs for Computational Argumentation Tasks

Joe Barrow, Rajiv Jain, Nedim Lipka, Franck Dernoncourt, Vlad Morariu, Varun Manjunatha,
Douglas Oard, Philip Resnik and Henning Wachsmuth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1583
Stance Detection in COVID-19 Tweets

Kyle Glandt, Sarthak Khanal, Yingjie Li, Doina Caragea and Cornelia Caragea . . . . . . . . . . . . . 1596
Topic-Aware Evidence Reasoning and Stance-Aware Aggregation for Fact Verification

Jiasheng Si, Deyu Zhou, Tongzhe Li, Xingyu Shi and Yulan He . . . . . . . . . . . . . . . . . . . . . . . . . . . 1612
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and
Expert-Annotated Twitter Dataset
Alexandra Ils, Dan Liu, Daniela Grunow and Steffen Eger . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1623
Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions

Dorottya Demszky, Jing Liu, Zid Mancenido, Julie Cohen, Heather Hill, Dan Jurafsky and Tatsunori
Hashimoto . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1638
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies

A. Seza Doğruöz, Sunayana Sitaram, Barbara E. Bullock and Almeida Jacqueline Toribio . . . 1654
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection
Bertie Vidgen, Tristan Thrush, Zeerak Waseem and Douwe Kiela . . . . . . . . . . . . . . . . . . . . . . . . . 1667
InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection
Yi Fung, Christopher Thomas, Revanth Gangi Reddy, Sandeep Polisetty, Heng Ji, Shih-Fu Chang,
Kathleen McKeown, Mohit Bansal and Avi Sil . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1683
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

Yixin Nie, Mary Williamson, Mohit Bansal, Douwe Kiela and Jason Weston . . . . . . . . . . . . . . . 1699
A Sequence-to-Sequence Approach to Dialogue State Tracking

Yue Feng, Yang Wang and Hang Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1714
Discovering Dialog Structure Graph for Coherent Dialog Generation

Jun Xu, Zeyang Lei, Haifeng Wang, Zheng-Yu Niu, Hua Wu and Wanxiang Che . . . . . . . . . . . . 1726
Dialogue Response Selection with Hierarchical Curriculum Learning

Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel
Collier and Yan Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1740
A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech
Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo, Nianwen Xue and Ji-Rong Wen 1752
A Systematic Investigation of KB-Text Embedding Alignment at Scale

Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen and Yu Su . . 1764
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
Haoming Jiang, Danqing Zhang, Tianyu Cao, Bing Yin and Tuo Zhao . . . . . . . . . . . . . . . . . . . . . 1775
Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model
Hongliang Dai, Yangqiu Song and Haixun Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1790
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu
1800
Implicit Representations of Meaning in Neural Language Models

Belinda Z. Li, Maxwell Nye and Jacob Andreas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1813
Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models

Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal Linzen and Yonatan
Belinkov . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1828
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach
Yifan Hou and Mrinmaya Sachan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1844
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases

Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun, Lingyong Yan, Meng Liao, Tong Xue and Jin Xu
1860
Poisoning Knowledge Graph Embeddings via Relation Inference Patterns

Peru Bhardwaj, John Kelleher, Luca Costabello and Declan O’Sullivan . . . . . . . . . . . . . . . . . . . . 1875
Bad Seeds: Evaluating Lexical Methods for Bias Measurement

Maria Antoniak and David Mimno . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1889
A Survey of Race, Racism, and Anti-Racism in NLP

Anjalie Field, Su Lin Blodgett, Zeerak Waseem and Yulia Tsvetkov . . . . . . . . . . . . . . . . . . . . . . . 1905
Intrinsic Bias Metrics Do Not Correlate with Application Bias
Seraphina Goldfarb-Tarrant, Rebecca Marchant, Ricardo Muñoz Sánchez, Mugdha Pandya and
Adam Lopez . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1926
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language
Models
Soumya Barikeri, Anne Lauscher, Ivan Vulić and Goran Glavaš . . . . . . . . . . . . . . . . . . . . . . . . . . . 1941
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks

Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang and Soroush Vosoughi . . . . . . . . . . . . . . . . . . . 1956
Crafting Adversarial Examples for Neural Machine Translation

Xinze Zhang, Junzhe Zhang, Zhenhua Chen and Kun He . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1967
UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource Cross-Lingual NLP
M Saiful Bari, Tasnim Mohiuddin and Shafiq Joty . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1978
Glancing Transformer for Non-Autoregressive Neural Machine Translation

Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu and Lei Li
1993
Hierarchical Context-aware Network for Dense Video Event Captioning

Lei Ji, Xianglin Guo, Haoyang Huang and Xilin Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2004
Control Image Captioning Spatially and Temporally

Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan and Shuai Ma . . . . . . . . . . . . . . . . . . . . . 2014
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation
Jeff Da, Maxwell Forbes, Rowan Zellers, Anthony Zheng, Jena D. Hwang, Antoine Bosselut and
Yejin Choi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2026
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World

Rowan Zellers, Ari Holtzman, Matthew Peters, Roozbeh Mottaghi, Aniruddha Kembhavi, Ali
Farhadi and Yejin Choi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2040
Modeling Fine-Grained Entity Types with Box Embeddings

Yasumasa Onoe, Michael Boratko, Andrew McCallum and Greg Durrett . . . . . . . . . . . . . . . . . . . 2051
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

Zijun Sun, Xiaoya Li, Xiaofei Sun, Yuxian Meng, Xiang Ao, Qing He, Fei Wu and Jiwei Li . . 2065
Weight Distillation: Transferring the Knowledge in Neural Network Parameters

Ye Lin, Yanyang Li, Ziyang Wang, Bei Li, Quan Du, Tong Xiao and Jingbo Zhu . . . . . . . . . . . . 2076
Optimizing Deeper Transformers on Small Datasets

Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung,
Simon J.D. Prince and Yanshuai Cao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2089
BERTAC: Enhancing Transformer-based Language Models with Adversarially Pretrained Convolutional
Neural Networks
Jong-Hoon Oh, Ryu Iida, Julien Kloetzer and Kentaro Torisawa . . . . . . . . . . . . . . . . . . . . . . . . . . . 2103
COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic

Arkadiy Saakyan, Tuhin Chakrabarty and Smaranda Muresan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2116
Explaining Relationships Between Scientific Documents
Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola and Noah A. Smith . 2130
IrEne: Interpretable Energy Prediction for Transformers

Qingqing Cao, Yash Kumar Lal, Harsh Trivedi, Aruna Balasubramanian and Niranjan
Balasubramanian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2145
Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach

Lu Cheng, Ahmadreza Mosallanezhad, Yasin Silva, Deborah Hall and Huan Liu . . . . . . . . . . . . 2158
PlotCoder: Hierarchical Decoding for Synthesizing Visualization Code in Programmatic Context

Xinyun Chen, Linyuan Gong, Alvin Cheung and Dawn Song . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2169
Changing the World by Changing the Data

Anna Rogers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2182
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets

Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang and Jingjing Liu . . . . 2195
On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation

Ruidan He, Linlin Liu, Hai Ye, Qingyu Tan, Bosheng Ding, Liying Cheng, Jiawei Low, Lidong
Bing and Luo Si . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2208
Data Augmentation for Text Generation Without Any Augmented Data

Wei Bi, Huayang Li and Jiacheng Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2223
Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval
Zijing Ou, Qinliang Su, Jianxing Yu, Bang Liu, Jingwen Wang, Ruihui Zhao, Changyou Chen and
Yefeng Zheng . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2238
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis
Joshua Feinglass and Yezhou Yang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2250
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers

Chia-Hsuan Lee, Oleksandr Polozov and Matthew Richardson . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2261
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus
Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury and Ahmed Ali . . . . . . . . . . . . . . 2274
An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models

Xueqing Liu and Chi Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2286
Better than Average: Paired Evaluation of NLP systems

Maxime Peyrard, Wei Zhao, Steffen Eger and Robert West . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2301
Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL
Jiaqi Guo, Ziliang Si, Yu Wang, Qian Liu, Ming Fan, Jian-Guang Lou, Zijiang Yang and Ting Liu
2316
CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding
Dong Wang, Ning Ding, Piji Li and Haitao Zheng . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2332
Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference

Ziye Chen, Cheng Ding, Zusheng Zhang, Yanghui Rao and Haoran Xie . . . . . . . . . . . . . . . . . . . . 2343
ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning
Li Du, Xiao Ding, Kai Xiong, Ting Liu and Bing Qin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2354
Distributed Representations of Emotion Categories in Emotion Space

Xiangyu Wang and Chengqing Zong . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2364
Style is NOT a single variable: Case Studies for Cross-Stylistic Language Understanding
Dongyeop Kang and Eduard Hovy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2376
DynaSent: A Dynamic Benchmark for Sentiment Analysis

Christopher Potts, Zhengxuan Wu, Atticus Geiger and Douwe Kiela . . . . . . . . . . . . . . . . . . . . . . . 2388
A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow
Bidisha Samanta, Mohit Agrawal and Niloy Ganguly . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2405
A Unified Generative Framework for Aspect-based Sentiment Analysis

Hang Yan, Junqi Dai, Tuo Ji, Xipeng Qiu and Zheng Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2416
Discovering Dialogue Slots with Weak Supervision

Vojtěch Hudeček, Ondřej Dušek and Zhou Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2430
Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU
Yilin Shen, Yen-Chang Hsu, Avik Ray and Hongxia Jin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2443
ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing

Thomas Dopierre, Christophe Gravier and Wilfried Logerais . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2454
Robustness Testing of Language Understanding in Task-Oriented Dialog

Jiexi Liu, Ryuichi Takanobu, Jiaxin Wen, Dazhen Wan, Hongguang Li, Weiran Nie, Cheng Li, Wei
Peng and Minlie Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2467
Comprehensive Study: How the Context Information of Different Granularity Affects Dialogue State
Tracking?
Puhai Yang, Heyan Huang and Xian-Ling Mao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2481
OTTers: One-turn Topic Transitions for Open-Domain Dialogue

Karin Sevegnani, David M. Howcroft, Ioannis Konstas and Verena Rieser . . . . . . . . . . . . . . . . . . 2492
Towards Robustness of Text-to-SQL Models against Synonym Substitution

Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward, Jinxia Xie and
Pengsheng Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2505
KACE: Generating Knowledge Aware Contrastive Explanations for Natural Language Inference
Qianglong Chen, Feng Ji, Xiangji Zeng, Feng-Lin Li, Ji Zhang, Haiqing Chen and Yin Zhang 2516
Self-Guided Contrastive Learning for BERT Sentence Representations

Taeuk Kim, Kang Min Yoo and Sang-goo Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2528
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations
Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu and Kai Yu . . . . . . . . . . . . . . . . . . . . . . 2541
Multi-stage Pre-training over Simplified Multimodal Pre-training Models

Tongtong Liu, Fangxiang Feng and Xiaojie Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2556
Beyond Sentence-Level End-to-End Speech Translation: Context Helps

Biao Zhang, Ivan Titov, Barry Haddow and Rico Sennrich . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2566
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding
Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio,
Cha Zhang, Wanxiang Che, Min Zhang and Lidong Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2579
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
Wei Li, Can Gao, Guocheng Niu, Xinyan Xiao, Hao Liu, Jiachen Liu, Hua Wu and Haifeng Wang
2592
Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities
Jinming Zhao, Ruichen Li and Qin Jin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2608
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation
Encoders
Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao and Jingbo Zhu 2619
N-ary Constituent Tree Parsing with Recursive Semi-Markov Model

Xin Xin, Jinlong Li and Zeqi Tan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2631
Automated Concatenation of Embeddings for Structured Prediction

Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu
2643
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision

Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu
2661
The Limitations of Limited Context for Constituency Parsing

Yuchen Li and Andrej Risteski . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2675
Neural Bi-Lexicalized PCFG Induction

Songlin Yang, Yanpeng Zhao and Kewei Tu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2688
Ruddit: Norms of Offensiveness for English Reddit Comments

Rishav Hada, Sohi Sudhir, Pushkar Mishra, Helen Yannakoudakis, Saif M. Mohammad and
Ekaterina Shutova . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2700
Towards Quantifiable Dialogue Coherence Evaluation

Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin and Xiaodan Liang . . . . . . . . . . . . . . . . . . . . . . . 2718
Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled
at Type and Token Levels
Marcos Garcia, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart and Aline Villavicencio 2730
Factoring Statutory Reasoning as Language Understanding Challenges

Nils Holzenberger and Benjamin Van Durme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2742
Evaluating Evaluation Measures for Ordinal Classification and Ordinal Quantification

Tetsuya Sakai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2759
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making
Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang and
Zelin Dai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2770
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition
Yongliang Shen, Xinyin Ma, Zeqi Tan, Shuai Zhang, Wen Wang and Weiming Lu . . . . . . . . . . 2782
Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction

Yaojie Lu, Hongyu Lin, Jin Xu, Xianpei Han, Jialong Tang, Annan Li, Le Sun, Meng Liao and
Shaoyi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2795
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu and Jun Zhao . . . . . . . . . . . . . . . . . . . . . . . . . . 2807
A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization
Zongcheng Ji, Tian Xia, Mei Han and Jing Xiao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2819
OntoED: Low-resource Event Detection with Ontology Embedding

Shumin Deng, Ningyu Zhang, Luoqiu Li, Chen Hui, Tou Huaixiao, Mosha Chen, Fei Huang and
Huajun Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2828
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation
Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Shuming Shi, Michael Lyu and Irwin King . . . . . . . 2840
Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation with Cross-Task Pre-training
Linqing Chen, Junhui Li, Zhengxian Gong, Boxing Chen, Weihua Luo, Min Zhang and Guodong
Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2851
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation
Yang Feng, Shuhao Gu, Dengji Guo, Zhengxin Yang and Chenze Shao . . . . . . . . . . . . . . . . . . . . 2862
Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri
and Marco Turchi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2873
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning

Cheonbok Park, Yunwon Tae, TaeHee Kim, Soyoung Yang, Mohammad Azam Khan, Lucy Park
and Jaegul Choo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2888
Lightweight Cross-Lingual Sentence Representation Learning

Zhuoyuan Mao, Prakhar Gupta, Chenhui Chu, Martin Jaggi and Sadao Kurohashi . . . . . . . . . . . 2902
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

SiYu Ding, Junyuan Shang, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu and Haifeng Wang . 2914
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation
Yuanxin Liu, Fandong Meng, Zheng Lin, Weiping Wang and Jie Zhou . . . . . . . . . . . . . . . . . . . . 2928
Rational LAMOL: A Rationale-based Lifelong Learning Framework

Kasidis Kanwatchara, Thanapapas Horsuwan, Piyawat Lertvittayakumjorn, Boonserm Kijsirikul
and Peerapon Vateekul . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2942
EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering

Zhibin Duan, Hao Zhang, Chaojie Wang, Zhengjue Wang, Bo Chen and Mingyuan Zhou . . . . 2954
LeeBERT: Learned Early Exit for BERT with cross-level optimization

Wei Zhu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2968
Unsupervised Extractive Summarization-Based Representations for Accurate and Explainable Collaborative Filtering
Reinald Adrian Pugoy and Hung-Yu Kao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2981
PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction

Shulin Liu, Tao Yang, Tianchi Yue, Feng Zhang and Di Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2991
Competence-based Multimodal Curriculum Learning for Medical Report Generation

Fenglin Liu, Shen Ge and Xian Wu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3001
Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment
Xinying Qiu, Yuan Chen, Hanwu Chen, Jian-Yun Nie, Yuming Shen and Dawei Lu . . . . . . . . . 3013
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains
Haojie Pan, Chengyu Wang, Minghui Qiu, Yichang Zhang, Yaliang Li and Jun Huang . . . . . . . 3026
A Semantic-based Method for Unsupervised Commonsense Question Answering

Yilin Niu, Fei Huang, Jiaming Liang, Wenkai Chen, Xiaoyan Zhu and Minlie Huang . . . . . . . . 3037
Explanations for CommonsenseQA: New Dataset and Models

Shourya Aggarwal, Divyanshu Mandowara, Vishwajeet Agrawal, Dinesh Khandelwal, Parag Singla
and Dinesh Garg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3050
Few-Shot Question Answering by Pretraining Span Selection

Ori Ram, Yuval Kirstain, Jonathan Berant, Amir Globerson and Omer Levy . . . . . . . . . . . . . . . . 3066
UnitedQA: A Hybrid Approach for Open Domain Question Answering

Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen and Jianfeng Gao . . . . 3080
Database reasoning over text

James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel and Alon Halevy
3091
Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human
Effort
Vânia Mendonça, Ricardo Rei, Luisa Coheur, Alberto Sardinha and Ana Lúcia Santos . . . . . . 3105
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
Phillip Rust, Jonas Pfeiffer, Ivan Vulić, Sebastian Ruder and Iryna Gurevych . . . . . . . . . . . . . . . 3118
Evaluating morphological typology in zero-shot cross-lingual transfer

Antonio Martínez-García, Toni Badia and Jeremy Barnes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3136
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text

Ishan Tarunesh, Syamantak Kumar and Preethi Jyothi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3154
Fast and Accurate Neural Machine Translation with Translation Memory

Qiuxiang He, Guoping Huang, Qu Cui, Li Li and Lemao Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3170
Annotating Online Misogyny

Philine Zeinert, Nanna Inie and Leon Derczynski . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3181
Few-NERD: A Few-shot Named Entity Recognition Dataset

Ning Ding, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie, Haitao Zheng and
Zhiyuan Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3198
MultiMET: A Multimodal Dataset for Metaphor Understanding
Dongyu Zhang, Minghao Zhang, Heting Zhang, Liang Yang and Hongfei Lin . . . . . . . . . . . . . . 3214
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate
Speech
Margherita Fanton, Helena Bonaldi, Serra Sinem Tekiroğlu and Marco Guerini . . . . . . . . . . . . . 3226
Can Generative Pre-trained Language Models Serve As Knowledge Bases for Closed-book QA?
Cunxiang Wang, Pai Liu and Yue Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3241
Joint Models for Answer Verification in Question Answering Systems

Zeyu Zhang, Thuy Vu and Alessandro Moschitti . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3252
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction
Yifan Gao, Henghui Zhu, Patrick Ng, Cicero Nogueira dos Santos, Zhiguo Wang, Feng Nan, Dejiao
Zhang, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3263
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
Fengbin Zhu, Wenqiang Lei, Youcheng Huang, Chao Wang, Shuo Zhang, Jiancheng Lv, Fuli Feng
and Tat-Seng Chua . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3277
Modeling Transitions of Focal Entities for Conversational Knowledge Base Question Answering
Yunshi Lan and Jing Jiang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3288
Evidence-based Factual Error Correction

James Thorne and Andreas Vlachos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3298
Probabilistic, Structure-Aware Algorithms for Improved Variety, Accuracy, and Coverage of AMR Alignments
Austin Blodgett and Nathan Schneider . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3310
Meta-Learning to Compositionally Generalize

Henry Conklin, Bailin Wang, Kenny Smith and Ivan Titov . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3322
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
Shizhe Diao, Ruijia Xu, Hongjin Su, Yilei Jiang, Yan Song and Tong Zhang . . . . . . . . . . . . . . . . 3336
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive
Learning
Yujia Qin, Yankai Lin, Ryuichi Takanobu, Zhiyuan Liu, Peng Li, Heng Ji, Minlie Huang, Maosong
Sun and Jie Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3350
Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction
Hanqi Yan, Lin Gui, Gabriele Pergola and Yulan He . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3364
Every Bite Is an Experience: Key Point Analysis of Business Reviews

Roy Bar-Haim, Lilach Eden, Yoav Kantor, Roni Friedman and Noam Slonim . . . . . . . . . . . . . . . 3376
Structured Sentiment Analysis as Dependency Graph Parsing

Jeremy Barnes, Robin Kurtz, Stephan Oepen, Lilja Øvrelid and Erik Velldal . . . . . . . . . . . . . . . . 3387
Consistency Regularization for Cross-Lingual Fine-Tuning

Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che,
Ting Liu, Xia Song and Furu Wei . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3403
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang and Furu Wei . . . 3418
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation
Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao and Zhaopeng Tu . . . . 3431
G-Transformer for Document-Level Machine Translation

Guangsheng Bao, Yue Zhang, Zhiyang Teng, Boxing Chen and Weihua Luo . . . . . . . . . . . . . . . . 3442
Prevent the Language Model from being Overconfident in Neural Machine Translation
Mengqi Miao, Fandong Meng, Yijin Liu, Xiao-Hua Zhou and Jie Zhou . . . . . . . . . . . . . . . . . . . . 3456
Towards Emotional Support Dialog Systems

Siyang Liu, Chujie Zheng, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong Jiang and Minlie
Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3469
Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue
System
Yanan Wu, Zhiyuan Zeng, Keqing He, Hong Xu, Yuanmeng Yan, Huixing Jiang and Weiran Xu
3484
GTM: A Generative Triple-wise Model for Conversational Question Generation

Lei Shen, Fandong Meng, Jinchao Zhang, Yang Feng and Jie Zhou . . . . . . . . . . . . . . . . . . . . . . . . 3495
Diversifying Dialog Generation via Adaptive Label Smoothing

Yida Wang, Yinhe Zheng, Yong Jiang and Minlie Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3507
Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training

Li-Ming Zhan, Haowen Liang, Bo Liu, Lu Fan, Xiao-Ming Wu and Albert Y.S. Lam . . . . . . . 3521
Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker
Runxin Xu, Tianyu Liu, Lei Li and Baobao Chang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3533
Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path
Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe . . . . . . . . . . . . . . . . . . . . . . . . 3547
LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification

Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng and Yuguang Chen . 3558
Revisiting the Negative Data of Distantly Supervised Relation Extraction

Chenhao Xie, Jiaqing Liang, Jingping Liu, Chengsong Huang, Wenhao Huang and Yanghua Xiao
3572
Knowing the No-match: Entity Alignment with Dangling Cases

Zequn Sun, Muhao Chen and Wei Hu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3582
Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s Interpretation of Complex
Words
Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3594
BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?
Asahi Ushio, Luis Espinosa Anke, Steven Schockaert and Jose Camacho-Collados . . . . . . . . . . 3609
Exploring the Representation of Word Meanings in Context: A Case Study on Homonymy and Synonymy
Marcos Garcia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3625
Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach
Jie Huang, Kevin Chang, JinJun Xiong and Wen-mei Hwu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3641
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations

Weixin Liang, Kai-Hui Liang and Zhou Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3652
Value-Agnostic Conversational Semantic Parsing

Emmanouil Antonios Platanios, Adam Pauls, Subhro Roy, Yuchen Zhang, Alexander Kyte, Alan
Guo, Sam Thomson, Jayant Krishnamurthy, Jason Wolfe, Jacob Andreas and Dan Klein . . . . . . . . . . 3666
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding

Jia-Chen Gu, Chongyang Tao, Zhenhua Ling, Can Xu, Xiubo Geng and Daxin Jiang . . . . . . . . 3682
Best of Both Worlds: Making High Accuracy Non-incremental Transformer-based Disfluency Detection
Incremental
Morteza Rohanian and Julian Hough . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3693
NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based Simulation

Sungdong Kim, Minsuk Chang and Sang-Woo Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3704
CDRNN: Discovering Complex Dynamics in Human Language Processing

Cory Shain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3718
Structural Guidance for Transformer Language Models

Peng Qian, Tahira Naseem, Roger Levy and Ramón Fernandez Astudillo . . . . . . . . . . . . . . . . . . . 3735
Surprisal Estimators for Human Reading Times Need Character Models

Byung-Doh Oh, Christian Clark and William Schuler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3746
CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals
Yuqi Ren and Deyi Xiong . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3758
Self-Attention Networks Can Process Bounded Hierarchical Languages

Shunyu Yao, Binghui Peng, Christos Papadimitriou and Karthik Narasimhan . . . . . . . . . . . . . . . 3770
TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling

Parker Riley, Noah Constant, Mandy Guo, Girish Kumar, David Uthus and Zarana Parekh . . . 3786
H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences

Zhenhai Zhu and Radu Soricut . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3801
Making Pre-trained Language Models Better Few-shot Learners

Tianyu Gao, Adam Fisch and Danqi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3816
A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger’s Adversarial Attacks
Thai Le, Noseong Park and Dongwon Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3831
Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor
Detection
Lingwei Wei, Dou Hu, Wei Zhou, Zhaojuan Yue and Songlin Hu . . . . . . . . . . . . . . . . . . . . . . . . . . 3845
Label-Specific Dual Graph Neural Network for Multi-Label Text Classification

Qianwen Ma, Chunyuan Yuan, Wei Zhou and Songlin Hu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3855
TAN-NTM: Topic Attention Networks for Neural Topic Modeling

Madhur Panwar, Shashank Shailabh, Milan Aggarwal and Balaji Krishnamurthy . . . . . . . . . . . . 3865
Cross-language Sentence Selection via Data Augmentation and Rationale Training
Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuscakova, Rui Zhang, Douglas Oard and Kathleen
McKeown . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3881
A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document
Collections
Dimitris Pappas and Ion Androutsopoulos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3896
W-RST: Towards a Weighted RST-style Discourse Framework

Patrick Huber, Wen Xiao and Giuseppe Carenini . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3908
ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences
Yanjun Gao, Ting-Hao Huang and Rebecca J. Passonneau . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3919
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering

Najoung Kim, Ellie Pavlick, Burcu Karagol Ayan and Deepak Ramachandran . . . . . . . . . . . . . . 3932
Adversarial Learning for Discourse Rhetorical Structure Parsing

Longyin Zhang, Fang Kong and Guodong Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3946
Exploring Discourse Structures for Argument Impact Classification

Xin Liu, Jiefu Ou, Yangqiu Song and Xin Jiang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3958
Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation
Tong Zhang, Long Zhang, Wei Ye, Bo Li, Jinan Sun, Xiaoyu Zhu, Wen Zhao and Shikun Zhang
3970
VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation
Fuli Luo, Wei Wang, Jiahao Liu, Yijia Liu, Bin Bi, Songfang Huang, Fei Huang and Luo Si . . 3980
A unified approach to sentence segmentation of punctuated text in many languages

Rachel Wicks and Matt Post . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3995
Towards User-Driven Neural Machine Translation

Huan Lin, Liang Yao, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Degen Huang and
Jinsong Su . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4008
End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages

Josef Jon, João Paulo Aires, Dusan Varis and Ondřej Bojar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4019
Handling Extreme Class Imbalance in Technical Logbook Datasets

Farhad Akhbardeh, Cecilia Ovesdotter Alm, Marcos Zampieri and Travis Desell . . . . . . . . . . . . 4034
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation
Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripabandhu Ghosh, Shouvik Kumar Guha,
Arnab Bhattacharya and Ashutosh Modi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4046
Supporting Cognitive and Emotional Empathic Writing of Students

Thiemo Wambsganss, Christina Niklaus, Matthias Söllner, Siegfried Handschuh and Jan Marco
Leimeister . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4063
Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering
Alexander Hanbo Li, Patrick Ng, Peng Xu, Henghui Zhu, Zhiguo Wang and Bing Xiang . . . . . 4078
Generation-Augmented Retrieval for Open-Domain Question Answering
Yuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han and Weizhu
Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4089
Check It Again: Progressive Visual Question Answering via Visual Entailment

Qingyi Si, Zheng Lin, Ming yu Zheng, Peng Fu and Weiping Wang . . . . . . . . . . . . . . . . . . . . . . . 4101
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised
Question Answering
Zhihong Shao, Lifeng Shang, Qun Liu and Minlie Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4111
Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?
Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson and Norman Sadeh . 4125
Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe,
Christian Bender, Christoph Mengs, Gerik Scheuermann and Gerhard Heyer . . . . . . . . . . . . . . . 4141
Reliability Testing for Natural Language Processing Systems

Samson Tan, Shafiq Joty, Kathy Baxter, Araz Taeihagh, Gregory A. Bennett and Min-Yen Kan . . . 4153
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
Paul Pu Liang, Terrance Liu, Anna Cai, Michal Muszynski, Ryo Ishii, Nick Allen, Randy Auerbach,
David Brent, Ruslan Salakhutdinov and Louis-Philippe Morency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4170
Anonymisation Models for Text Data: State of the art, Challenges and Future Directions
Pierre Lison, Ildikó Pilán, David Sanchez, Montserrat Batet and Lilja Øvrelid . . . . . . . . . . . . . . 4188
End-to-End AMR Coreference Resolution

Qiankun Fu, Linfeng Song, Wenyu Du and Yue Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4204
How is BERT surprised? Layerwise detection of linguistic anomalies

Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu and Frank Rudzicz . . . . . . . . . . . . . . . . . . . . . . 4215
Psycholinguistic Tripartite Graph Network for Personality Detection

Tao Yang, Feifan Yang, Haolan Ouyang and Xiaojun Quan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4229
Verb Metaphor Detection via Contextual Relation Learning

Wei Song, Shuhui Zhou, Ruiji Fu, Ting Liu and Lizhen Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4240
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Yun Tang, Juan Pino, Xian Li, Changhan Wang and Dmitriy Genzel . . . . . . . . . . . . . . . . . . . . . . . 4252
Probing Toxic Content in Large Pre-Trained Language Models

Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song and Dit-Yan Yeung . . . . . . . 4262
Societal Biases in Language Generation: Progress and Challenges

Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng . . . . . . . . . . . . . . . . . . . . . . . . . . 4275
Reservoir Transformers
Sheng Shen, Alexei Baevski, Ari Morcos, Kurt Keutzer, Michael Auli and Douwe Kiela . . . . . 4294
Subsequence Based Deep Active Learning for Named Entity Recognition

Puria Radmard, Yassir Fathullah and Aldo Lipani . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4310
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
Tyler Chang, Yifan Xu, Weijian Xu and Zhuowen Tu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4322
BinaryBERT: Pushing the Limit of BERT Quantization

Haoli Bai, Wei Zhang, Lu Hou, Lifeng Shang, Jin Jin, Xin Jiang, Qun Liu, Michael Lyu and Irwin
King . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4334
Are Pretrained Convolutions Better than Pretrained Transformers?

Yi Tay, Mostafa Dehghani, Jai Prakash Gupta, Vamsi Aribandi, Dara Bahri, Zhen Qin and Donald
Metzler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4349
PairRE: Knowledge Graph Embeddings via Paired Relation Vectors

Linlin Chao, Jianshan He, Taifeng Wang and Wei Chu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4360
Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification

Haibin Chen, Qianli Ma, Zhenxi Lin and Jiangyue Yan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4370
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability
Jiaao Chen, Dinghan Shen, Weizhu Chen and Diyi Yang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4380
Neural Stylistic Response Generation with Disentangled Latent Variables

Qingfu Zhu, Wei-Nan Zhang, Ting Liu and William Yang Wang . . . . . . . . . . . . . . . . . . . . . . . . . . 4391
Intent Classification and Slot Filling for Privacy Policies

Wasi Ahmad, Jianfeng Chi, Tu Le, Thomas Norton, Yuan Tian and Kai-Wei Chang . . . . . . . . . 4402
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
Baolin Peng, Chunyuan Li, Zhu Zhang, Chenguang Zhu, Jinchao Li and Jianfeng Gao . . . . . . . 4418
Semantic Representation for Dialogue Modeling

Xuefeng Bai, Yulong Chen, Linfeng Song and Yue Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4430
A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations

Chongyang Tao, Changyu Chen, Jiazhan Feng, Ji-Rong Wen and Rui Yan . . . . . . . . . . . . . . . . . . 4446
Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks

Yuanhe Tian, Guimin Chen, Yan Song and Xiang Wan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4458
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP

Anthony Chen, Pallavi Gudipati, Shayne Longpre, Xiao Ling and Sameer Singh . . . . . . . . . . . . 4472
Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards?
Pedro Rodriguez, Joe Barrow, Alexander Miserlis Hoyle, John P. Lalor, Robin Jia and Jordan
Boyd-Graber . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4486
Claim Matching Beyond English to Scale Global Fact-Checking

Ashkan Kazemi, Kiran Garimella, Devin Gaffney and Scott Hale . . . . . . . . . . . . . . . . . . . . . . . . . . 4504
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation
Shuo Ren, Long Zhou, Shujie Liu, Furu Wei, Ming Zhou and Shuai Ma . . . . . . . . . . . . . . . . . . . . 4518
Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models

Sumanta Bhattacharyya, Amirmohammad Rooshenas, Subhajit Naskar, Simeng Sun, Mohit Iyyer
and Andrew McCallum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4528
Syntax-augmented Multilingual BERT for Cross-lingual Transfer
Wasi Ahmad, Haoran Li, Kai-Wei Chang and Yashar Mehdad . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4538
How to Adapt Your Pretrained Multilingual Model to 1600 Languages

Abteen Ebrahimi and Katharina Kann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4555
Weakly Supervised Named Entity Tagging with Learnable Logical Rules

Jiacheng Li, Haibo Ding, Jingbo Shang, Julian McAuley and Zhe Feng . . . . . . . . . . . . . . . . . . . . 4568
Prefix-Tuning: Optimizing Continuous Prompts for Generation

Xiang Lisa Li and Percy Liang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4582
One2Set: Generating Diverse Keyphrases as a Set

Jiacheng Ye, Tao Gui, Yichao Luo, Yige Xu and Qi Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4598
Continuous Language Generative Flow

Zineng Tang, Shiyue Zhang, Hyounghun Kim and Mohit Bansal . . . . . . . . . . . . . . . . . . . . . . . . . . 4609
TWAG: A Topic-Guided Wikipedia Abstract Generator

Fangwei Zhu, Shangqing Tu, Jiaxin Shi, Juanzi Li, Lei Hou and Tong Cui . . . . . . . . . . . . . . . . . . 4623
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data
Woojeong Jin, Rahul Khanna, Suji Kim, Dong-Ho Lee, Fred Morstatter, Aram Galstyan and Xiang
Ren . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4636
Recursive Tree-Structured Self-Attention for Answer Sentence Selection

Khalil Mrini, Emilia Farcas and Ndapa Nakashole . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4651
How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction
Zikun Hu, Yixin Cao, Lifu Huang and Tat-Seng Chua . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4662
Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction
Kaiwen Wei, Xian Sun, Zequn Zhang, Jingyuan Zhang, Guo Zhi and Li Jin . . . . . . . . . . . . . . . . . 4672
Element Intervention for Open Relation Extraction

Fangchao Liu, Lingyong Yan, Hongyu Lin, Xianpei Han and Le Sun . . . . . . . . . . . . . . . . . . . . . . 4683
AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding
Jun Yan, Nasser Zalmout, Yan Liang, Christan Grant, Xiang Ren and Xin Luna Dong . . . . . . . 4694
CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction
Zhengbao Jiang, Jialong Han, Bunyamin Sisman and Xin Luna Dong . . . . . . . . . . . . . . . . . 4706
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference

Robert L Logan IV, Andrew McCallum, Sameer Singh and Dan Bikel . . . . . . . . . . . . . . . . . . . . . 4717
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs
Zixuan Li, Xiaolong Jin, Saiping Guan, Wei Li, Jiafeng Guo, Yuanzhuo Wang and Xueqi Cheng
4732
Employing Argumentation Knowledge Graphs for Neural Argument Generation

Khalid Al Khatib, Lukas Trautner, Henning Wachsmuth, Yufang Hou and Benno Stein . . . . . . 4744
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction

Lu Xu, Yew Ken Chia and Lidong Bing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4755
On Compositional Generalization of Neural Machine Translation
Yafu Li, Yongjing Yin, Yulong Chen and Yue Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4767
Mask-Align: Self-Supervised Neural Word Alignment

Chi Chen, Maosong Sun and Yang Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4781
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation

Huayang Li, Lemao Liu, Guoping Huang and Shuming Shi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4792
De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

Wenkai Zhang, Hongyu Lin, Xianpei Han and Le Sun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4803
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition
Fei Li, ZhiChao Lin, Meishan Zhang and Donghong Ji . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4814
MLBiNet: A Cross-Sentence Collective Event Detection Network

Dongfang Lou, Zhilin Liao, Shumin Deng, Ningyu Zhang and Huajun Chen . . . . . . . . . . . . . . . . 4829
Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution
Hieu Minh Tran, Duy Phung and Thien Huu Nguyen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4840
StereoRel: Relational Triple Extraction from a Stereoscopic Perspective

Xuetao Tian, Liping Jing, Lu He and Feng Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4851
Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks

Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen and Weihua Peng . 4862
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution
Fanchao Qi, Yuan Yao, Sophia Xu, Zhiyuan Liu and Maosong Sun . . . . . . . . . . . . . . . . . . . . . . . . 4873
Parameter-Efficient Transfer Learning with Diff Pruning

Demi Guo, Alexander Rush and Yoon Kim . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4884
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language
Modeling
Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng and Gerard de Melo . . . . . 4897
Risk Minimization for Zero-shot Sequence Labeling

Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu
4909
WARP: Word-level Adversarial ReProgramming

Karen Hambardzumyan, Hrant Khachatrian and Jonathan May . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4921
Lexicon Learning for Few Shot Sequence Modeling

Ekin Akyurek and Jacob Andreas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4934
Personalized Transformer for Explainable Recommendation

Lei Li, Yongfeng Zhang and Li Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4947
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques
Kundan Krishna, Sopan Khosla, Jeffrey Bigham and Zachary C. Lipton . . . . . . . . . . . . . . . . . . . . 4958
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction

Piji Li and Shuming Shi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4973
Early Detection of Sexual Predators in Chats
Matthias Vogt, Ulf Leser and Alan Akbik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4985
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation

Xingyi Yang, Muchao Ye, Quanzeng You and Fenglong Ma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5000
Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification
Xuepeng Wang, Li Zhao, Bing Liu, Tao Chen, Feng Zhang and Di Wang . . . . . . . . . . . . . . . . . . . 5010
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted
Bag-of-words
Xiaopeng Lu, Tiancheng Zhao and Kyusong Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5020
Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

Si Sun, Yingzhuo Qian, Zhenghao Liu, Chenyan Xiong, Kaitao Zhang, Jie Bao, Zhiyuan Liu and
Paul Bennett . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5030
Semi-Supervised Text Classification with Balanced Deep Representation Distributions

Changchun Li, Ximing Li and Jihong Ouyang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5044
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
Hongyin Tang, Xingwu Sun, Beihong Jin, Jingang Wang, Fuzheng Zhang and Wei Wu . . . . . . 5054
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Yuanmeng Yan, Rumei Li, Sirui Wang, Fuzheng Zhang, Wei Wu and Weiran Xu . . . . . . . . . . . . 5065
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation

Hui Jiang, Chulun Zhou, Fandong Meng, Biao Zhang, Jie Zhou, Degen Huang, Qingqiang Wu and
Jinsong Su . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5076
COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion
Debjit Paul and Anette Frank . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5086
Reasoning over Entity-Action-Location Graph for Procedural Text Understanding

Hao Huang, Xiubo Geng, Jian Pei, Guodong Long and Daxin Jiang . . . . . . . . . . . . . . . . . . . . . . . 5100
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic
Decoding
Shan Wu, Bo Chen, Chunlei Xin, Xianpei Han, Le Sun, Weipeng Zhang, Jiansong Chen, Fan Yang
and Xunliang Cai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5110
Pre-training Universal Language Representation

Yian Li and Hai Zhao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5122
Structural Pre-training for Dialogue Comprehension

Zhuosheng Zhang and Hai Zhao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5134
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models

Yichun Yin, Cheng Chen, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu . . . . . . . . . . . . . . . . 5146
Data Augmentation with Adversarial Training for Cross-Lingual NLI

Xin Dong, Yaxin Zhu, Zuohui Fu, Dongkuan Xu and Gerard de Melo . . . . . . . . . . . . . . . . . . . . . . 5158
Bootstrapped Unsupervised Sentence Representation Learning

Yan Zhang, Ruidan He, Zuozhu Liu, Lidong Bing and Haizhou Li . . . . . . . . . . . . . . . . . . . . . . 5168
Learning Event Graph Knowledge for Abductive Reasoning
Li Du, Xiao Ding, Ting Liu and Bing Qin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5181
A Cognitive Regularizer for Language Modeling

Jason Wei, Clara Meister and Ryan Cotterell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5191
Lower Perplexity is Not Always Human-Like

Tatsuki Kuribayashi, Yohei Oseki, Takumi Ito, Ryo Yoshida, Masayuki Asahara and Kentaro Inui
5203
Word Sense Disambiguation: Towards Interactive Context Exploitation from Both Word and Sense Perspectives
Ming Wang and Yinglin Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5218
A Knowledge-Guided Framework for Frame Identification

Xuefeng Su, Ru Li, Xiaoli Li, Jeff Z. Pan, Hu Zhang, Qinghua Chai and Xiaoqi Han . . . . . . . . 5230
Obtaining Better Static Word Embeddings Using Contextual Embedding Models

Prakhar Gupta and Martin Jaggi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5241
Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation

Yingjun Du, Nithin Holla, Xiantong Zhen, Cees Snoek and Ekaterina Shutova . . . . . . . . . . . . . . 5254
LexFit: Lexical Fine-Tuning of Pretrained Language Models

Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen and Goran Glavaš . . . . . . . . . . . . . . . . . . . . . . . 5269
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units

Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song and James Glass . . . . . . . . . . . . 5284
CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion
Network
Jiajia Tang, Kang Li, Xuanyu Jin, Andrzej Cichocki, Qibin Zhao and Wanzeng Kong . . . . . . . . 5301
Positional Artefacts Propagate Through Masked Language Model Embeddings

Ziyang Luo, Artur Kulmizev and Xiaoxi Mao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5312
Language Model Evaluation Beyond Perplexity

Clara Meister and Ryan Cotterell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5328
Learning to Explain: Generating Stable Explanations Fast

Xuelin Situ, Ingrid Zukerman, Cecile Paris, Sameen Maruf and Gholamreza Haffari . . . . . . . . . 5340
StereoSet: Measuring stereotypical bias in pretrained language models

Moin Nadeem, Anna Bethke and Siva Reddy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5356
Alignment Rationale for Natural Language Inference

Zhongtao Jiang, Yuanzhe Zhang, Zhao Yang, Jun Zhao and Kang Liu . . . . . . . . . . . . . . . . . . . . . . 5372
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators
Peiyu Liu, Ze-Feng Gao, Wayne Xin Zhao, Zhi-Yuan Xie, Zhong-Yi Lu and Ji-Rong Wen . . . 5388
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation
Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui and Fan Zhang . . . . . . . . . 5399
Syntax-Enhanced Pre-trained Model
Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun Zhong, Xiaojun
Quan, Daxin Jiang and Nan Duan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5412
Matching Distributions between Model and Data: Cross-domain Knowledge Distillation for Unsupervised Domain Adaptation
Bo Zhang, Xiaoming Zhang, Yun Liu, Lei Cheng and Zhoujun Li . . . . . . . . . . . . . . . . . . . . . . . . . 5423
Counterfactual Inference for Text Classification Debiasing

Chen Qian, Fuli Feng, Lijie Wen, Chunping Ma and Pengjun Xie . . . . . . . . . . . . . . . . . . . . . . . . . 5434
HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation

Tao Qi, Fangzhao Wu, Chuhan Wu, Peiru Yang, Yang Yu, Xing Xie and Yongfeng Huang . . . 5446
PP-Rec: News Recommendation with Personalized User Interest and Time-aware News Popularity
Tao Qi, Fangzhao Wu, Chuhan Wu and Yongfeng Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5457
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked
Claims
Qiang Sheng, Juan Cao, Xueyao Zhang, Xirong Li and Lei Zhong . . . . . . . . . . . . . . . . . . . . . . . . . 5468
Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble
Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang and Xuanjing Huang . . . . . . . . . . . . 5482
Shortformer: Better Language Modeling using Shorter Inputs

Ofir Press, Noah A. Smith and Mike Lewis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5493
BanditMTL: Bandit-based Multi-task Learning for Text Classification

Yuren Mao, Zekai Wang, Weiwei Liu, Xuemin Lin and Wenbin Hu . . . . . . . . . . . . . . . . . . . . . . . . 5506
Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge
Graph Embedding
Hidetaka Kamigaito and Katsuhiko Hayashi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5517
De-Confounded Variational Encoder-Decoder for Logical Table-to-Text Generation

Wenqing Chen, Jidong Tian, Yitian Li, Hao He and Yaohui Jin . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5532
Rethinking Stealthiness of Backdoor Attack against NLP Models

Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou and Xu Sun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5543
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition

Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang and Pengjun Xie . . . . . . . . . . . . . . . . . 5558
Exploring Distantly-Labeled Rationales in Neural Network Models

Quzhe Huang, Shengqi Zhu, Yansong Feng and Dongyan Zhao . . . . . . . . . . . . . . . . . . . . . . . . . . . 5571
Learning to Perturb Word Embeddings for Out-of-distribution QA

Seanie Lee, Minki Kang, Juho Lee and Sung Ju Hwang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5583
Maria: A Visual Experience Powered Conversational Agent

Zujie Liang, Huang Hu, Can Xu, Chongyang Tao, Xiubo Geng, Yining Chen, Fan Liang and Daxin
Jiang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5596
A Human-machine Collaborative Framework for Evaluating Malevolence in Dialogues

Yangjun Zhang, Pengjie Ren and Maarten de Rijke . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5612
Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational
AutoEncoders
Bin Sun, Shaoxiong Feng, Yiwei Li, Jiamou Liu and Kan Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5624
Learning to Ask Conversational Questions by Optimizing Levenshtein Distance

Zhongkun Liu, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Maarten de Rijke and Ming Zhou . . . 5638
DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue

Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard and Satwik
Kottur . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5651
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation
Jingwen Hu, Yuchen Liu, Jinming Zhao and Qin Jin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5666
DynaEval: Unifying Turn and Dialogue Level Evaluation

Chen Zhang, Yiming Chen, Luis Fernando D’Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee
and Haizhou Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5676
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou and Nan
Duan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5690
Rewriter-Evaluator Architecture for Neural Machine Translation

Yangming Li and Kaisheng Yao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5701
Modeling Bilingual Conversational Characteristics for Neural Chat Translation

Yunlong Liang, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou . . . . . . . . . . . . . . . . . . . . . . 5711
Importance-based Neuron Allocation for Multilingual Neural Machine Translation

Wanying Xie, Yang Feng, Shuhao Gu and Dong Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5725
Transfer Learning for Sequence Generation: from Single-source to Multi-source

Xuancheng Huang, Jingfang Xu, Maosong Sun and Yang Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5738
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters

Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen and Hinrich
Schütze . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5751
Coreference Reasoning in Machine Reading Comprehension

Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth and Iryna Gurevych . . . . . . . . . . . . . . . . . . . . . . . 5768
Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing

Liwen Zhang, Ge Wang, Wenjuan Han and Kewei Tu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5782
A Conditional Splitting Framework for Efficient Constituency Parsing

Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li . . . . . . . . . . . . . . . . . . . . . . . . 5795
A Unified Generative Framework for Various NER Subtasks

Hang Yan, Tao Gui, Junqi Dai, Qipeng Guo, Zheng Zhang and Xipeng Qiu . . . . . . . . . . . . . . . . . 5808
An In-depth Study on Internal Structure of Chinese Words

Chen Gong, Saihao Huang, Houquan Zhou, Zhenghua Li, Min Zhang, Zhefeng Wang, Baoxing
Huai and Nicholas Jing Yuan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5823
MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER
Linlin Liu, Bosheng Ding, Lidong Bing, Shafiq Joty, Luo Si and Chunyan Miao . . . . . . . . 5834
Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

Wei Liu, Xiyan Fu, Yue Zhang and Wenming Xiao. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5847
Math Word Problem Solving with Explicit Numerical Values

Qinzhuo Wu, Qi Zhang, Zhongyu Wei and Xuanjing Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5859
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks

Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang and Liang Lin . . . . . . . . . . . . . . . . . . . 5870
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang, Zerui Cai, Chengyu Wang, Minghui Qiu, Bite Yang and Xiaofeng He . . . . . 5882
What is Your Article Based On? Inferring Fine-grained Provenance

Yi Zhang, Zachary Ives and Dan Roth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5894
Cross-modal Memory Networks for Radiology Report Generation

Zhihong Chen, Yaling Shen, Yan Song and Xiang Wan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5904
Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection

Kamil Kanclerz, Alicja Figas, Marcin Gruza, Tomasz Kajdanowicz, Jan Kocon, Daria Puchalska
and Przemyslaw Kazienko . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5915
Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews

Junhao Liu, Zhen Hai, Min Yang and Lidong Bing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5927
Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding

Xin Sun, Tao Ge, Furu Wei and Houfeng Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5937
Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong and Shengping
Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5948
PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check

Li Huang, Junjie Li, Weiwei Jiang, Zhiyu Zhang, Minchuan Chen, Shaojun Wang and Jing Xiao . . . 5958
Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting

Yi Cheng, Siyao Li, Bang Liu, Ruihui Zhao, Sujian Li, Chenghua Lin and Yefeng Zheng . . . . 5968
Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation

Liang Li, Can Ma, Yinliang Yue and Dayong Hu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5979
POS-Constrained Parallel Decoding for Non-autoregressive Generation

Kexin Yang, Wenqiang Lei, Dayiheng Liu, Weizhen Qi and Jiancheng Lv . . . . . . . . . . . . . . . . . . 5990
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

Xin Liu, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Min Zhang, Haiying Zhang and
Jinsong Su . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6001
TGEA: An Error-Annotated Dataset and Benchmark Tasks for Text Generation from Pretrained Language
Models
Jie He, Bo Peng, Yi Liao, Qun Liu and Deyi Xiong . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6012
Long-Span Summarization via Local Attention and Content Selection
Potsawee Manakul and Mark Gales . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6026
RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy

Xiyan Fu, Yating Zhang, Tianyi Wang, Xiaozhong Liu, Changlong Sun and Zhenglu Yang . . . 6042
BASS: Boosting Abstractive Summarization with Unified Semantic Graph

Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu and Haifeng Wang . . . 6052
Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation
Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Xiangliang Zhang, Dongyan Zhao and Rui
Yan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6068
Focus Attention: Promoting Faithfulness and Diversity in Summarization

Rahul Aralikatte, Shashi Narayan, Joshua Maynez, Sascha Rothe and Ryan McDonald . . . . . . 6078
Generating Query Focused Summaries from Query-Free Resources

Yumo Xu and Mirella Lapata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6096
Robustifying Multi-hop QA through Pseudo-Evidentiality Training

Kyungjae Lee, Seung-won Hwang, Sang-eun Han and Dohyeon Lee . . . . . . . . . . . . . . . . . . . . . . . 6110
xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering

Nan Yang, Furu Wei, Binxing Jiao, Daxing Jiang and Linjun Yang . . . . . . . . . . . . . . . . . . . . . . . . 6120
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational
Question Answering
Gangwoo Kim, Hyunjae Kim, Jungsoo Park and Jaewoo Kang . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6130
PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text
Modeling
Xiaoxue Zang, Lijuan Liu, Maria Wang, Yang Song, Hao Zhang and Jindong Chen . . . . . . . . . 6142
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal
Machine Translation
Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li and Ben Kao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6153
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
Ahjeong Seo, Gi-Cheon Kang, Joonhan Park and Byoung-Tak Zhang . . . . . . . . . . . . . . . . . . . . . . 6167
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition
Yinghao Li, Pranav Shetty, Lucas Liu, Chao Zhang and Le Song . . . . . . . . . . . . . . . . . . . . . . . . . . 6178
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction
Tao Chen, Haizhou Shi, Siliang Tang, Zhigang Chen, Fei Wu and Yueting Zhuang . . . . . . . . . . 6191
SENT: Sentence-level Distant Relation Extraction via Negative Training

Ruotian Ma, Tao Gui, Linyang Li, Qi Zhang, Xuanjing Huang and Yaqian Zhou . . . . . . . . . . . . 6201
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and
Normalization
Baohang Zhou, Xiangrui Cai, Ying Zhang and Xiaojie Yuan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6214
PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction
Hengyi Zheng, Rui Wen, Xi Chen, Yifan Yang, Yunyan Zhang, Ziheng Zhang, Ningyu Zhang, Bin
Qin, Xu Ming and Yefeng Zheng . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6225
Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition
Meihan Tong, Shuai Wang, Bin Xu, Yixin Cao, Minghui Liu, Lei Hou and Juanzi Li . . . . . . . . 6236
Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference
Tuan Lai, Heng Ji, ChengXiang Zhai and Quan Hung Tran . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6248
Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract
Meaning Representation
Zixuan Zhang, Nikolaus Parulian, Heng Ji, Ahmed Elsayed, Skatje Myers and Martha Palmer . . . 6261
Unleash GPT-2 Power for Event Detection

Amir Pouran Ben Veyseh, Viet Lai, Franck Dernoncourt and Thien Huu Nguyen . . . . . . . . . . . . 6271
CLEVE: Contrastive Pre-training for Event Extraction

Ziqi Wang, Xiaozhi Wang, Xu Han, Yankai Lin, Lei Hou, Zhiyuan Liu, Peng Li, Juanzi Li and Jie
Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6283
Document-level Event Extraction via Parallel Prediction Networks

Hang Yang, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao and Taifeng Wang . . . . . . . . . . . . . . . 6298
StructuralLM: Structural Pre-training for Form Understanding

Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang and Luo Si . . . . . . . 6309
Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis

Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang and Eduard Hovy . . . . . . 6319
Multi-Label Few-Shot Learning for Aspect Category Detection

Mengting Hu, Shiwan Zhao, Honglei Guo, Chao Xue, Hang Gao, Tiegang Gao, Renhong Cheng and
Zhong Su . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6330
Argument Pair Extraction via Attention-guided Multi-Layer Multi-Cross Encoding

Liying Cheng, Tianyu Wu, Lidong Bing and Luo Si . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6341
A Neural Transition-based Model for Argumentation Mining

Jianzhu Bao, Chuang Fan, Jipeng Wu, Yixue Dang, Jiachen Du and Ruifeng Xu . . . . . . . . . . . . 6354
Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text

Philippe Laban, Tobias Schnabel, Paul Bennett and Marti A. Hearst . . . . . . . . . . . . . . . . . . . . . . . 6365
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence

Jian Guan, Xiaoxi Mao, Changjie Fan, Zitao Liu, Wenbiao Ding and Minlie Huang . . . . . . . . . . 6379
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics

Jian Guan, Zhexin Zhang, Zhuoer Feng, Zitao Liu, Wenbiao Ding, Xiaoxi Mao, Changjie Fan and
Minlie Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6394
DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation
Xinyu Hua, Ashwin Sreevatsa and Lu Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6408
Controllable Open-ended Question Generation with A New Question Type Ontology

Shuyang Cao and Lu Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6424
BERTGen: Multi-task Generation through BERT
Faidon Mitzalis, Ozan Caglayan, Pranava Madhyastha and Lucia Specia . . . . . . . . . . . . . . . . . . . 6440
Selective Knowledge Distillation for Neural Machine Translation

Fusheng Wang, Jianhao Yan, Fandong Meng and Jie Zhou . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6456
Measuring and Increasing Context Usage in Context-Aware Machine Translation

Patrick Fernandes, Kayo Yin, Graham Neubig and André F. T. Martins . . . . . . . . . . . . . . . . . . . . 6467
Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring
Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka and Eneko Agirre . . . . . . . . . . . . . 6479
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web

Holger Schwenk, Guillaume Wenzek, Sergey Edunov, Edouard Grave, Armand Joulin and Angela
Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6490
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim and Kyunghyun Cho . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6501
GhostBERT: Generate More Features with Cheap Operations for BERT

Zhiqi Huang, Lu Hou, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu . . . . . . . . . . . . . . . . . . . 6512
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang, Simiao Zuo, Minshuo Chen, Haoming Jiang, Xiaodong Liu, Pengcheng He, Tuo Zhao
and Weizhu Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6524
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations

Pierre Colombo, Pablo Piantanida and Chloé Clavel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6539
Determinantal Beam Search

Clara Meister, Martina Forster and Ryan Cotterell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6551
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning
Shuoran Jiang, Qingcai Chen, Xin Liu, Baotian Hu and Lisai Zhang . . . . . . . . . . . . . . . . . . . . . . . 6563
Accelerating Text Communication via Abbreviated Sentence Input

Jiban Adhikary, Jamie Berger and Keith Vertanen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6574
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model
Updates
Yuqing Xie, Yi-An Lai, Yuanjun Xiong, Yi Zhang and Stefano Soatto . . . . . . . . . . . . . . . . . . . 6589
Detecting Propaganda Techniques in Memes

Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz,
Preslav Nakov and Giovanni Da San Martino . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6603
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale
Randomized Study
Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton and Wen-tau Yih . . . . . . . . . . . . . . . . . . . . . 6618
Learning Dense Representations of Phrases at Scale

Jinhyuk Lee, Mujeen Sung, Jaewoo Kang and Danqi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6634
End-to-End Training of Neural Retrievers for Open-Domain Question Answering

Devendra Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton
and Bryan Catanzaro . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6648
Question Answering Over Temporal Knowledge Graphs
Apoorv Saxena, Soumen Chakrabarti and Partha Talukdar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6663
Language Model Augmented Relevance Score

Ruibo Liu, Jason Wei and Soroush Vosoughi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6677
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts

Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah A. Smith
and Yejin Choi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6691
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer and Daniel Weld . . . . . . . . . . . . . . . . . . . . . . 6707
Metaphor Generation with Conceptual Mappings

Kevin Stowe, Tuhin Chakrabarty, Nanyun Peng, Smaranda Muresan and Iryna Gurevych . . . . 6724
Learning Latent Structures for Cross Action Phrase Relations in Wet Lab Protocols
Chaitanya Kulkarni, Jany Chan, Eric Fosler-Lussier and Raghu Machiraju . . . . . . . . . . . . . . . . . 6737
Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural
Baselines
Ramit Sawhney, Mihir Goyal, Prakhar Goel, Puneet Mathur and Rajiv Ratn Shah . . . . . . . . . . . 6751
Mid-Air Hand Gestures for Post-Editing of Machine Translation

Rashad Albo Jamara, Nico Herbig, Antonio Krüger and Josef van Genabith . . . . . . . . . . . . . . . . 6763
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning
Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang and Song-Chun Zhu . . . 6774
Joint Verification and Reranking for Open Fact Checking Over Tables
Michael Sejr Schlichtkrull, Vladimir Karpukhin, Barlas Oguz, Mike Lewis, Wen-tau Yih and
Sebastian Riedel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6787
Evaluation of Thematic Coherence in Microblogs

Iman Munire Bilal, Bo Wang, Maria Liakata, Rob Procter and Adam Tsakalidis . . . . . . . . . . . . 6800
Neural semi-Markov CRF for Monolingual Word Alignment

Wuwei Lan, Chao Jiang and Wei Xu. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6815
Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies

Mukund Srinath, Shomir Wilson and C Lee Giles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6829
The statistical advantage of automatic NLG metrics at the system level

Johnny Wei and Robin Jia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6840
Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph Completion
Yixin Cao, Xiang Ji, Xin Lv, Juanzi Li, Yonggang Wen and Hanwang Zhang . . . . . . . . . . . . . . . 6855
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with
Argument Mining
Alexander Fabbri, Faiaz Rahman, Imad Rizvi, Borui Wang, Haoran Li, Yashar Mehdad and Dragomir
Radev . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6866
Improving Factual Consistency of Abstractive Summarization via Question Answering
Feng Nan, Cicero Nogueira dos Santos, Henghui Zhu, Patrick Ng, Kathleen McKeown, Ramesh
Nallapati, Dejiao Zhang, Zhiguo Wang, Andrew O. Arnold and Bing Xiang . . . . . . . . . . . . . . . . . . . . . 6881
EmailSum: Abstractive Email Thread Summarization

Shiyue Zhang, Asli Celikyilmaz, Jianfeng Gao and Mohit Bansal . . . . . . . . . . . . . . . . . . . . . . . . . . 6895
Cross-Lingual Abstractive Summarization with Limited Parallel Resources

Yu Bai, Yang Gao and Heyan Huang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6910
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution
Jiacheng Xu and Greg Durrett . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6925
Learning Prototypical Functions for Physical Artifacts

Tianyu Jiang and Ellen Riloff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6941
Verb Knowledge Injection for Multilingual Event Processing

Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo Maria Ponti and Anna Korhonen . . . . . . . 6952
Dynamic Contextualized Word Embeddings

Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6970
Lexical Semantic Change Discovery

Sinan Kurtyigit, Maike Park, Dominik Schlechtweg, Jonas Kuhn and Sabine Schulte im Walde . . . 6985
The R-U-A-Robot Dataset: Helping Avoid Chatbot Deception by Detecting User Questions About Human or Non-Human Identity
David Gros, Yu Li and Zhou Yu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6999
Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems
Claudio Pinhanez, Paulo Cavalin, Victor Henrique Alves Ribeiro, Ana Appel, Heloisa Candello,
Julio Nogima, Mauro Pichiliani, Melina Guerra, Maira de Bayser, Gabriel Malfatti and Henrique Ferreira . . . 7014
Space Efficient Context Encoding for Non-Task-Oriented Dialogue Generation with Graph Attention
Transformer
Fabian Galetzka, Jewgeni Rose, David Schlangen and Jens Lehmann . . . . . . . . . . . . . . . . . . . . . . 7028
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations

Dou Hu, Lingwei Wei and Xiaoyong Huai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7042
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability

Ka Wong, Praveen Paritosh and Lora Aroyo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7053
TIMEDIAL: Temporal Commonsense Reasoning in Dialog

Lianhui Qin, Aditya Gupta, Shyam Upadhyay, Luheng He, Yejin Choi and Manaal Faruqui . . 7066
RAW-C: Relatedness of Ambiguous Words in Context (A New Lexical Resource for English)
Sean Trott and Benjamin Bergen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7077
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic

Muhammad Abdul-Mageed, AbdelRahim Elmadany and El Moatez Billah Nagoudi . . . . . . . . . 7088
Improving Paraphrase Detection with the Adversarial Paraphrasing Task

Animesh Nighojkar and John Licato . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7106
ADEPT: An Adjective-Dependent Plausibility Task
Ali Emami, Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler and Jackie Chi Kit
Cheung . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7117
ReadOnce Transformers: Reusable Representations of Text for Transformers

Shih-Ting Lin, Ashish Sabharwal and Tushar Khot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7129
Conditional Generation of Temporally-ordered Event Sequences

Shih-Ting Lin, Nathanael Chambers and Greg Durrett . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7142
Hate Speech Detection Based on Sentiment Knowledge Sharing

Xianbing Zhou, Yang Yong, Xiaochao Fan, Ge Ren, Yunfeng Song, Yufeng Diao, Liang Yang and
Hongfei Lin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7158
Transition-based Bubble Parsing: Improvements on Coordination Structure Prediction

Tianze Shi and Lillian Lee . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7167
SpanNER: Named Entity Re-/Recognition as Span Prediction

Jinlan Fu, Xuanjing Huang and Pengfei Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7183
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked
Language Modeling
Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler and Aaron Courville . . . . . . . . . 7196
Language Embeddings for Typology and Cross-lingual Transfer Learning

Dian Yu, Taiqi He and Kenji Sagae . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7210
Can Sequence-to-Sequence Models Crack Substitution Ciphers?

Nada Aldarrab and Jonathan May . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7226
Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation
Eleftheria Briakou and Marine Carpuat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7236
Discriminative Reranking for Neural Machine Translation

Ann Lee, Michael Auli and Marc’Aurelio Ranzato . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7250
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question
Answering
Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei and Christopher Manning . . . . . . . . . . . . . . . 7265
All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text
Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan and Noah A.
Smith . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7282
Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers

Benjamin Marie, Atsushi Fujita and Raphael Rubino . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7297
Neural Machine Translation with Monolingual Translation Memory

Deng Cai, Yan Wang, Huayang Li, Wai Lam and Lemao Liu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7307
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

Armen Aghajanyan, Sonal Gupta and Luke Zettlemoyer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7319
UnNatural Language Inference

Koustuv Sinha, Prasanna Parthasarathi, Joelle Pineau and Adina Williams . . . . . . . . . . . . . . . . . . 7329
Including Signed Languages in Natural Language Processing
Kayo Yin, Amit Moryossef, Julie Hochgesang, Yoav Goldberg and Malihe Alikhani . . . . . . . . 7347
Vocabulary Learning via Optimal Transport for Neural Machine Translation

Jingjing Xu, Hao Zhou, Chun Gan, Zaixiang Zheng and Lei Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7361
Conference Program
Monday, August 2, 2021 (all times UTC+0)
08:15–08:35 Opening Session
08:40–09:00 Presidential Address
09:00–10:00 Keynote 1. Helen Meng: Advancing Technological Equity in Speech and
Language Processing
Session 1A: Computational Social Science and Cultural Analytics 1
10:00–10:10 Investigating label suggestions for opinion mining in German Covid-19 social
media
Tilman Beck, Ji-Ung Lee, Christina Viehmann, Marcus Maurer, Oliver Quiring and
Iryna Gurevych
10:10–10:20 How Did This Get Funded?! Automatically Identifying Quirky Scientific
Achievements
Chen Shani, Nadav Borenstein and Dafna Shahaf
10:20–10:30 Engage the Public: Poll Question Generation for Social Media Posts
Zexin Lu, Keyang Ding, Yuji Zhang, Jing Li, Baolin Peng and Lemao Liu
10:30–10:40 HateCheck: Functional Tests for Hate Speech Detection Models

Paul Röttger, Bertie Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts and
Janet Pierrehumbert
10:40–10:50 Unified Dual-view Cognitive Model for Interpretable Claim Verification

Lianwei Wu, Yuan Rao, Yuqian Lan, Ling Sun and Zhaoyin Qi
10:50–10:57 Catchphrase: Automatic Detection of Cultural References

Nir Sweed and Dafna Shahaf
Session 1B: Language Generation 1
10:00–10:10 DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling
Lanqing Xue, Kaitao Song, Duocai Wu, Xu Tan, Nevin L. Zhang, Tao Qin,
Wei-Qiang Zhang and Tie-Yan Liu
10:10–10:20 PENS: A Dataset and Generic Framework for Personalized News Headline
Generation
Xiang Ao, Xiting Wang, Ling Luo, Ying Qiao, Qing He and Xing Xie
10:20–10:30 Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and
Conditional Layer Normalization
Dongkyu Lee, Zhiliang Tian, Lanqing Xue and Nevin L. Zhang
10:30–10:40 Mention Flags (MF): Constraining Transformer-based Text Generators

Yufei Wang, Ian Wood, Stephen Wan, Mark Dras and Mark Johnson
10:40–10:50 Generalising Multilingual Concept-to-Text NLG with Language Agnostic
Delexicalisation
Giulio Zhou and Gerasimos Lampouras
10:50–10:57 On Training Instance Selection for Few-Shot Neural Text Generation

Ernie Chang, Xiaoyu Shen, Hui-Syuan Yeh and Vera Demberg
Session 1C: Dialog and Interactive Systems 1
10:00–10:10 Conversations Are Not Flat: Modeling the Dynamic Information Flow across
Dialogue Utterances
Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng and Jie Zhou
10:10–10:20 Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo, Kai Shuang, Jijie Li and Zihan Wang
10:20–10:30 Transferable Dialogue Systems and User Simulators

Bo-Hsiang Tseng, Yinpei Dai, Florian Kreyssig and Bill Byrne
10:30–10:40 BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited
Personalized Data
Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang and Ting Liu
10:40–10:50 GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent
Detection and Slot Filling
Libo Qin, Fuxuan Wei, Tianbao Xie, Xiao Xu, Wanxiang Che and Ting Liu
10:50–10:57 Coreference Resolution without Span Representations

Yuval Kirstain, Ori Ram and Omer Levy
Session 1D: Information Extraction 1
10:00–10:10 Accelerating BERT Inference for Sequence Labeling via Early-Exit

Xiaonan Li, Yunfan Shao, Tianxiang Sun, Hang Yan, Xipeng Qiu and Xuanjing
Huang
10:10–10:20 Modularized Interaction Network for Named Entity Recognition

Fei Li, Zheng Wang, Siu Cheung Hui, Lejian Liao, Dandan Song, Jing Xu, Guoxiu
He and meihuizi jia
10:20–10:30 Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent
Decoder
Xi Xiangyu, Wei Ye, Shikun Zhang, Quanxiu Wang, Huixing Jiang and Wei Wu
10:30–10:40 UniRE: A Unified Label Space for Entity Relation Extraction

Yijun Wang, Changzhi Sun, Yuanbin Wu, Hao Zhou, Lei Li and Junchi Yan
10:40–10:50 Refining Sample Embeddings with Relation Prototypes to Enhance Continual
Relation Extraction
Li Cui, Deqing Yang, Jiaxin Yu, Chengwei Hu, Jiayang Cheng, Jingjie Yi and
Yanghua Xiao
10:50–10:57 Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition
Chun Chen and Fang Kong
Session 1E: Machine Translation and Multilinguality 1
10:00–10:10 Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Xiao Pan, Mingxuan Wang, Liwei Wu and Lei Li
10:10–10:20 Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine
Translation
Mathias Müller and Rico Sennrich
10:20–10:30 Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation
Hongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong and Meng Zhang
10:30–10:40 A Bidirectional Transformer Based Alignment Model for Unsupervised Word
Alignment
Jingyi Zhang and Josef van Genabith
10:40–10:50 Learning Language Specific Sub-network for Multilingual Machine Translation

Zehui Lin, Liwei Wu, Mingxuan Wang and Lei Li
10:50–10:57 Difficulty-Aware Machine Translation Evaluation

Runzhe Zhan, Xuebo Liu, Derek F. Wong and Lidia S. Chao
Session 2A: Sentiment Analysis, Stylistic Analysis, and Argument Mining 1
11:00–11:10 Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment
Analysis
Linyi Yang, Jiazheng Li, Padraig Cunningham, Yue Zhang, Barry Smyth and Ruihai
Dong
11:10–11:20 Bridge-Based Active Domain Adaptation for Aspect Term Extraction

Zhuang Chen and Tieyun Qian
11:20–11:30 Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks

Xiaocui Yang, Shi Feng, Yifei Zhang and Daling Wang
11:30–11:40 Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects
and Opinions
Hongjie Cai, Rui Xia and Jianfei Yu
11:40–11:47 Uncertainty and Surprisal Jointly Deliver the Punchline: Exploiting
Incongruity-Based Features for Humor Recognition
Yubo Xie, Junze Li and Pearl Pu
11:47–11:54 Counterfactuals to Control Latent Disentangled Text Representations for Style
Transfer
Sharmila Reddy Nangi, Niyati Chhaya, Sopan Khosla, Nikhil Kaushik and Harshit
Nyati
Session 2B: Summarization 1
11:00–11:10 PASS: Perturb-and-Select Summarizer for Product Reviews

Nadav Oved and Ran Levy
11:10–11:20 Deep Differential Amplifier for Extractive Summarization

Ruipeng Jia, Yanan Cao, Fang Fang, Yuchen Zhou, Zheng Fang, Yanbing Liu and
Shi Wang
11:20–11:30 Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by
Generating Multiple Summaries
Yi Yu, Adam Jatowt, Antoine Doucet, Kazunari Sugiyama and Masatoshi
Yoshikawa
11:30–11:40 Self-Supervised Multimodal Opinion Summarization

Jinbae Im, Moonki Kim, Hoyeop Lee, Hyunsouk Cho and Sehee Chung
11:40–11:50 A Training-free and Reference-free Summarization Evaluation Metric via
Centrality-weighted Relevance and Self-referenced Redundancy
Wang Chen, Piji Li and Irwin King
11:50–12:00 DESCGEN: A Distantly Supervised Dataset for Generating Entity Descriptions

Weijia Shi, Mandar Joshi and Luke Zettlemoyer
Session 2C: Interpretability and Analysis of Models for NLP 1
11:00–11:10 Introducing Orthogonal Constraint in Structural Probes

Tomasz Limisiewicz and David Mareček
11:10–11:20 Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger
Fanchao Qi, Mukai Li, Yangyi Chen, Zhengyan Zhang, Zhiyuan Liu, Yasheng Wang
and Maosong Sun
11:20–11:30 Examining the Inductive Bias of Neural Language Models with Artificial Languages
Jennifer C. White and Ryan Cotterell
11:30–11:40 Explaining Contextualization in Language Models using Visual Analytics

Rita Sevastjanova, Aikaterini-Lida Kalouli, Christin Beck, Hanna Schäfer and
Mennatallah El-Assady
11:40–11:50 Improving the Faithfulness of Attention-based Explanations with Task-specific
Information for Text Classification
George Chrysostomou and Nikolaos Aletras
11:50–11:57 Attention Flows are Shapley Value Explanations

Kawin Ethayarajh and Dan Jurafsky
Session 2D: Language Grounding to Vision, Robotics and Beyond 1
11:00–11:10 Generating Landmark Navigation Instructions from Maps as a Graph-to-Text
Problem
Raphael Schumann and Stefan Riezler
11:10–11:20 E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao
and Fei Huang
11:20–11:30 Learning Relation Alignment for Calibrated Cross-modal Retrieval

Shuhuai Ren, Junyang Lin, Guangxiang Zhao, Rui Men, An Yang, Jingren Zhou,
Xu Sun and Hongxia Yang
11:30–11:40 KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense
Generation
Yiran Xing, Zai Shi, Zhao Meng, Gerhard Lakemeyer, Yunpu Ma and Roger
Wattenhofer
11:40–11:47 Video Paragraph Captioning as a Text Summarization Task

Hui Liu and Xiaojun Wan
11:47–11:54 Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused
Interventions
Daniel Rosenberg, Itai Gat, Amir Feder and Roi Reichart
Session 2E: Machine Learning for NLP 1
11:00–11:10 Cascaded Head-colliding Attention

Lin Zheng, Zhiyong Wu and Lingpeng Kong
11:10–11:20 Structural Knowledge Distillation: Tractably Distilling Information for Structured
Predictor
Xinyu Wang, Yong Jiang, Zhaohui Yan, Zixia Jia, Nguyen Bach, Tao Wang,
Zhongqiang Huang, Fei Huang and Kewei Tu
11:20–11:30 Parameter-efficient Multi-task Fine-tuning for Transformers via Shared
Hypernetworks
Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani and James
Henderson
11:30–11:40 COSY: COunterfactual SYntax for Cross-Lingual Understanding

SICHENG YU, Hao Zhang, Yulei Niu, Qianru Sun and Jing Jiang
11:40–11:50 OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text
Classification
Seonghyeon Lee, Dongha Lee and Hwanjo Yu
11:50–11:57 How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation?

Sayan Ghosh, Zheng Qi, Snigdha Chaturvedi and Shashank Srivastava
14:00–14:10 Understanding and Countering Stereotypes: A Computational Approach to the
Stereotype Content Model
Kathleen C. Fraser, Isar Nejadgholi and Svetlana Kiritchenko
14:10–14:20 Structurizing Misinformation Stories via Rationalizing Fact-Checks

Shan Jiang and Christo Wilson
14:20–14:30 Modeling Language Usage and Listener Engagement in Podcasts

Sravana Reddy, Mariya Lazarova, Yongze Yu and Rosie Jones
14:30–14:40 Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions
Saumya Sahai, Oana Balalau and Roxana Horincar
14:40–14:50 SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues
Liang Qiu, Yuan Liang, Yizhou Zhao, Pan Lu, Baolin Peng, Zhou Yu, Ying Nian
Wu and Song-Chun Zhu
14:50–14:57 Automatic Fake News Detection: Are Models Learning to Reason?

Casper Hansen, Christian Hansen and Lucas Chaves Lima
Session 3B: Dialog and Interactive Systems 2
14:00–14:10 TicketTalk: Toward human-level performance with end-to-end, transaction-based
dialog systems
Bill Byrne, Karthik Krishnamoorthi, Saravanan Ganesh and Mihir Kale
14:10–14:20 Improving Dialog Systems for Negotiation with Personality Modeling

Runzhe Yang, Jingxiao Chen and Karthik Narasimhan
14:20–14:30 Learning from Perturbations: Diverse and Informative Dialogue Generation with
Inverse Adversarial Training
Wangchunshu Zhou, Qifei LI and Chenle Li
14:30–14:40 Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable
Features
Hannah Rashkin, David Reitter, Gaurav Singh Tomar and Dipanjan Das
14:40–14:47 Saying No is An Art: Contextualized Fallback Responses for Unanswerable
Dialogue Queries
Ashish Shrivastava, Kaustubh Dhole, Abhinav Bhatt and Sharvani Raghunath
14:47–14:54 N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR
Hypotheses
Karthik Ganesan, Pakhi Bamdev, Jaivarsan B, Amresh Venugopal and Abhinav
Tushar
Session 3C: Information Extraction 2
14:00–14:10 CitationIE: Leveraging the Citation Graph for Scientific Information Extraction
Vijay Viswanathan, Graham Neubig and Pengfei Liu
14:10–14:20 From Discourse to Narrative: Knowledge Projection for Event Relation Extraction
Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun, Weijian
Xie and Jin Xu
14:20–14:30 AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator
for Cross-Lingual NER
Weile Chen, Huiqiang Jiang, Qianhui Wu, Börje Karlsson and Yi Guan
14:30–14:40 Compare to The Knowledge: Graph Neural Fake News Detection with External
Knowledge
Linmei Hu, Tianchi Yang, Luhao Zhang, Wanjun Zhong, Duyu Tang, Chuan Shi,
Nan Duan and Ming Zhou
14:40–14:50 Discontinuous Named Entity Recognition as Maximal Clique Discovery

Yucheng Wang, Bowen Yu, Hongsong Zhu, Tingwen Liu, Nan Yu and Limin Sun
14:50–15:00 LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking

Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj
Sen, Yunyao Li and Alexander Gray
Session 3D: Machine Translation and Multilinguality 2
14:00–14:10 Do Context-Aware Translation Models Pay the Right Attention?

Kayo Yin, Patrick Fernandes, Danish Pruthi, Aditi Chaudhary, André F. T. Martins
and Graham Neubig
14:10–14:20 Adapting High-resource NMT Models to Translate Low-resource Related
Languages without Parallel Data
Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary,
Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn and Mona Diab
14:20–14:30 Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word
Alignment
Haoyue Shi, Luke Zettlemoyer and Sida I. Wang
14:30–14:40 Multilingual Speech Translation from Efficient Finetuning of Pretrained Models

Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Pino, Alexei
Baevski, Alexis Conneau and Michael Auli
14:40–14:47 Gender bias amplification during Speed-Quality optimization in Neural Machine
Translation
Adithya Renduchintala, Denise Diaz, Kenneth Heafield, Xian Li and Mona Diab
14:47–14:54 Machine Translation into Low-resource Language Varieties

Sachin Kumar, Antonios Anastasopoulos, Shuly Wintner and Yulia Tsvetkov
Session 3E: Interpretability and Analysis of Models for NLP 2
14:00–14:10 Learning Faithful Representations of Causal Graphs

Ananth Balashankar and Lakshminarayanan Subramanian
14:10–14:20 What Context Features Can Transformer Language Models Use?

Joe O’Connor and Jacob Andreas
14:20–14:30 Integrated Directional Gradients: Feature Interaction Attribution for Neural NLP
Models
Sandipan Sikdar, Parantapa Bhattacharya and Kieran Heese
14:30–14:37 Is Sparse Attention more Interpretable?

Clara Meister, Stefan Lazov, Isabelle Augenstein and Ryan Cotterell
14:37–14:44 The Case for Translation-Invariant Self-Attention in Transformer-Based Language
Models
Ulme Wennberg and Gustav Eje Henter
14:44–14:51 Relative Importance in Sentence Processing

Nora Hollenstein and Lisa Beinborn
Poster 1A: Semantics: Sentence-level Semantics, Textual Inference and Other
areas
15:00–17:00 DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

John Giorgi, Osvald Nitski, Bo Wang and Gary Bader
15:00–17:00 Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal
Reasoning Models
Mingyue Han and Yinglin Wang
15:00–17:00 XLPT-AMR: Cross-Lingual Pre-Training via Multi-Task Learning for Zero-Shot
AMR Parsing and Text Generation
Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang and Guodong Zhou
15:00–17:00 Span-based Semantic Parsing for Compositional Generalization

Jonathan Herzig and Jonathan Berant
15:00–17:00 AND does not mean OR: Using Formal Languages to Study Language Models’
Representations
Aaron Traylor, Roman Feiman and Ellie Pavlick
15:00–17:00 Enforcing Consistency in Weakly Supervised Semantic Parsing

Nitish Gupta, Sameer Singh and Matt Gardner
15:00–17:00 Compositional Generalization and Natural Language Variation: Can a Semantic
Parsing Approach Handle Both?
Peter Shaw, Ming-Wei Chang, Panupong Pasupat and Kristina Toutanova
Poster 1B: Linguistic Theories, Cognitive Modeling and Psycholinguistics
15:00–17:00 A Targeted Assessment of Incremental Processing in Neural Language Models and
Humans
Ethan Wilcox, Pranali Vani and Roger Levy
Poster 1C: Semantics: Lexical Semantics
15:00–17:00 The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for
Language Processing
Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner and Reut Tsarfaty
Poster 1D: Phonology, Morphology and Word Segmentation
15:00–17:00 To POS Tag or Not to POS Tag: The Impact of POS Tags on Morphological
Learning in Low-Resource Settings
Sarah Moeller, Ling Liu and Mans Hulden
Poster 1E: Speech and Multimodality
15:00–17:00 Prosodic segmentation for parsing spoken dialogue

Elizabeth Nielsen, Mark Steedman and Sharon Goldwater
15:00–17:00 VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
Learning, Semi-Supervised Learning and Interpretation
Changhan Wang, Morgane Riviere, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel
Haziza, Mary Williamson, Juan Pino and Emmanuel Dupoux
15:00–17:00 An Improved Model for Voicing Silent Speech

David Gaddy and Dan Klein
Poster 1F: Ethics in NLP
15:00–17:00 What’s in the Box? An Analysis of Undesirable Content in the Common Crawl
Corpus
Alexandra Luccioni and Joseph Viviano
15:00–17:00 Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark
Datasets
Su Lin Blodgett, Gilsinia Lopez, Alexandra Olteanu, Robert Sim and Hanna
Wallach
Poster 1G: Information Retrieval and Text Mining
15:00–17:00 Robust Knowledge Graph Completion with Stacked Convolutions and a Student
Re-Ranking Network
Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman and
Carolyn Rosé
15:00–17:00 A DQN-based Approach to Finding Precise Evidences for Fact Verification

Hai Wan, Haicheng Chen, Jianfeng Du, Weilin Luo and Rongzhen Ye
Poster 1H: Machine Learning for NLP
15:00–17:00 The Art of Abstention: Selective Prediction and Error Regularization for Natural
Language Processing
Ji Xin, Raphael Tang, Yaoliang Yu and Jimmy Lin
15:00–17:00 Unsupervised Out-of-Domain Detection via Pre-trained Transformers

Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng and Caiming Xiong
15:00–17:00 Continual Quality Estimation with Online Bayesian Meta-Learning

Abiola Obamuyide, Marina Fomicheva and Lucia Specia
15:00–17:00 MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation

Ahmad Rashid, Vasileios Lioutas and Mehdi Rezagholizadeh
15:00–17:00 Selecting Informative Contexts Improves Language Model Fine-tuning

Richard Antonello, Nicole Beckage, Javier Turek and Alexander Huth
15:00–17:00 Explainable Prediction of Text Complexity: The Missing Preliminaries for Text
Simplification
Cristina Garbacea, Mengtian Guo, Samuel Carton and Qiaozhu Mei
15:00–17:00 Multi-Task Retrieval for Knowledge-Intensive Tasks

Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oguz,
Veselin Stoyanov and Gargi Ghosh
Poster 1I: Interpretability and Analysis of Models for NLP
15:00–17:00 When Do You Need Billions of Words of Pretraining Data?

Yian Zhang, Alex Warstadt, Xiaocheng Li and Samuel R. Bowman
15:00–17:00 Analyzing the Source and Target Contributions to Predictions in Neural Machine
Translation
Elena Voita, Rico Sennrich and Ivan Titov
15:00–17:00 Comparing Test Sets with Item Response Theory

Clara Vania, Phu Mon Htut, William Huang, Dhara Mungra, Richard Yuanzhe Pang,
Jason Phang, Haokun Liu, Kyunghyun Cho and Samuel R. Bowman
15:00–17:00 Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning

Forrest Davis and Marten van Schijndel
15:00–17:00 More Identifiable yet Equally Performant Transformers for Text Classification
Rishabh Bhardwaj, Navonil Majumder, Soujanya Poria and Eduard Hovy
Poster 1J: Dialog and Interactive Systems
15:00–17:00 AugNLG: Few-shot Natural Language Generation using Self-trained Data
Augmentation
Xinnuo Xu, Guoyin Wang, Young-Bum Kim and Sungjin Lee
15:00–17:00 A Span-based Dynamic Local Attention Model for Sequential Sentence
Classification
Xichen Shang, Qianli Ma, Zhenxi Lin, Jiangyue Yan and Zipeng Chen
Poster 1K: Resources and Evaluation
15:00–17:00 How effective is BERT without word ordering? Implications for language
understanding and data privacy
Jack Hessel and Alexandra Schofield
15:00–17:00 Can vectors read minds better than experts? Comparing data augmentation
strategies for the automated scoring of children’s mindreading ability
Venelin Kovatchev, Phillip Smith, Mark Lee and Rory Devine
15:00–17:00 A Dataset and Baselines for Multilingual Reply Suggestion

Mozhi Zhang, Wei Wang, Budhaditya Deb, Guoqing Zheng, Milad Shokouhi and
Ahmed Hassan Awadallah
15:00–17:00 WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

Nachshon Cohen, Oren Kalinsky, Yftah Ziser and Alessandro Moschitti
15:00–17:00 What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU
Data Collection Tasks?
Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania and
Samuel R. Bowman
15:00–17:00 UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning
Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Trung Bui and Kyomin Jung
15:00–17:00 Neural OCR Post-Hoc Correction of Historical Corpora

Lijun Lyu, Maria Koutraki, Martin Krikl and Besnik Fetahu
Poster 1L: Computational Social Science and Cultural Analytics
15:00–17:00 Align Voting Behavior with Public Statements for Legislator Representation
Learning
Xinyi Mou, Zhongyu Wei, Lei Chen, Shangyi Ning, Yancheng He, Changjian Jiang
and Xuanjing Huang
15:00–17:00 Measure and Evaluation of Semantic Divergence across Two Languages

Syrielle Montariol and Alexandre Allauzen
Poster 1M: Machine Translation and Multilinguality
15:00–17:00 Improving Zero-Shot Translation by Disentangling Positional Information

Danni Liu, Jan Niehues, James Cross, Francisco Guzmán and Xian Li
15:00–17:00 Common Sense Beyond English: Evaluating and Improving Multilingual Language
Models for Commonsense Reasoning
Bill Yuchen Lin, Seyeon Lee, Xiaoyang Qiao and Xiang Ren
15:00–17:00 Attention Calibration for Transformer in Neural Machine Translation

Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu and Mu Li
15:00–17:00 Anchor-based Bilingual Word Embeddings for Low-Resource Languages

Tobias Eder, Viktor Hangya and Alexander Fraser
15:00–17:00 Diverse Pretrained Context Encodings Improve Document Translation

Domenic Donato, Lei Yu and Chris Dyer
15:00–17:00 Multilingual Agreement for Multilingual Neural Machine Translation

Jian Yang, Yuwei Yin, Shuming Ma, Haoyang Huang, Dongdong Zhang, Zhoujun
Li and Furu Wei
15:00–17:00 Exploiting Language Relatedness for Low Web-Resource Language Model
Adaptation: An Indic Languages Study
Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha
Talukdar and Sunita Sarawagi
Poster 1N: Syntax: Tagging, Chunking, and Parsing
15:00–17:00 On Finding the K-best Non-projective Dependency Trees

Ran Zmigrod, Tim Vieira and Ryan Cotterell
15:00–17:00 Higher-order Derivatives of Weighted Finite-state Machines

Ran Zmigrod, Tim Vieira and Ryan Cotterell
Poster 1O: Theme
15:00–17:00 Towards Argument Mining for Social Good: A Survey

Eva Maria Vecchi, Neele Falk, Iman Jundi and Gabriella Lapesa
15:00–17:00 Automated Generation of Storytelling Vocabulary from Photographs for use in AAC
Mauricio Fontana de Vargas and Karyn Moffatt
Poster 1P: NLP Applications
15:00–17:00 CLIP: A Dataset for Extracting Action Items for Physicians from Hospital
Discharge Notes
James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz,
Greg McKelvey, Hui Dai, Yi Yang and David Sontag
15:00–17:00 Assessing Emoji Use in Modern Text Processing Tools

Abu Awal Md Shoeb and Gerard de Melo
15:00–17:00 Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise
Coverage Attention
Wasi Ahmad, Xiao Bai, Soomin Lee and Kai-Wei Chang
Poster 1Q: Language Generation
15:00–17:00 Factorising Meaning and Form for Intent-Preserving Paraphrasing

Tom Hosking and Mirella Lapata
15:00–17:00 AggGen: Ordering and Aggregating while Generating

Xinnuo Xu, Ondřej Dušek, Verena Rieser and Ioannis Konstas
15:00–17:00 Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf
Language Models
Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena D. Hwang and
Yejin Choi
15:00–17:00 Towards Table-to-Text Generation with Numerical Reasoning

Lya Hulliyyatus Suadaa, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
and Hiroya Takamura
15:00–17:00 Data-to-text Generation with Macro Planning

Ratish Puduppully and Mirella Lapata
Poster 1R: Summarization
15:00–17:00 BACO: A Background Knowledge- and Content-Based Framework for Citing
Sentence Generation
Yubin Ge, Ly Dinh, Xiaofeng Liu, Jinsong Su, Ziyao Lu, Ante Wang and Jana
Diesner
15:00–17:00 Language Model as an Annotator: Exploring DialoGPT for Dialogue
Summarization
Xiachong Feng, Xiaocheng Feng, Libo Qin, Bing Qin and Ting Liu
15:00–17:00 Reinforcement Learning for Abstractive Question Summarization with
Question-aware Semantic Rewards
Shweta Yadav, Deepak Gupta, Asma Ben Abacha and Dina Demner-Fushman
Poster 1S: Question Answering
15:00–17:00 Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph
Retrieval
Akari Asai and Eunsol Choi
15:00–17:00 A Semantics-aware Transformer Model of Relation Linking for Knowledge Base
Question Answering
Tahira Naseem, Srinivas Ravishankar, Nandana Mihindukulasooriya, Ibrahim
Abdelaziz, Young-Suk Lee, Pavan Kapanipathi, Salim Roukos, Alfio Gliozzo and
Alexander Gray
15:00–17:00 A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question
Understanding
Khalil Mrini, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter Chang,
Emilia Farcas and Ndapa Nakashole
15:00–17:00 Neural Retrieval for Question Answering with Cross-Attention Supervised Data
Augmentation
Yinfei Yang, Ning Jin, Kuo Lin, Mandy Guo and Daniel Cer
Poster 1T: Language Grounding to Vision, Robotics and Beyond
15:00–17:00 Enhancing Descriptive Image Captioning with Natural Language Inference

Zhan Shi, Hui Liu and Xiaodan Zhu
Poster 1U: Information Extraction
15:00–17:00 Leveraging Type Descriptions for Zero-shot Named Entity Recognition and
Classification
Rami Aly, Andreas Vlachos and Ryan McDonald
15:00–17:00 MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named
Entity Recognition
Shuang Wu, Xiaoning Song and Zhenhua Feng
15:00–17:00 MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network

Nicholas FitzGerald, Dan Bikel, Jan Botha, Daniel Gillick, Tom Kwiatkowski and
Andrew McCallum
15:00–17:00 Factuality Assessment as Modal Dependency Parsing

Jiarui Yao, Haoling Qiu, Jin Zhao, Bonan Min and Nianwen Xue
Poster 1V: Sentiment Analysis, Stylistic Analysis, and Argument Mining
15:00–17:00 Directed Acyclic Graph Network for Conversational Emotion Recognition

Weizhou Shen, Siyue Wu, Yunyi Yang and Xiaojun Quan
15:00–17:00 Improving Formality Style Transfer with Context-Aware Rule Injection

Zonghai Yao and hong yu
15:00–17:00 Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection

Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou and Yulan He
15:00–17:00 Syntopical Graphs for Computational Argumentation Tasks

Joe Barrow, Rajiv Jain, Nedim Lipka, Franck Dernoncourt, Vlad Morariu, Varun
Manjunatha, Douglas Oard, Philip Resnik and Henning Wachsmuth
15:00–17:00 Stance Detection in COVID-19 Tweets

Kyle Glandt, Sarthak Khanal, Yingjie Li, Doina Caragea and Cornelia Caragea
15:00–17:00 eMLM: A New Pre-training Objective for Emotion Related Tasks

Tiberiu Sosea and Cornelia Caragea
15:00–17:00 Topic-Aware Evidence Reasoning and Stance-Aware Aggregation for Fact
Verification
Jiasheng Si, Deyu Zhou, Tongzhe Li, Xingyu Shi and Yulan He
17:00–18:00 Keynote 2. Alejandrina Cristia: Learning and Processing Language from
Wearables: Opportunities and Challenges
23:00–23:10 Changes in European Solidarity Before and During COVID-19: Evidence from a
Large Crowd- and Expert-Annotated Twitter Dataset
Alexandra Ils, Dan Liu, Daniela Grunow and Steffen Eger
23:10–23:20 Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions

Dorottya Demszky, Jing Liu, Zid Mancenido, Julie Cohen, Heather Hill, Dan
Jurafsky and Tatsunori Hashimoto
23:20–23:30 A Survey of Code-switching: Linguistic and Social Perspectives for Language
Technologies
A. Seza Doğruöz, Sunayana Sitaram, Barbara E. Bullock and Almedia Jacqueline
Toribio
23:30–23:40 Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate
Detection
Bertie Vidgen, Tristan Thrush, Zeerak Waseem and Douwe Kiela
23:40–23:50 InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for
Fake News Detection
Yi Fung, Christopher Thomas, Revanth Gangi Reddy, Sandeep Polisetty, Heng Ji,
Shih-Fu Chang, Kathleen McKeown, Mohit Bansal and Avi Sil
23:50–23:57 On Positivity Bias in Negative Reviews

Madhusudhan Aithal and Chenhao Tan
23:00–23:10 I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

Yixin Nie, Mary Williamson, Mohit Bansal, Douwe Kiela and Jason Weston
23:10–23:20 A Sequence-to-Sequence Approach to Dialogue State Tracking

Yue Feng, Yang Wang and Hang Li
23:20–23:30 Discovering Dialog Structure Graph for Coherent Dialog Generation

Jun Xu, Zeyang Lei, Haifeng Wang, Zheng-Yu Niu, Hua Wu and Wanxiang Che
23:30–23:40 Dialogue Response Selection with Hierarchical Curriculum Learning

Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming
Shi, Nigel Collier and Yan Wang
23:40–23:50 A Joint Model for Dropped Pronoun Recovery and Conversational Discourse
Parsing in Chinese Conversational Speech
Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo, Nianwen Xue and
Ji-Rong Wen
23:50–23:57 PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation

Jing Gu, Qingyang Wu, Chongruo Wu, Weiyan Shi and Zhou Yu
23:00–23:10 A Systematic Investigation of KB-Text Embedding Alignment at Scale

Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen
and Yu Su
23:10–23:20 Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled
Data
Haoming Jiang, Danqing Zhang, Tianyu Cao, Bing Yin and Tuo Zhao
23:20–23:30 Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model
Hongliang Dai, Yangqiu Song and Haixun Wang
23:30–23:40 Improving Named Entity Recognition by External Context Retrieving and
Cooperative Learning
Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang
and Kewei Tu
23:40–23:47 ROPE: Reading Order Equivariant Positional Encoding for Graph-based
Document Information Extraction
Chen-Yu Lee, Chun-Liang Li, Chu Wang, Renshen Wang, Yasuhisa Fujii, Siyang
Qin, Ashok Popat and Tomas Pfister
23:47–23:54 Zero-shot Event Extraction via Transfer Learning: Challenges and Insights
Qing Lyu, Hongming Zhang, Elior Sulem and Dan Roth
Session 4D: Interpretability and Analysis of Models for NLP 3
23:00–23:10 Implicit Representations of Meaning in Neural Language Models

Belinda Z. Li, Maxwell Nye and Jacob Andreas
23:10–23:20 Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models

Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal
Linzen and Yonatan Belinkov
23:20–23:30 Bird’s Eye: Probing for Linguistic Graph Structures with a Simple
Information-Theoretic Approach
Yifan Hou and Mrinmaya Sachan
23:30–23:40 Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge
Bases
Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun, Lingyong Yan, Meng Liao, Tong Xue
and Jin Xu
23:40–23:50 Poisoning Knowledge Graph Embeddings via Relation Inference Patterns

Peru Bhardwaj, John Kelleher, Luca Costabello and Declan O’Sullivan
23:50–23:57 Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading
Comprehension Models
Jieyu Lin, Jiajie Zou and Nai Ding
Session 4E: Ethics in NLP 1
23:00–23:10 Bad Seeds: Evaluating Lexical Methods for Bias Measurement

Maria Antoniak and David Mimno
23:10–23:20 A Survey of Race, Racism, and Anti-Racism in NLP

Anjalie Field, Su Lin Blodgett, Zeerak Waseem and Yulia Tsvetkov
23:20–23:30 Intrinsic Bias Metrics Do Not Correlate with Application Bias

Seraphina Goldfarb-Tarrant, Rebecca Marchant, Ricardo Muñoz Sánchez, Mugdha
Pandya and Adam Lopez
23:30–23:40 RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of
Conversational Language Models
Soumya Barikeri, Anne Lauscher, Ivan Vulić and Goran Glavaš
23:40–23:47 Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing

Jonathan K. Kummerfeld
23:47–23:54 Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia
Jiao Sun and Nanyun Peng
Tuesday, August 3, 2021 (all times UTC+0)
Session 5A: Machine Translation and Multilinguality 3
00:00–00:10 Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks

Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang and Soroush Vosoughi
00:10–00:20 Crafting Adversarial Examples for Neural Machine Translation

Xinze Zhang, Junzhe Zhang, Zhenhua Chen and Kun He
00:20–00:30 UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource
Cross-Lingual NLP
M Saiful Bari, Tasnim Mohiuddin and Shafiq Joty
00:30–00:40 Glancing Transformer for Non-Autoregressive Neural Machine Translation

Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong
Yu and Lei Li
00:40–00:47 Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine
Translation
Hongfei Xu, Qiuhui Liu, Josef van Genabith and Deyi Xiong
00:47–00:54 Adaptive Nearest Neighbor Machine Translation

Xin Zheng, Zhirui Zhang, Junliang Guo, Shujian Huang, Boxing Chen, Weihua Luo
and Jiajun CHEN
Session 5B: Language Grounding to Vision, Robotics and Beyond 2
00:00–00:10 Hierarchical Context-aware Network for Dense Video Event Captioning

Lei Ji, Xianglin Guo, Haoyang Huang and Xilin Chen
00:10–00:20 Control Image Captioning Spatially and Temporally

Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan and Shuai Ma
00:20–00:30 Edited Media Understanding Frames: Reasoning About the Intent and Implications
of Visual Misinformation
Jeff Da, Maxwell Forbes, Rowan Zellers, Anthony Zheng, Jena D. Hwang, Antoine
Bosselut and Yejin Choi
00:30–00:40 PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World

Rowan Zellers, Ari Holtzman, Matthew Peters, Roozbeh Mottaghi, Aniruddha
Kembhavi, Ali Farhadi and Yejin Choi
00:40–00:50 Neural Event Semantics for Grounded Language Understanding

Shyamal Buch, Li Fei-Fei and Noah Goodman
Session 5C: Machine Learning for NLP 2
00:00–00:10 Modeling Fine-Grained Entity Types with Box Embeddings

Yasumasa Onoe, Michael Boratko, Andrew McCallum and Greg Durrett
00:10–00:20 ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

zijun sun, Xiaoya Li, Xiaofei Sun, Yuxian Meng, Xiang Ao, Qing He, Fei Wu and
Jiwei Li
00:20–00:30 Weight Distillation: Transferring the Knowledge in Neural Network Parameters

Ye Lin, Yanyang Li, Ziyang Wang, Bei Li, Quan Du, Tong Xiao and Jingbo Zhu
00:30–00:40 Optimizing Deeper Transformers on Small Datasets

Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie
Chi Kit Cheung, Simon J.D. Prince and Yanshuai Cao
00:40–00:50 BERTAC: Enhancing Transformer-based Language Models with Adversarially
Pretrained Convolutional Neural Networks
Jong-Hoon Oh, Ryu Iida, Julien Kloetzer and Kentaro Torisawa
00:50–00:57 On Orthogonality Constraints for Transformers

Aston Zhang, Alvin Chan, Yi Tay, Jie Fu, Shuohang Wang, Shuai Zhang, Huajie
Shao, Shuochao Yao and Roy Ka-Wei Lee
Session 5D: NLP Applications 1 and Ethics
00:00–00:10 COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19
Pandemic
Arkadiy Saakyan, Tuhin Chakrabarty and Smaranda Muresan
00:10–00:20 Explaining Relationships Between Scientific Documents

Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola and Noah
A. Smith
00:20–00:30 IrEne: Interpretable Energy Prediction for Transformers

Qingqing Cao, Yash Kumar Lal, Harsh Trivedi, Aruna Balasubramanian and
Niranjan Balasubramanian
00:30–00:40 Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising
Approach
Lu Cheng, Ahmadreza Mosallanezhad, Yasin Silva, Deborah Hall and Huan Liu
00:40–00:50 PlotCoder: Hierarchical Decoding for Synthesizing Visualization Code in
Programmatic Context
Xinyun Chen, Linyuan Gong, Alvin Cheung and Dawn Song
00:50–01:00 Changing the World by Changing the Data

Anna Rogers
Session 6A: Machine Learning for NLP 3
01:00–01:10 EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets

Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang and
Jingjing Liu
01:10–01:20 On the Effectiveness of Adapter-based Tuning for Pretrained Language Model
Adaptation
Ruidan He, Linlin Liu, Hai Ye, Qingyu Tan, BOSHENG DING, Liying Cheng,
Jiawei Low, Lidong Bing and Luo Si
01:20–01:30 Data Augmentation for Text Generation Without Any Augmented Data
Wei Bi, Huayang Li and Jiacheng Huang
01:30–01:40 KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language
Representation
Xiaozhi Wang, Tianyu Gao, Zhaocheng Zhu, Zhengyan Zhang, Zhiyuan Liu, Juanzi
Li and Jian Tang
01:40–01:50 Integrating Semantics and Neighborhood Information with Graph-Driven
Generative Models for Document Retrieval
Zijing Ou, Qinliang Su, Jianxing Yu, Bang Liu, Jingwen Wang, Ruihui Zhao,
Changyou Chen and Yefeng Zheng
01:50–01:57 Measuring and Improving BERT’s Mathematical Abilities by Predicting the Order
of Reasoning.
Piotr Piękos, Mateusz Malinowski and Henryk Michalewski
Session 6B: Resources and Evaluation 1
01:00–01:10 SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation
via Typicality Analysis
Joshua Feinglass and Yezhou Yang
01:10–01:20 KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers

Chia-Hsuan Lee, Oleksandr Polozov and Matthew Richardson
01:20–01:30 QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech
Corpus
Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury and Ahmed Ali
01:30–01:40 An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained
Language Models
Xueqing Liu and Chi Wang
01:40–01:50 Better than Average: Paired Evaluation of NLP systems

Maxime Peyrard, Wei Zhao, Steffen Eger and Robert West
01:50–01:57 Happy Dance, Slow Clap: Using Reaction GIFs to Predict Induced Affect on Twitter
Boaz Shmueli, Soumya Ray and Lun-Wei Ku
Session 6C: Semantics: Sentence-level Semantics, Textual Inference and Other areas 1
01:00–01:10 Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database
Context-Dependent Text-to-SQL
Jiaqi Guo, Ziliang Si, Yu Wang, Qian Liu, Ming Fan, Jian-Guang LOU, Zijiang
Yang and Ting Liu
01:10–01:20 CLINE: Contrastive Learning with Semantic Negative Examples for Natural
Language Understanding
Dong Wang, Ning Ding, Piji Li and Haitao Zheng
01:20–01:30 Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference

Ziye Chen, Cheng Ding, Zusheng Zhang, Yanghui Rao and Haoran Xie
01:30–01:40 ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning

Li Du, Xiao Ding, Kai Xiong, Ting Liu and Bing Qin
01:40–01:50 Infusing Finetuning with Semantic Dependencies

Zhaofeng Wu, Hao Peng and Noah Smith
01:50–01:57 Exploring Listwise Evidence Reasoning with T5 for Fact Verification

Kelvin Jiang, Ronak Pradeep and Jimmy Lin
Session 6D: Sentiment Analysis, Stylistic Analysis, and Argument Mining 2
01:00–01:10 Distributed Representations of Emotion Categories in Emotion Space

Xiangyu Wang and Chengqing Zong
01:10–01:20 Style is NOT a single variable: Case Studies for Cross-Stylistic Language
Understanding
Dongyeop Kang and Eduard Hovy
01:20–01:30 DynaSent: A Dynamic Benchmark for Sentiment Analysis

Christopher Potts, Zhengxuan Wu, Atticus Geiger and Douwe Kiela
01:30–01:40 A Hierarchical VAE for Calibrating Attributes while Generating Text using
Normalizing Flow
Bidisha Samanta, Mohit Agrawal and Niloy Ganguly
01:40–01:50 A Unified Generative Framework for Aspect-based Sentiment Analysis

Hang Yan, Junqi Dai, Tuo Ji, Xipeng Qiu and Zheng Zhang
01:50–02:00 Classifying Argumentative Relations Using Logical Mechanisms and
Argumentation Schemes
Yohan Jo, Seojin Bang, Chris Reed and Eduard Hovy
Session 7A: Dialog and Interactive Systems 4
08:00–08:10 Discovering Dialogue Slots with Weak Supervision

Vojtěch Hudeček, Ondřej Dušek and Zhou Yu
08:10–08:20 Enhancing the generalization for Intent Classification and Out-of-Domain
Detection in SLU
Yilin Shen, Yen-Chang Hsu, Avik Ray and Hongxia Jin
08:20–08:30 ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse
Paraphrasing
Thomas Dopierre, Christophe Gravier and Wilfried Logerais
08:30–08:40 Robustness Testing of Language Understanding in Task-Oriented Dialog

Jiexi Liu, Ryuichi Takanobu, Jiaxin Wen, Dazhen Wan, hongguang li, weiran nie,
Cheng LI, Wei Peng and Minlie Huang
08:40–08:50 Comprehensive Study: How the Context Information of Different Granularity
Affects Dialogue State Tracking?
Puhai Yang, Heyan Huang and Xian-Ling Mao
08:50–09:00 OTTers: One-turn Topic Transitions for Open-Domain Dialogue

Karin Sevegnani, David M. Howcroft, Ioannis Konstas and Verena Rieser
Session 7B: Semantics: Sentence-level Semantics, Textual Inference and Other areas 2
08:00–08:10 Towards Robustness of Text-to-SQL Models against Synonym Substitution

Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward,
Jinxia Xie and Pengsheng Huang
08:10–08:20 KACE: Generating Knowledge Aware Contrastive Explanations for Natural
Language Inference
Qianglong Chen, Feng Ji, Xiangji Zeng, Feng-Lin Li, Ji Zhang, Haiqing Chen and
Yin Zhang
08:20–08:30 Self-Guided Contrastive Learning for BERT Sentence Representations

Taeuk Kim, Kang Min Yoo and Sang-goo Lee
08:30–08:40 LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and
Non-Local Relations
Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu and Kai Yu
08:40–08:47 DefSent: Sentence Embeddings using Definition Sentences

Hayato Tsukagoshi, Ryohei Sasano and Koichi Takeda
08:47–08:54 Discrete Cosine Transform as Universal Sentence Encoder

Nada Almarwani and Mona Diab
Session 7C: Speech and Multimodality 1
08:00–08:10 Multi-stage Pre-training over Simplified Multimodal Pre-training Models

Tongtong Liu, Fangxiang Feng and Xiaojie WANG
08:10–08:20 Beyond Sentence-Level End-to-End Speech Translation: Context Helps

Biao Zhang, Ivan Titov, Barry Haddow and Rico Sennrich
08:20–08:30 LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding

Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu,
Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang and Lidong Zhou
08:30–08:40 UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal
Contrastive Learning
Wei Li, Can Gao, Guocheng Niu, Xinyan Xiao, Hao Liu, Jiachen Liu, Hua Wu and
Haifeng Wang
08:40–08:50 Missing Modality Imagination Network for Emotion Recognition with Uncertain
Missing Modalities
Jinming Zhao, Ruichen Li and Qin Jin
08:50–09:00 Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into
Speech Translation Encoders
Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, shen huang, Qi Ju, Tong Xiao and
Jingbo Zhu
Session 7D: Syntax: Tagging, Chunking, and Parsing 1
08:00–08:10 N-ary Constituent Tree Parsing with Recursive Semi-Markov Model

Xin Xin, Jinlong Li and Zeqi Tan
08:10–08:20 Automated Concatenation of Embeddings for Structured Prediction

Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang
and Kewei Tu
08:20–08:30 Multi-View Cross-Lingual Structured Prediction with Minimum Supervision

Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang
and Kewei Tu
08:30–08:40 The Limitations of Limited Context for Constituency Parsing

Yuchen Li and Andrej Risteski
08:40–08:50 Neural Bi-Lexicalized PCFG Induction

Songlin Yang, Yanpeng Zhao and Kewei Tu
Session 7E: Resources and Evaluation 2
08:00–08:10 Ruddit: Norms of Offensiveness for English Reddit Comments

Rishav Hada, Sohi Sudhir, Pushkar Mishra, Helen Yannakoudakis, Saif M. Mohammad
and Ekaterina Shutova
08:10–08:20 Towards Quantifiable Dialogue Coherence Evaluation

Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin and Xiaodan Liang
08:20–08:30 Assessing the Representations of Idiomaticity in Vector Models with a Noun
Compound Dataset Labeled at Type and Token Levels
Marcos Garcia, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart and Aline
Villavicencio
08:30–08:40 Factoring Statutory Reasoning as Language Understanding Challenges

Nils Holzenberger and Benjamin Van Durme
08:40–08:50 Evaluating Evaluation Measures for Ordinal Classification and Ordinal
Quantification
Tetsuya Sakai
08:50–08:57 AligNarr: Aligning Narratives on Movies

Paramita Mirza, Mostafa Abouhamra and Gerhard Weikum
Session 8A: Information Extraction 4
09:00–09:10 Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning
from Decision Making
Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li,
YICHI ZHANG and zelin Dai
09:10–09:20 Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition
Yongliang Shen, Xinyin Ma, Zeqi Tan, Shuai Zhang, Wen Wang and Weiming Lu
09:20–09:30 Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event
Extraction
Yaojie Lu, Hongyu Lin, Jin Xu, Xianpei Han, Jialong Tang, Annan Li, Le Sun,
Meng Liao and Shaoyi Chen
09:30–09:40 A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu and Jun Zhao
09:40–09:50 A Neural Transition-based Joint Model for Disease Named Entity Recognition and
Normalization
Zongcheng Ji, Tian Xia, Mei Han and Jing Xiao
09:50–10:00 OntoED: Low-resource Event Detection with Ontology Embedding

Shumin Deng, Ningyu Zhang, Luoqiu Li, Chen Hui, tou huaixiao, Mosha Chen, Fei
Huang and Huajun Chen
Session 8B: Machine Translation and Multilinguality 4
09:00–09:10 Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine
Translation
Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Shuming Shi, Michael Lyu and Irwin
King
09:10–09:20 Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation
with Cross-Task Pre-training
Linqing Chen, Junhui Li, Zhengxian Gong, Boxing Chen, Weihua Luo, Min Zhang
and Guodong Zhou
09:20–09:30 Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation
Yang Feng, Shuhao Gu, Dengji Guo, Zhengxin Yang and Chenze Shao
09:30–09:40 Cascade versus Direct Speech Translation: Do the Differences Still Make a
Difference?
Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli,
Matteo Negri and Marco Turchi
09:40–09:50 Unsupervised Neural Machine Translation for Low-Resource Domains via
Meta-Learning
Cheonbok Park, Yunwon Tae, TaeHee Kim, Soyoung Yang, Mohammad Azam
Khan, Lucy Park and Jaegul Choo
09:50–09:57 An Exploratory Analysis of Multilingual Word-Level Quality Estimation with
Cross-Lingual Transformers
Tharindu Ranasinghe, Constantin Orasan and Ruslan Mitkov
09:00–09:10 Lightweight Cross-Lingual Sentence Representation Learning

Zhuoyuan Mao, Prakhar Gupta, Chenhui Chu, Martin Jaggi and Sadao Kurohashi
09:10–09:20 ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

SiYu Ding, Junyuan Shang, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu and
Haifeng Wang
09:20–09:30 Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT
Knowledge Distillation
Yuanxin LIU, Fandong Meng, Zheng Lin, Weiping Wang and Jie Zhou
09:30–09:40 Rational LAMOL: A Rationale-based Lifelong Learning Framework

Kasidis Kanwatchara, Thanapapas Horsuwan, Piyawat Lertvittayakumjorn,
Boonserm Kijsirikul and Peerapon Vateekul
09:40–09:50 EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering
Zhibin Duan, Hao Zhang, Chaojie Wang, Zhengjue Wang, Bo Chen and Mingyuan
Zhou
09:50–10:00 LeeBERT: Learned Early Exit for BERT with cross-level optimization
Wei Zhu
Session 8D: NLP Applications 2
09:00–09:10 Unsupervised Extractive Summarization-Based Representations for Accurate and
Explainable Collaborative Filtering
Reinald Adrian Pugoy and Hung-Yu Kao
09:10–09:20 PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction
Shulin Liu, Tao Yang, Tianchi Yue, Feng Zhang and Di Wang
09:20–09:30 Competence-based Multimodal Curriculum Learning for Medical Report
Generation
Fenglin Liu, Shen Ge and Xian Wu
09:30–09:40 Learning Syntactic Dense Embedding with Correlation Graph for Automatic
Readability Assessment
Xinying Qiu, Yuan Chen, Hanwu Chen, Jian-Yun Nie, Yuming Shen and Dawei Lu
09:40–09:50 Meta-KD: A Meta Knowledge Distillation Framework for Language Model
Compression across Domains
Haojie Pan, Chengyu Wang, Minghui Qiu, Yichang Zhang, Yaliang Li and jun
huang
09:50–09:57 Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction
Models
Chong Li, Cenyuan Zhang, Xiaoqing Zheng and Xuanjing Huang
Session 8E: Question Answering 1
09:00–09:10 A Semantic-based Method for Unsupervised Commonsense Question Answering

Yilin Niu, Fei Huang, Jiaming Liang, Wenkai Chen, Xiaoyan Zhu and Minlie Huang
09:10–09:20 Explanations for CommonsenseQA: New Dataset and Models

Shourya Aggarwal, Divyanshu Mandowara, Vishwajeet Agrawal, Dinesh Khandelwal,
Parag Singla and Dinesh Garg
09:20–09:30 Few-Shot Question Answering by Pretraining Span Selection

Ori Ram, Yuval Kirstain, Jonathan Berant, Amir Globerson and Omer Levy
09:30–09:40 UnitedQA: A Hybrid Approach for Open Domain Question Answering

Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen and Jianfeng
Gao
09:40–09:50 Database reasoning over text

James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel
and Alon Halevy
09:50–09:57 Training Adaptive Computation for Open-Domain Question Answering with
Computational Constraints
Yuxiang Wu, Pasquale Minervini, Pontus Stenetorp and Sebastian Riedel
10:00–10:10 Online Learning Meets Machine Translation Evaluation: Finding the Best Systems
with the Least Human Effort
Vânia Mendonça, Ricardo Rei, Luisa Coheur, Alberto Sardinha and Ana Lúcia
Santos
10:10–10:20 How Good is Your Tokenizer? On the Monolingual Performance of Multilingual
Language Models
Phillip Rust, Jonas Pfeiffer, Ivan Vulić, Sebastian Ruder and Iryna Gurevych
10:20–10:30 Evaluating morphological typology in zero-shot cross-lingual transfer

Antonio Martínez-García, Toni Badia and Jeremy Barnes
10:30–10:40 From Machine Translation to Code-Switching: Generating High-Quality
Code-Switched Text
Ishan Tarunesh, Syamantak Kumar and Preethi Jyothi
10:40–10:50 Fast and Accurate Neural Machine Translation with Translation Memory
Qiuxiang He, Guoping Huang, Qu Cui, Li Li and Lemao Liu
10:50–10:57 An Empirical Study on Adversarial Attack on NMT: Languages and Positions Matter
Zhiyuan Zeng and Deyi Xiong
10:00–10:10 Annotating Online Misogyny

Philine Zeinert, Nanna Inie and Leon Derczynski
10:10–10:20 Few-NERD: A Few-shot Named Entity Recognition Dataset

Ning Ding, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie,
Haitao Zheng and Zhiyuan Liu
10:20–10:30 MultiMET: A Multimodal Dataset for Metaphor Understanding

Dongyu Zhang, Minghao Zhang, Heting Zhang, Liang Yang and Hongfei LIN
10:30–10:40 Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset
to Fight Online Hate Speech
Margherita Fanton, Helena Bonaldi, Serra Sinem Tekiroğlu and Marco Guerini
10:40–10:47 OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More
Genres
Yilun Zhu, Sameer Pradhan and Amir Zeldes
Session 9C: Question Answering 2
10:00–10:10 Can Generative Pre-trained Language Models Serve As Knowledge Bases for
Closed-book QA?
Cunxiang Wang, Pai Liu and Yue Zhang
10:10–10:20 Joint Models for Answer Verification in Question Answering Systems

Zeyu Zhang, Thuy Vu and Alessandro Moschitti
10:20–10:30 Answering Ambiguous Questions through Generative Evidence Fusion and
Round-Trip Prediction
Yifan Gao, Henghui Zhu, Patrick Ng, Cicero Nogueira dos Santos, Zhiguo Wang,
Feng Nan, Dejiao Zhang, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang
10:30–10:40 TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual
Content in Finance
Fengbin Zhu, Wenqiang Lei, Youcheng Huang, Chao Wang, Shuo Zhang, Jiancheng
Lv, Fuli Feng and Tat-Seng Chua
10:40–10:50 Modeling Transitions of Focal Entities for Conversational Knowledge Base
Question Answering
Yunshi Lan and Jing Jiang
10:50–10:57 In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering
Peter Vickers, Nikolaos Aletras, Emilio Monti and Loïc Barrault
Session 9D: Semantics: Sentence-level Semantics, Textual Inference and Other areas 3
10:00–10:10 Evidence-based Factual Error Correction

James Thorne and Andreas Vlachos
10:10–10:20 Probabilistic, Structure-Aware Algorithms for Improved Variety, Accuracy, and
Coverage of AMR Alignments
Austin Blodgett and Nathan Schneider
10:20–10:30 Meta-Learning to Compositionally Generalize

Henry Conklin, Bailin Wang, Kenny Smith and Ivan Titov
10:30–10:40 Taming Pre-trained Language Models with N-gram Representations for
Low-Resource Domain Adaptation
Shizhe Diao, Ruijia Xu, Hongjin Su, Yilei Jiang, Yan Song and Tong Zhang
10:40–10:50 ERICA: Improving Entity and Relation Understanding for Pre-trained Language
Models via Contrastive Learning
Yujia Qin, Yankai Lin, Ryuichi Takanobu, Zhiyuan Liu, Peng Li, Heng Ji, Minlie
Huang, Maosong Sun and Jie Zhou
10:50–10:57 Zero-shot Fact Verification by Claim Generation

Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan and William Yang
Wang
Session 9E: Sentiment Analysis, Stylistic Analysis, and Argument Mining 3
10:00–10:10 Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause
Extraction
Hanqi Yan, Lin Gui, Gabriele Pergola and Yulan He
10:10–10:20 Every Bite Is an Experience: Key Point Analysis of Business Reviews

Roy Bar-Haim, Lilach Eden, Yoav Kantor, Roni Friedman and Noam Slonim
10:20–10:30 Structured Sentiment Analysis as Dependency Graph Parsing

Jeremy Barnes, Robin Kurtz, Stephan Oepen, Lilja Øvrelid and Erik Velldal
10:30–10:37 Thank you BART! Rewarding Pre-Trained Models Improves Formality Style
Transfer
Huiyuan Lai, Antonio Toral and Malvina Nissim
10:37–10:44 Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis
Shinhyeok Oh, Dongyub Lee, Taesun Whang, IlNam Park, Seo Gaeun, EungGyun
Kim and Harksoo Kim
10:44–10:51 Towards Generative Aspect-Based Sentiment Analysis

Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing and Wai Lam
11:00–11:10 Consistency Regularization for Cross-Lingual Fine-Tuning

Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal,
Wanxiang Che, Ting Liu, Xia Song and Furu Wei
11:10–11:20 Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word
Alignment
Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang
and Furu Wei
11:20–11:30 Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in
Non-Autoregressive Translation
Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao and
Zhaopeng Tu
11:30–11:40 G-Transformer for Document-Level Machine Translation

Guangsheng Bao, Yue Zhang, Zhiyang Teng, Boxing Chen and Weihua Luo
11:40–11:50 Prevent the Language Model from being Overconfident in Neural Machine
Translation
Mengqi Miao, Fandong Meng, Yijin Liu, Xiao-Hua Zhou and Jie Zhou
11:50–11:57 Bilingual Mutual Information Based Adaptive Training for Neural Machine
Translation
Yangyifan Xu, Yijin Liu, Fandong Meng, Jiajun Zhang, Jinan Xu and Jie Zhou
11:00–11:10 Towards Emotional Support Dialog Systems

Siyang Liu, Chujie Zheng, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong
Jiang and Minlie Huang
11:10–11:20 Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the
Task-Oriented Dialogue System
Yanan Wu, Zhiyuan Zeng, Keqing He, Hong Xu, Yuanmeng Yan, Huixing Jiang
and Weiran Xu
11:20–11:30 GTM: A Generative Triple-wise Model for Conversational Question Generation

Lei Shen, Fandong Meng, Jinchao Zhang, Yang Feng and Jie Zhou
11:30–11:40 Diversifying Dialog Generation via Adaptive Label Smoothing

Yida Wang, Yinhe Zheng, Yong Jiang and Minlie Huang
11:40–11:50 Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training

Li-Ming Zhan, Haowen Liang, Bo LIU, Lu Fan, Xiao-Ming Wu and Albert Y.S.
Lam
11:50–11:57 Continual Learning for Task-oriented Dialogue System with Iterative Network
Pruning, Expanding and Masking
Binzong Geng, Fajie Yuan, Qiancheng Xu, Ying Shen, Ruifeng Xu and Min Yang
11:00–11:10 Document-level Event Extraction via Heterogeneous Graph-based Interaction
Model with a Tracker
Runxin Xu, Tianyu Liu, Lei Li and Baobao Chang
11:10–11:20 Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best
Path
Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe
11:20–11:30 LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality
Identification
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng and
Yuguang Chen
11:30–11:40 Revisiting the Negative Data of Distantly Supervised Relation Extraction

Chenhao Xie, Jiaqing Liang, Jingping Liu, Chengsong Huang, Wenhao Huang and
Yanghua Xiao
11:40–11:50 Knowing the No-match: Entity Alignment with Dangling Cases

Zequn Sun, Muhao Chen and Wei Hu
11:50–11:57 TIMERS: Document-level Temporal Relation Extraction

Puneet Mathur, Rajiv Jain, Franck Dernoncourt, Vlad Morariu, Quan Hung Tran
and Dinesh Manocha
Session 10D: Phonology, Morphology and Word Segmentation 1
11:00–11:10 Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s
Interpretation of Complex Words
Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze
11:10–11:20 Optimizing over Subsequences Generates Context-Sensitive Languages

Andrew Lamont
11:20–11:30 Morphology Matters: A Multilingual Language Modeling Analysis

Hyunji Hayley Park, Katherine J. Zhang, Coleman Haley, Kenneth Steimel, Han
Liu and Lane Schwartz
11:30–11:37 Improving Arabic Diacritization with Regularized Decoding and Adversarial
Training
Han Qin, Guimin Chen, Yuanhe Tian and Yan Song
11:37–11:44 When is Char Better Than Subword: A Systematic Study of Segmentation
Algorithms for Neural Machine Translation
Jiahuan Li, Yutong Shen, Shujian Huang, Xinyu Dai and Jiajun CHEN
11:44–11:51 More than Text: Multi-modal Chinese Word Segmentation

Dong Zhang, Zheng Hu, Shoushan Li, Hanqian Wu, Qiaoming Zhu and Guodong
Zhou
Session 10E: Semantics: Lexical Semantics 1
11:00–11:10 BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify
Analogies?
Asahi Ushio, Luis Espinosa Anke, Steven Schockaert and Jose Camacho-Collados
11:10–11:20 Exploring the Representation of Word Meanings in Context: A Case Study on
Homonymy and Synonymy
Marcos Garcia
11:20–11:30 Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe
Approach
Jie Huang, Kevin Chang, JinJun Xiong and Wen-mei Hwu
11:30–11:37 A Mixture-of-Experts Model for Antonym-Synonym Discrimination

Zhipeng Xie and Nan Zeng
11:37–11:44 Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity
Linking
Fangyu Liu, Ivan Vulić, Anna Korhonen and Nigel Collier
11:44–11:51 A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space

Sara Rajaee and Mohammad Taher Pilehvar
14:00–15:30 Business meeting and Green NLP panel
15:30–16:30 Keynote 3. Christopher Potts: Reliable Characterizations of NLP Systems as a
Social Responsibility
16:30–16:40 HERALD: An Annotation Efficient Method to Detect User Disengagement in Social
Conversations
Weixin Liang, Kai-Hui Liang and Zhou Yu
16:40–16:50 Value-Agnostic Conversational Semantic Parsing

Emmanouil Antonios Platanios, Adam Pauls, Subhro Roy, Yuchen Zhang,
Alexander Kyte, Alan Guo, Sam Thomson, Jayant Krishnamurthy, Jason Wolfe, Jacob
Andreas and Dan Klein
16:50–17:00 MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation
Understanding
Jia-Chen Gu, Chongyang Tao, Zhenhua Ling, Can Xu, Xiubo Geng and Daxin Jiang
17:00–17:10 Best of Both Worlds: Making High Accuracy Non-incremental Transformer-based
Disfluency Detection Incremental
Morteza Rohanian and Julian Hough
17:10–17:20 NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based
Simulation
Sungdong Kim, Minsuk Chang and Sang-Woo Lee
17:20–17:27 Unsupervised Enrichment of Persona-grounded Dialog with Background Stories

Bodhisattwa Prasad Majumder, Taylor Berg-Kirkpatrick, Julian McAuley and
Harsh Jhamtani
Session 11B: Linguistic Theories, Cognitive Modeling and Psycholinguistics 1
16:30–16:40 CDRNN: Discovering Complex Dynamics in Human Language Processing

Cory Shain
16:40–16:50 Structural Guidance for Transformer Language Models

Peng Qian, Tahira Naseem, Roger Levy and Ramón Fernandez Astudillo
16:50–17:00 Surprisal Estimators for Human Reading Times Need Character Models
Byung-Doh Oh, Christian Clark and William Schuler
17:00–17:10 CogAlign: Learning to Align Textual Neural Representations to Cognitive
Language Processing Signals
Yuqi Ren and Deyi Xiong
17:10–17:20 Formal Basis of a Language Universal

Milos Stanojevic and Mark Steedman
17:20–17:27 Beyond Laurel/Yanny: An Autoencoder-Enabled Search for Polyperceivable Audio

Kartik Chandra, Chuma Kabaghe and Gregory Valiant
16:30–16:40 Self-Attention Networks Can Process Bounded Hierarchical Languages

Shunyu Yao, Binghui Peng, Christos Papadimitriou and Karthik Narasimhan
16:40–16:50 TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling
Parker Riley, Noah Constant, Mandy Guo, Girish Kumar, David Uthus and Zarana
Parekh
16:50–17:00 H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences

Zhenhai Zhu and Radu Soricut
17:00–17:10 Making Pre-trained Language Models Better Few-shot Learners

Tianyu Gao, Adam Fisch and Danqi Chen
17:10–17:20 A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger’s
Adversarial Attacks
Thai Le, Noseong Park and Dongwon Lee
17:20–17:27 Don’t Let Discourse Confine Your Model: Sequence Perturbations for Improved
Event Language Models
Mahnaz Koupaee, Greg Durrett, Nathanael Chambers and Niranjan Balasubramanian
Session 11D: Information Retrieval and Text Mining 1
16:30–16:40 Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor Detection
Lingwei Wei, Dou Hu, Wei Zhou, Zhaojuan Yue and Songlin Hu
16:40–16:50 Label-Specific Dual Graph Neural Network for Multi-Label Text Classification
Qianwen Ma, Chunyuan Yuan, Wei Zhou and Songlin Hu
16:50–17:00 TAN-NTM: Topic Attention Networks for Neural Topic Modeling

Madhur Panwar, Shashank Shailabh, Milan Aggarwal and Balaji Krishnamurthy
17:00–17:10 Cross-language Sentence Selection via Data Augmentation and Rationale Training
Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuscakova, Rui Zhang, Douglas
Oard and Kathleen McKeown
17:10–17:20 A Neural Model for Joint Document and Snippet Ranking in Question Answering
for Large Document Collections
Dimitris Pappas and Ion Androutsopoulos
17:20–17:27 The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes
Nils Reimers and Iryna Gurevych
Session 11E: Discourse and Pragmatics 1
16:30–16:40 W-RST: Towards a Weighted RST-style Discourse Framework

Patrick Huber, Wen Xiao and Giuseppe Carenini
16:40–16:50 ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences
Yanjun Gao, Ting-Hao Huang and Rebecca J. Passonneau
16:50–17:00 Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering
Najoung Kim, Ellie Pavlick, Burcu Karagol Ayan and Deepak Ramachandran
17:00–17:10 Adversarial Learning for Discourse Rhetorical Structure Parsing

Longyin Zhang, Fang Kong and Guodong Zhou
17:10–17:20 Exploring Discourse Structures for Argument Impact Classification

Xin Liu, Jiefu Ou, Yangqiu Song and Xin Jiang
23:00–23:10 Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural
Machine Translation
Tong Zhang, Long Zhang, Wei Ye, Bo Li, Jinan Sun, Xiaoyu Zhu, Wen Zhao and
Shikun Zhang
23:10–23:20 VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation
Fuli Luo, Wei Wang, Jiahao Liu, Yijia Liu, Bin Bi, Songfang Huang, Fei Huang and
Luo Si
23:20–23:30 A unified approach to sentence segmentation of punctuated text in many languages

Rachel Wicks and Matt Post
23:30–23:40 Towards User-Driven Neural Machine Translation

Huan Lin, Liang Yao, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo,
Degen Huang and Jinsong Su
23:40–23:50 End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages
Josef Jon, João Paulo Aires, Dusan Varis and Ondřej Bojar
23:50–23:57 Cross-lingual Text Classification with Heterogeneous Graph Neural Network

Ziyun Wang, Xuan Liu, Peiji Yang, Shixing Liu and Zhisheng Wang
23:00–23:10 Handling Extreme Class Imbalance in Technical Logbook Datasets

Farhad Akhbardeh, Cecilia Ovesdotter Alm, Marcos Zampieri and Travis Desell
23:10–23:20 ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction
and Explanation
Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripabandhu Ghosh, Shouvik
Kumar Guha, Arnab Bhattacharya and Ashutosh Modi
23:20–23:30 Supporting Cognitive and Emotional Empathic Writing of Students

Thiemo Wambsganss, Christina Niklaus, Matthias Söllner, Siegfried Handschuh
and Jan Marco Leimeister
23:30–23:40 Context-aware Adversarial Training for Name Regularity Bias in Named Entity
Recognition
Abbas Ghaddar, Philippe Langlais, Ahmad Rashid and Mehdi Rezagholizadeh
23:40–23:50 SummEval: Re-evaluating Summarization Evaluation

Alex Fabbri, Wojciech Kryscinski, Bryan McCann, Caiming Xiong and Richard
Socher
23:50–24:00 Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
Daniel Deutsch, Tania Bedrax-Weiss and Dan Roth
Session 12C: Question Answering 3
23:00–23:10 Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain
Question Answering
Alexander Hanbo Li, Patrick Ng, Peng Xu, Henghui Zhu, Zhiguo Wang and Bing
Xiang
23:10–23:20 Generation-Augmented Retrieval for Open-Domain Question Answering

Yuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han
and Weizhu Chen
23:20–23:30 Check It Again: Progressive Visual Question Answering via Visual Entailment

Qingyi Si, Zheng Lin, Ming yu Zheng, Peng Fu and Weiping Wang
23:30–23:40 A Mutual Information Maximization Approach for the Spurious Solution Problem
in Weakly Supervised Question Answering
Zhihong Shao, Lifeng Shang, Qun Liu and Minlie Huang
23:40–23:50 Relevance-guided Supervision for OpenQA with ColBERT

Omar Khattab, Christopher Potts and Matei Zaharia
23:50–23:57 Towards more equitable question answering systems: How much more data do you
need?
Arnab Debnath, Navid Rajabi, Fardina Fathmiul Alam and Antonios Anastasopoulos
Session 12D: Theme 1
23:00–23:10 Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?
Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson and Norman
Sadeh
23:10–23:20 Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification
and Active Learning
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller,
Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann and
Gerhard Heyer
23:20–23:30 Reliability Testing for Natural Language Processing Systems

Samson Tan, Shafiq Joty, Kathy Baxter, Araz Taeihagh, Gregory A. Bennett and
Min-Yen Kan
23:30–23:40 Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
Paul Pu Liang, Terrance Liu, Anna Cai, Michal Muszynski, Ryo Ishii, Nick Allen,
Randy Auerbach, David Brent, Ruslan Salakhutdinov and Louis-Philippe Morency
23:40–23:50 Anonymisation Models for Text Data: State of the art, Challenges and Future Directions
Pierre Lison, Ildikó Pilán, David Sanchez, Montserrat Batet and Lilja Øvrelid
Wednesday, August 4, 2021 (all times UTC+0)
0:00–2:00 End-to-End AMR Coreference Resolution

Qiankun Fu, Linfeng Song, Wenyu Du and Yue Zhang
0:00–2:00 How is BERT surprised? Layerwise detection of linguistic anomalies

Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu and Frank Rudzicz
0:00–2:00 Psycholinguistic Tripartite Graph Network for Personality Detection

Tao Yang, Feifan Yang, Haolan Ouyang and Xiaojun Quan
0:00–2:00 Verb Metaphor Detection via Contextual Relation Learning

Wei Song, Shuhui Zhou, Ruiji Fu, Ting Liu and Lizhen Liu
Poster 2D: Speech and Multimodality
0:00–2:00 Improving Speech Translation by Understanding and Learning from the Auxiliary
Text Translation Task
Yun Tang, Juan Pino, Xian Li, Changhan Wang and Dmitriy Genzel
Poster 2E: Ethics in NLP
0:00–2:00 Probing Toxic Content in Large Pre-Trained Language Models

Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song and Dit-Yan Yeung
0:00–2:00 Societal Biases in Language Generation: Progress and Challenges

Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng
Poster 2F: Interpretability and Analysis of Models for NLP
0:00–2:00 Reservoir Transformers

Sheng Shen, Alexei Baevski, Ari Morcos, Kurt Keutzer, Michael Auli and Douwe
Kiela
Poster 2G: Machine Learning for NLP
0:00–2:00 Subsequence Based Deep Active Learning for Named Entity Recognition
Puria Radmard, Yassir Fathullah and Aldo Lipani
0:00–2:00 Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
Tyler Chang, Yifan Xu, Weijian Xu and Zhuowen Tu
0:00–2:00 BinaryBERT: Pushing the Limit of BERT Quantization

Haoli Bai, Wei Zhang, Lu Hou, Lifeng Shang, Jin Jin, Xin Jiang, Qun Liu, Michael
Lyu and Irwin King
0:00–2:00 Embedding Time Differences in Context-sensitive Neural Networks for Learning Time to Event
Nazanin Dehghani, Hassan Hajipoor and Hadi Amiri
0:00–2:00 Are Pretrained Convolutions Better than Pretrained Transformers?

Yi Tay, Mostafa Dehghani, Jai Prakash Gupta, Vamsi Aribandi, Dara Bahri, Zhen
Qin and Donald Metzler
0:00–2:00 PairRE: Knowledge Graph Embeddings via Paired Relation Vectors

Linlin Chao, Jianshan He, Taifeng Wang and Wei Chu
0:00–2:00 Improving Compositional Generalization in Classification Tasks via Structure Annotations
Juyong Kim, Pradeep Ravikumar, Joshua Ainslie and Santiago Ontanon
0:00–2:00 Learning to Generate Task-Specific Adapters from Task Description

Qinyuan Ye and Xiang Ren
0:00–2:00 Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification
Haibin Chen, Qianli Ma, Zhenxi Lin and Jiangyue Yan
0:00–2:00 HiddenCut: Simple Data Augmentation for Natural Language Understanding with
Better Generalizability
Jiaao Chen, Dinghan Shen, Weizhu Chen and Diyi Yang
0:00–2:00 Efficient Content-Based Sparse Attention with Routing Transformers

Aurko Roy, Mohammad Saffar, Ashish Vaswani and David Grangier
Poster 2H: Dialog and Interactive Systems
0:00–2:00 Neural Stylistic Response Generation with Disentangled Latent Variables

Qingfu Zhu, Wei-Nan Zhang, Ting Liu and William Yang Wang
0:00–2:00 Intent Classification and Slot Filling for Privacy Policies

Wasi Ahmad, Jianfeng Chi, Tu Le, Thomas Norton, Yuan Tian and Kai-Wei Chang
0:00–2:00 RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
Baolin Peng, Chunyuan Li, Zhu Zhang, Chenguang Zhu, Jinchao Li and Jianfeng
Gao
0:00–2:00 QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining

Xinya Du, Luheng He, Qi Li, Dian Yu, Panupong Pasupat and Yuan Zhang
0:00–2:00 Domain-Adaptive Pretraining Methods for Dialogue Understanding

Han Wu, Kun Xu, Linfeng Song, Lifeng Jin, Haisong Zhang and Linqi Song
0:00–2:00 Semantic Representation for Dialogue Modeling

Xuefeng Bai, Yulong Chen, Linfeng Song and Yue Zhang
0:00–2:00 A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations
Chongyang Tao, Changyu Chen, Jiazhan Feng, Ji-Rong Wen and Rui Yan
0:00–2:00 SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching
Baolin Peng, Chunyuan Li, Jinchao Li, Shahin Shayandeh, Lars Liden and Jianfeng
Gao
Poster 2I: Information Retrieval and Text Mining
0:00–2:00 Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks
Yuanhe Tian, Guimin Chen, Yan Song and Xiang Wan
0:00–2:00 Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP
Anthony Chen, Pallavi Gudipati, Shayne Longpre, Xiao Ling and Sameer Singh
Poster 2J: Resources and Evaluation
0:00–2:00 Targeting the Benchmark: On Methodology in Current Natural Language Processing Research
David Schlangen
0:00–2:00 Evaluation Examples are not Equally Informative: How should that change NLP
Leaderboards?
Pedro Rodriguez, Joe Barrow, Alexander Miserlis Hoyle, John P. Lalor, Robin Jia
and Jordan Boyd-Graber
Poster 2K: Computational Social Science and Cultural Analytics
0:00–2:00 Claim Matching Beyond English to Scale Global Fact-Checking

Ashkan Kazemi, Kiran Garimella, Devin Gaffney and Scott Hale
0:00–2:00 X-Fact: A New Benchmark Dataset for Multilingual Fact Checking

Ashim Gupta and Vivek Srikumar
Poster 2L: Machine Translation and Multilinguality
0:00–2:00 SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural
Machine Translation
Shuo Ren, Long Zhou, Shujie Liu, Furu Wei, Ming Zhou and Shuai Ma
0:00–2:00 Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models
Sumanta Bhattacharyya, Amirmohammad Rooshenas, Subhajit Naskar, Simeng
Sun, Mohit Iyyer and Andrew McCallum
0:00–2:00 nmT5 - Is parallel data still relevant for pre-training massively multilingual language models?
Mihir Kale, Aditya Siddhant, Rami Al-Rfou, Linting Xue, Noah Constant and
Melvin Johnson
0:00–2:00 Syntax-augmented Multilingual BERT for Cross-lingual Transfer

Wasi Ahmad, Haoran Li, Kai-Wei Chang and Yashar Mehdad
0:00–2:00 How to Adapt Your Pretrained Multilingual Model to 1600 Languages

Abteen Ebrahimi and Katharina Kann
0:00–2:00 Synthesizing Parallel Data of User-Generated Texts with Zero-Shot Neural Machine
Translation
Benjamin Marie and Atsushi Fujita
Poster 2M: Syntax: Tagging, Chunking, and Parsing
0:00–2:00 Weakly Supervised Named Entity Tagging with Learnable Logical Rules
Jiacheng Li, Haibo Ding, Jingbo Shang, Julian McAuley and Zhe Feng
Poster 2N: NLP Applications
0:00–2:00 Question Generation for Adaptive Education

Megha Srivastava and Noah Goodman
Poster 2O: Language Generation
0:00–2:00 Prefix-Tuning: Optimizing Continuous Prompts for Generation

Xiang Lisa Li and Percy Liang
0:00–2:00 One2Set: Generating Diverse Keyphrases as a Set

Jiacheng Ye, Tao Gui, Yichao Luo, Yige Xu and Qi Zhang
0:00–2:00 A Simple Recipe for Multilingual Grammatical Error Correction

Sascha Rothe, Jonathan Mallinson, Eric Malmi, Sebastian Krause and Aliaksei Severyn
0:00–2:00 Continuous Language Generative Flow

Zineng Tang, Shiyue Zhang, Hyounghun Kim and Mohit Bansal
0:00–2:00 RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases
DongHyun Choi, Myeong Cheol Shin, EungGyun Kim and Dong Ryeol Shin
Poster 2P: Summarization
0:00–2:00 TWAG: A Topic-Guided Wikipedia Abstract Generator

Fangwei Zhu, Shangqing Tu, Jiaxin Shi, Juanzi Li, Lei Hou and Tong Cui
Poster 2Q: Question Answering
0:00–2:00 Towards Visual Question Answering on Pathology Images

Xuehai He, Zhuo Cai, Wenlan Wei, Yichen Zhang, Luntian Mou, Eric Xing and
Pengtao Xie
0:00–2:00 ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal
Text Data
Woojeong Jin, Rahul Khanna, Suji Kim, Dong-Ho Lee, Fred Morstatter, Aram Galstyan
and Xiang Ren
0:00–2:00 Recursive Tree-Structured Self-Attention for Answer Sentence Selection

Khalil Mrini, Emilia Farcas and Ndapa Nakashole
Poster 2R: Language Grounding to Vision, Robotics and Beyond
0:00–2:00 Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations
Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Kartik Talamadupula,
Mrinmaya Sachan and Murray Campbell
0:00–2:00 mTVR: Multilingual Moment Retrieval in Videos

Jie Lei, Tamara Berg and Mohit Bansal
Poster 2S: Information Extraction
0:00–2:00 How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level
Relation Extraction
Zikun Hu, Yixin Cao, Lifu Huang and Tat-Seng Chua
0:00–2:00 Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event
Argument Extraction
Kaiwen Wei, Xian Sun, Zequn Zhang, Jingyuan Zhang, Guo Zhi and Li Jin
0:00–2:00 Element Intervention for Open Relation Extraction

Fangchao Liu, Lingyong Yan, Hongyu Lin, Xianpei Han and Le Sun
0:00–2:00 Explicitly Capturing Relations between Entity Mentions via Graph Neural Networks
for Domain-specific Named Entity Recognition
Pei Chen, Haibo Ding, Jun Araki and Ruihong Huang
0:00–2:00 AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding
Jun Yan, Nasser Zalmout, Yan Liang, Christan Grant, Xiang Ren and Xin Luna
Dong
0:00–2:00 CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction
Zhengbao Jiang, Jialong Han, Bunyamin Sisman and Xin Luna Dong
0:00–2:00 Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference
Robert L Logan IV, Andrew McCallum, Sameer Singh and Dan Bikel
0:00–2:00 Search from History and Reason for Future: Two-stage Reasoning on Temporal
Knowledge Graphs
Zixuan Li, Xiaolong Jin, Saiping Guan, Wei Li, Jiafeng Guo, Yuanzhuo Wang and
Xueqi Cheng
Poster 2T: Sentiment Analysis, Stylistic Analysis, and Argument Mining
0:00–2:00 Employing Argumentation Knowledge Graphs for Neural Argument Generation

Khalid Al Khatib, Lukas Trautner, Henning Wachsmuth, Yufang Hou and Benno
Stein
0:00–2:00 Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction

Lu Xu, Yew Ken Chia and Lidong Bing
08:00–08:10 On Compositional Generalization of Neural Machine Translation

Yafu Li, Yongjing Yin, Yulong Chen and Yue Zhang
08:10–08:20 Mask-Align: Self-Supervised Neural Word Alignment

Chi Chen, Maosong Sun and Yang Liu
08:20–08:30 GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation

Huayang Li, Lemao Liu, Guoping Huang and Shuming Shi
08:30–08:37 Improving Lexically Constrained Neural Machine Translation with Source-Conditioned Masked Span Prediction
Gyubok Lee, Seongjun Yang and Edward Choi
Session 13B: Information Extraction 6
08:00–08:10 De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention
Wenkai Zhang, Hongyu Lin, Xianpei Han and Le Sun
08:10–08:20 A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition
Fei Li, ZhiChao Lin, Meishan Zhang and Donghong Ji
08:20–08:30 MLBiNet: A Cross-Sentence Collective Event Detection Network

Dongfang Lou, Zhilin Liao, Shumin Deng, Ningyu Zhang and Huajun Chen
08:30–08:40 Exploiting Document Structures and Cluster Consistencies for Event Coreference
Resolution
Hieu Minh Tran, Duy Phung and Thien Huu Nguyen
08:40–08:50 StereoRel: Relational Triple Extraction from a Stereoscopic Perspective

Xuetao Tian, Liping Jing, Lu He and Feng Liu
08:50–09:00 Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen and
Weihua Peng
08:00–08:10 Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution
Fanchao Qi, Yuan Yao, Sophia Xu, Zhiyuan Liu and Maosong Sun
08:10–08:20 Parameter-Efficient Transfer Learning with Diff Pruning

Demi Guo, Alexander Rush and Yoon Kim
08:20–08:30 R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng and Gerard de
Melo
08:30–08:40 Risk Minimization for Zero-shot Sequence Labeling

Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang
and Kewei Tu
08:40–08:50 WARP: Word-level Adversarial ReProgramming

Karen Hambardzumyan, Hrant Khachatrian and Jonathan May
08:50–09:00 Lexicon Learning for Few Shot Sequence Modeling

Ekin Akyurek and Jacob Andreas
08:00–08:10 Personalized Transformer for Explainable Recommendation

Lei Li, Yongfeng Zhang and Li Chen
08:10–08:20 Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques
Kundan Krishna, Sopan Khosla, Jeffrey Bigham and Zachary C. Lipton
08:20–08:30 Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction
Piji Li and Shuming Shi
08:30–08:40 Early Detection of Sexual Predators in Chats

Matthias Vogt, Ulf Leser and Alan Akbik
08:40–08:50 Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation

Xingyi Yang, Muchao Ye, Quanzeng You and Fenglong Ma
08:50–08:57 Quotation Recommendation and Interpretation Based on Transformation from Queries to Quotations
Lingzhi Wang, Xingshan Zeng and Kam-Fai Wong
Session 13E: Information Retrieval and Text Mining 2
08:00–08:10 Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification
Xuepeng Wang, Li Zhao, Bing Liu, Tao Chen, Feng Zhang and Di Wang
08:10–08:20 VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words
Xiaopeng Lu, Tiancheng Zhao and Kyusong Lee
08:20–08:30 Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision
Si Sun, Yingzhuo Qian, Zhenghao Liu, Chenyan Xiong, Kaitao Zhang, Jie Bao,
Zhiyuan Liu and Paul Bennett
08:30–08:40 Semi-Supervised Text Classification with Balanced Deep Representation Distributions
Changchun Li, Ximing Li and Jihong Ouyang
08:40–08:50 Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
Hongyin Tang, Xingwu Sun, Beihong Jin, Jingang Wang, Fuzheng Zhang and Wei
Wu
08:50–08:57 Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence
Federico Bianchi, Silvia Terragni and Dirk Hovy
9:00–11:00 ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Yuanmeng Yan, Rumei Li, Sirui Wang, Fuzheng Zhang, Wei Wu and Weiran Xu
9:00–11:00 Exploring Dynamic Selection of Branch Expansion Orders for Code Generation
Hui Jiang, Chulun Zhou, Fandong Meng, Biao Zhang, Jie Zhou, Degen Huang,
Qingqiang Wu and Jinsong Su
9:00–11:00 COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion
Debjit Paul and Anette Frank
9:00–11:00 Reasoning over Entity-Action-Location Graph for Procedural Text Understanding

Hao Huang, Xiubo Geng, Jian Pei, Guodong Long and Daxin Jiang
9:00–11:00 From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding
Shan Wu, Bo Chen, Chunlei Xin, Xianpei Han, Le Sun, Weipeng Zhang, Jiansong
Chen, Fan Yang and Xunliang Cai
9:00–11:00 Pre-training Universal Language Representation

Yian Li and Hai Zhao
9:00–11:00 Structural Pre-training for Dialogue Comprehension

Zhuosheng Zhang and Hai Zhao
9:00–11:00 AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin, Cheng Chen, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu
9:00–11:00 Data Augmentation with Adversarial Training for Cross-Lingual NLI

Xin Dong, Yaxin Zhu, Zuohui Fu, Dongkuan Xu and Gerard de Melo
9:00–11:00 Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese
Chunliu Wang, Rik van Noord, Arianna Bisazza and Johan Bos
9:00–11:00 Code Generation from Natural Language with Less Prior Knowledge and More
Monolingual Data
Sajad Norouzi, Keyi Tang and Yanshuai Cao
9:00–11:00 Bootstrapped Unsupervised Sentence Representation Learning

Yan Zhang, Ruidan He, Zuozhu Liu, Lidong Bing and Haizhou Li
9:00–11:00 Learning Event Graph Knowledge for Abductive Reasoning

Li Du, Xiao Ding, Ting Liu and Bing Qin
9:00–11:00 Issues with Entailment-based Zero-shot Text Classification

Tingting Ma, Jin-Ge Yao, Chin-Yew Lin and Tiejun Zhao
9:00–11:00 Neural-Symbolic Commonsense Reasoner with Relation Predictors

Farhad Moghimifar, Lizhen Qu, Terry Yue Zhuo, Gholamreza Haffari and Mahsa
Baktashmotlagh
9:00–11:00 A Cognitive Regularizer for Language Modeling

Jason Wei, Clara Meister and Ryan Cotterell
9:00–11:00 What Motivates You? Benchmarking Automatic Detection of Basic Needs from
Short Posts
Sanja Stajner, Seren Yenikent, Bilal Ghanem and Marc Franco-Salvador
9:00–11:00 Lower Perplexity is Not Always Human-Like

Tatsuki Kuribayashi, Yohei Oseki, Takumi Ito, Ryo Yoshida, Masayuki Asahara and
Kentaro Inui
9:00–11:00 Word Sense Disambiguation: Towards Interactive Context Exploitation from Both
Word and Sense Perspectives
Ming Wang and Yinglin Wang
9:00–11:00 A Knowledge-Guided Framework for Frame Identification

Xuefeng Su, Ru Li, Xiaoli Li, Jeff Z. Pan, Hu Zhang, Qinghua Chai and Xiaoqi Han
9:00–11:00 Obtaining Better Static Word Embeddings Using Contextual Embedding Models
Prakhar Gupta and Martin Jaggi
9:00–11:00 Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation
Yingjun Du, Nithin Holla, Xiantong Zhen, Cees Snoek and Ekaterina Shutova
9:00–11:00 LexFit: Lexical Fine-Tuning of Pretrained Language Models

Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen and Goran Glavaš
9:00–11:00 Semantic Frame Induction using Masked Word Embeddings and Two-Step Clustering
Kosuke Yamada, Ryohei Sasano and Koichi Takeda
9:00–11:00 Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing,
Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart and Anna
Korhonen
Poster 3D: Speech and Multimodality
9:00–11:00 Text-Free Image-to-Speech Synthesis Using Learned Segmental Units

Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song and James Glass
9:00–11:00 CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion Network
Jiajia Tang, Kang Li, Xuanyu Jin, Andrzej Cichocki, Qibin Zhao and Wanzeng
Kong
9:00–11:00 Lightweight Adapter Tuning for Multilingual Speech Translation

Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab and Laurent Besacier
Poster 3E: Interpretability and Analysis of Models for NLP
9:00–11:00 Parameter Selection: Why We Should Pay More Attention to It

Jie-Jyun Liu, Tsung-Han Yang, Si-An Chen and Chih-Jen Lin
9:00–11:00 Positional Artefacts Propagate Through Masked Language Model Embeddings

Ziyang Luo, Artur Kulmizev and Xiaoxi Mao
9:00–11:00 Language Model Evaluation Beyond Perplexity

Clara Meister and Ryan Cotterell
9:00–11:00 Learning to Explain: Generating Stable Explanations Fast

Xuelin Situ, Ingrid Zukerman, Cecile Paris, Sameen Maruf and Gholamreza Haffari
9:00–11:00 StereoSet: Measuring stereotypical bias in pretrained language models

Moin Nadeem, Anna Bethke and Siva Reddy
9:00–11:00 Alignment Rationale for Natural Language Inference

Zhongtao Jiang, Yuanzhe Zhang, Zhao Yang, Jun Zhao and Kang Liu
9:00–11:00 Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators
Peiyu Liu, Ze-Feng Gao, Wayne Xin Zhao, Zhi-Yuan Xie, Zhong-Yi Lu and Ji-Rong
Wen
9:00–11:00 On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation
Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui and Fan Zhang
9:00–11:00 CausaLM: Causal Model Explanation Through Counterfactual Language Models

Amir Feder, Nadav Oved, Uri Shalit and Roi Reichart
9:00–11:00 Amnesic Probing: Behavioral Explanation With Amnesic Counterfactuals

Yanai Elazar, Shauli Ravfogel, Alon Jacovi and Yoav Goldberg
Poster 3F: Information Retrieval and Text Mining
9:00–11:00 Syntax-Enhanced Pre-trained Model

Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun
Zhong, Xiaojun Quan, Daxin Jiang and Nan Duan
9:00–11:00 Matching Distributions between Model and Data: Cross-domain Knowledge Distillation for Unsupervised Domain Adaptation
Bo Zhang, Xiaoming Zhang, Yun Liu, Lei Cheng and Zhoujun Li
9:00–11:00 Counterfactual Inference for Text Classification Debiasing

Chen Qian, Fuli Feng, Lijie Wen, Chunping Ma and Pengjun Xie
9:00–11:00 HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation
Tao Qi, Fangzhao Wu, Chuhan Wu, Peiru Yang, Yang Yu, Xing Xie and Yongfeng
Huang
9:00–11:00 Distinct Label Representations for Few-Shot Text Classification

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and Yuki Arase
9:00–11:00 PP-Rec: News Recommendation with Personalized User Interest and Time-aware
News Popularity
Tao Qi, Fangzhao Wu, Chuhan Wu and Yongfeng Huang
9:00–11:00 Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims
Qiang Sheng, Juan Cao, Xueyao Zhang, Xirong Li and Lei Zhong
9:00–11:00 Learning to Solve NLP Tasks in an Incremental Number of Languages

Giuseppe Castellucci, Simone Filice, Danilo Croce and Roberto Basili
Poster 3G: Machine Learning for NLP
9:00–11:00 Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble
Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang and Xuanjing Huang
9:00–11:00 Shortformer: Better Language Modeling using Shorter Inputs

Ofir Press, Noah A. Smith and Mike Lewis
9:00–11:00 BanditMTL: Bandit-based Multi-task Learning for Text Classification

Yuren Mao, Zekai Wang, Weiwei Liu, Xuemin Lin and Wenbin Hu
9:00–11:00 Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case
Study for Knowledge Graph Embedding
Hidetaka Kamigaito and Katsuhiko Hayashi
9:00–11:00 Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling
Chuhan Wu, Fangzhao Wu, Tao Qi and Yongfeng Huang
9:00–11:00 De-Confounded Variational Encoder-Decoder for Logical Table-to-Text Generation

Wenqing Chen, Jidong Tian, Yitian Li, Hao He and Yaohui Jin
9:00–11:00 Rethinking Stealthiness of Backdoor Attack against NLP Models

Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou and Xu Sun
9:00–11:00 Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition
Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang and Pengjun Xie
9:00–11:00 Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han, Bo Pang and Ying Nian Wu
9:00–11:00 Embracing Ambiguity: Shifting the Training Target of NLI Models

Johannes Mario Meissner, Napat Thumwanit, Saku Sugawara and Akiko Aizawa
9:00–11:00 Exploring Distantly-Labeled Rationales in Neural Network Models

Quzhe Huang, Shengqi Zhu, Yansong Feng and Dongyan Zhao
9:00–11:00 Learning to Perturb Word Embeddings for Out-of-distribution QA

Seanie Lee, Minki Kang, Juho Lee and Sung Ju Hwang
Poster 3H: Dialog and Interactive Systems
9:00–11:00 Maria: A Visual Experience Powered Conversational Agent

Zujie Liang, Huang Hu, Can Xu, Chongyang Tao, Xiubo Geng, Yining Chen, Fan
Liang and Daxin Jiang
9:00–11:00 A Human-machine Collaborative Framework for Evaluating Malevolence in Dialogues
Yangjun Zhang, Pengjie Ren and Maarten de Rijke
9:00–11:00 Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational AutoEncoders
Bin Sun, Shaoxiong Feng, Yiwei Li, Jiamou Liu and Kan Li
9:00–11:00 Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning
Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Zijun Liu, Yanan Wu, Hong Xu, Huixing
Jiang and Weiran Xu
9:00–11:00 Learning to Ask Conversational Questions by Optimizing Levenshtein Distance

Zhongkun Liu, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Maarten de Rijke and
Ming Zhou
9:00–11:00 DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz
Geramifard and Satwik Kottur
9:00–11:00 Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking
Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si and Xiaodan Zhu
9:00–11:00 On the Generation of Medical Dialogs for COVID-19

Meng Zhou, Zechen Li, Bowen Tan, Guangtao Zeng, Wenmian Yang, Xuehai He,
Zeqian Ju, Subrato Chakravorty, Shu Chen, Xingyi Yang, Yichen Zhang, Qingyang
Wu, Zhou Yu, Kun Xu, Eric Xing and Pengtao Xie
9:00–11:00 Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images
Nyoungwoo Lee, Suwon Shin, Jaegul Choo, Ho-Jin Choi and Sung-Hyon Myaeng
9:00–11:00 MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion
Recognition in Conversation
Jingwen Hu, Yuchen Liu, Jinming Zhao and Qin Jin
9:00–11:00 DynaEval: Unifying Turn and Dialogue Level Evaluation

Chen Zhang, Yiming Chen, Luis Fernando D’Haro, Yan Zhang, Thomas Friedrichs,
Grandee Lee and Haizhou Li
9:00–11:00 Unsupervised Learning of KB Queries in Task-Oriented Dialogs

Dinesh Raghu, Nikhil Gupta and Mausam
Poster 3I: Ethics in NLP
9:00–11:00 Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection

Debora Nozza
Poster 3J: Resources and Evaluation
9:00–11:00 CoSQA: 20,000+ Web Queries for Code Search and Question Answering
Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming
Zhou and Nan Duan
9:00–11:00 QED: A Framework and Dataset for Explanations in Question Answering

Matthew Lamm, Jennimaria Palomaki, Chris Alberti, Daniel Andor, Eunsol Choi,
Livio Baldini Soares and Michael Collins
Poster 3K: Machine Translation and Multilinguality
9:00–11:00 Rewriter-Evaluator Architecture for Neural Machine Translation

Yangming Li and Kaisheng Yao
9:00–11:00 BERTTune: Fine-Tuning Neural Machine Translation with BERTScore

Inigo Jauregi Unanue, Jacob Parnell and Massimo Piccardi
9:00–11:00 Modeling Bilingual Conversational Characteristics for Neural Chat Translation

Yunlong Liang, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou
9:00–11:00 Importance-based Neuron Allocation for Multilingual Neural Machine Translation

Wanying Xie, Yang Feng, Shuhao Gu and Dong Yu
9:00–11:00 Transfer Learning for Sequence Generation: from Single-source to Multi-source

Xuancheng Huang, Jingfang Xu, Maosong Sun and Yang Liu
9:00–11:00 A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters
Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen
and Hinrich Schütze
Poster 3L: Discourse and Pragmatics
9:00–11:00 Coreference Reasoning in Machine Reading Comprehension

Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth and Iryna Gurevych
9:00–11:00 Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain
Wei Shi and Vera Demberg
9:00–11:00 Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing
Liwen Zhang, Ge Wang, Wenjuan Han and Kewei Tu
9:00–11:00 Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction

Ming Shen, Pratyay Banerjee and Chitta Baral
Poster 3M: Syntax: Tagging, Chunking, and Parsing
9:00–11:00 A Conditional Splitting Framework for Efficient Constituency Parsing

Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li
9:00–11:00 A Unified Generative Framework for Various NER Subtasks

Hang Yan, Tao Gui, Junqi Dai, Qipeng Guo, Zheng Zhang and Xipeng Qiu
9:00–11:00 An In-depth Study on Internal Structure of Chinese Words

Chen Gong, Saihao Huang, Houquan Zhou, Zhenghua Li, Min Zhang, Zhefeng
Wang, Baoxing Huai and Nicholas Jing Yuan
9:00–11:00 MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER
Linlin Liu, Bosheng Ding, Lidong Bing, Shafiq Joty, Luo Si and Chunyan Miao
9:00–11:00 Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

Wei Liu, Xiyan Fu, Yue Zhang and Wenming Xiao
Poster 3N: NLP Applications
9:00–11:00 Math Word Problem Solving with Explicit Numerical Values

Qinzhuo Wu, Qi Zhang, Zhongyu Wei and Xuanjing Huang
9:00–11:00 Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang and Liang Lin
9:00–11:00 SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang, Zerui Cai, Chengyu Wang, Minghui Qiu, Bite Yang and Xiaofeng He
9:00–11:00 What is Your Article Based On? Inferring Fine-grained Provenance

Yi Zhang, Zachary Ives and Dan Roth
9:00–11:00 Cross-modal Memory Networks for Radiology Report Generation

Zhihong Chen, Yaling Shen, Yan Song and Xiang Wan
9:00–11:00 Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection
Kamil Kanclerz, Alicja Figas, Marcin Gruza, Tomasz Kajdanowicz, Jan Kocon,
Daria Puchalska and Przemyslaw Kazienko
9:00–11:00 Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews
Junhao Liu, Zhen Hai, Min Yang and Lidong Bing
9:00–11:00 Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding

Xin Sun, Tao Ge, Furu Wei and Houfeng Wang
9:00–11:00 Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng
Chong and Shengping Liu
9:00–11:00 PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check
Li Huang, Junjie Li, Weiwei Jiang, Zhiyu Zhang, Minchuan Chen, Shaojun Wang
and Jing Xiao
Poster 3O: Language Generation
9:00–11:00 Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting
Yi Cheng, Siyao Li, Bang Liu, Ruihui Zhao, Sujian Li, Chenghua Lin and Yefeng
Zheng
9:00–11:00 Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation

Liang Li, Can Ma, Yinliang Yue and Dayong Hu
9:00–11:00 POS-Constrained Parallel Decoding for Non-autoregressive Generation

Kexin Yang, Wenqiang Lei, Dayiheng Liu, Weizhen Qi and Jiancheng Lv
9:00–11:00 Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation
Xin Liu, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Min Zhang,
Haiying Zhang and Jinsong Su
9:00–11:00 TGEA: An Error-Annotated Dataset and Benchmark Tasks for Text Generation from
Pretrained Language Models
Jie He, Bo Peng, Yi Liao, Qun Liu and Deyi Xiong
9:00–11:00 Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction
Chenliang Li, Bin Bi, Ming Yan, Wei Wang and Songfang Huang
Poster 3P: Summarization
9:00–11:00 Long-Span Summarization via Local Attention and Content Selection

Potsawee Manakul and Mark Gales
9:00–11:00 RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy

Xiyan Fu, Yating Zhang, Tianyi Wang, Xiaozhong Liu, Changlong Sun and Zhenglu
Yang
9:00–11:00 BASS: Boosting Abstractive Summarization with Unified Semantic Graph

Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu
and Haifeng Wang
9:00–11:00 Capturing Relations between Scientific Papers: An Abstractive Model for Related
Work Section Generation
Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Xiangliang Zhang, Dongyan
Zhao and Rui Yan
9:00–11:00 Focus Attention: Promoting Faithfulness and Diversity in Summarization

Rahul Aralikatte, Shashi Narayan, Joshua Maynez, Sascha Rothe and Ryan McDonald
9:00–11:00 Generating Query Focused Summaries from Query-Free Resources

Yumo Xu and Mirella Lapata
9:00–11:00 Demoting the Lead Bias in News Summarization via Alternating Adversarial Learning
Linzi Xing, Wen Xiao and Giuseppe Carenini
Poster 3Q: Question Answering
9:00–11:00 DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
Hongxuan Tang, Hongyu Li, Jing Liu, Yu Hong, Hua Wu and Haifeng Wang
9:00–11:00 Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving

Shih-hung Tsai, Chao-Chun Liang, Hsin-Min Wang and Keh-Yih Su
9:00–11:00 Robustifying Multi-hop QA through Pseudo-Evidentiality Training

Kyungjae Lee, Seung-won Hwang, Sang-eun Han and Dohyeon Lee
9:00–11:00 Multi-Scale Progressive Attention Network for Video Question Answering

Zhicheng Guo, Jiaxuan Zhao, Licheng Jiao, Xu Liu and Lingling Li
9:00–11:00 Efficient Passage Retrieval with Hashing for Open-domain Question Answering
Ikuya Yamada, Akari Asai and Hannaneh Hajishirzi
9:00–11:00 xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering
Nan Yang, Furu Wei, Binxing Jiao, Daxing Jiang and Linjun Yang
9:00–11:00 Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering
Gangwoo Kim, Hyunjae Kim, Jungsoo Park and Jaewoo Kang
Poster 3R: Language Grounding to Vision, Robotics and Beyond
9:00–11:00 PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For
Joint Image-Text Modeling
Xiaoxue Zang, Lijuan Liu, Maria Wang, Yang Song, Hao Zhang and Jindong Chen
9:00–11:00 Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual
Context in Multimodal Machine Translation
Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li and Ben Kao
9:00–11:00 Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
Ahjeong Seo, Gi-Cheon Kang, Joonhan Park and Byoung-Tak Zhang
9:00–11:00 Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers
Lisa Anne Hendricks, John Mellor, Rosalia Schneider, Jean-Baptiste Alayrac and
Aida Nematzadeh
Poster 3S: Information Extraction
9:00–11:00 BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named
Entity Recognition
Yinghao Li, Pranav Shetty, Lucas Liu, Chao Zhang and Le Song
9:00–11:00 CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation
Extraction
Tao Chen, Haizhou Shi, Siliang Tang, Zhigang Chen, Fei Wu and Yueting Zhuang
9:00–11:00 SENT: Sentence-level Distant Relation Extraction via Negative Training

Ruotian Ma, Tao Gui, Linyang Li, Qi Zhang, Xuanjing Huang and Yaqian Zhou
9:00–11:00 An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization
Baohang Zhou, Xiangrui Cai, Ying Zhang and Xiaojie Yuan
9:00–11:00 PRGC: Potential Relation and Global Correspondence Based Joint Relational
Triple Extraction
Hengyi Zheng, Rui Wen, Xi Chen, Yifan Yang, Yunyan Zhang, Ziheng Zhang,
Ningyu Zhang, Bin Qin, Xu Ming and Yefeng Zheng
9:00–11:00 Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition
Meihan Tong, Shuai Wang, Bin Xu, Yixin Cao, Minghui Liu, Lei Hou and Juanzi
Li
9:00–11:00 Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference
Tuan Lai, Heng Ji, ChengXiang Zhai and Quan Hung Tran
9:00–11:00 Entity Concept-enhanced Few-shot Relation Extraction

Shan Yang, Yongfei Zhang, Guanglin Niu, Qinghua Zhao and Shiliang Pu
9:00–11:00 Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation
Zixuan Zhang, Nikolaus Parulian, Heng Ji, Ahmed Elsayed, Skatje Myers and
Martha Palmer
9:00–11:00 Unleash GPT-2 Power for Event Detection

Amir Pouran Ben Veyseh, Viet Lai, Franck Dernoncourt and Thien Huu Nguyen
9:00–11:00 Improving Model Generalization: A Chinese Named Entity Recognition Case Study
Guanqing Liang and Cane Wing-Ki Leung
9:00–11:00 CLEVE: Contrastive Pre-training for Event Extraction

Ziqi Wang, Xiaozhi Wang, Xu Han, Yankai Lin, Lei Hou, Zhiyuan Liu, Peng Li,
Juanzi Li and Jie Zhou
9:00–11:00 Three Sentences Are All You Need: Local Path Enhanced Document Relation Extraction
Quzhe Huang, Shengqi Zhu, Yansong Feng, Yuan Ye, Yuxuan Lai and Dongyan
Zhao
9:00–11:00 Document-level Event Extraction via Parallel Prediction Networks

Hang Yang, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao and Taifeng Wang
9:00–11:00 StructuralLM: Structural Pre-training for Form Understanding

Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang and Luo
Si
Poster 3T: Sentiment Analysis, Stylistic Analysis, and Argument Mining
9:00–11:00 Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis

Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang and Eduard
Hovy
9:00–11:00 Multi-Label Few-Shot Learning for Aspect Category Detection

Mengting Hu, Shiwan Zhao, Honglei Guo, Chao Xue, Hang Gao, Tiegang Gao,
Renhong Cheng and Zhong Su
9:00–11:00 Argument Pair Extraction via Attention-guided Multi-Layer Multi-Cross Encoding

Liying Cheng, Tianyu Wu, Lidong Bing and Luo Si
9:00–11:00 A Neural Transition-based Model for Argumentation Mining

Jianzhu Bao, Chuang Fan, Jipeng Wu, Yixue Dang, Jiachen Du and Ruifeng Xu
11:00–12:00 Lifetime Award
Session 14A: Language Generation 2
14:00–14:10 Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text

Philippe Laban, Tobias Schnabel, Paul Bennett and Marti A. Hearst
14:10–14:20 Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence

Jian Guan, Xiaoxi Mao, Changjie Fan, Zitao Liu, Wenbiao Ding and Minlie Huang
14:20–14:30 OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics

Jian Guan, Zhexin Zhang, Zhuoer Feng, Zitao Liu, Wenbiao Ding, Xiaoxi Mao,
Changjie Fan and Minlie Huang
14:30–14:40 DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text
Generation
Xinyu Hua, Ashwin Sreevatsa and Lu Wang
14:40–14:50 Controllable Open-ended Question Generation with A New Question Type Ontology
Shuyang Cao and Lu Wang
14:50–15:00 BERTGen: Multi-task Generation through BERT

Faidon Mitzalis, Ozan Caglayan, Pranava Madhyastha and Lucia Specia
Session 14B: Machine Translation and Multilinguality 9
14:00–14:10 Selective Knowledge Distillation for Neural Machine Translation

Fusheng Wang, Jianhao Yan, Fandong Meng and Jie Zhou
14:10–14:20 Measuring and Increasing Context Usage in Context-Aware Machine Translation

Patrick Fernandes, Kayo Yin, Graham Neubig and André F. T. Martins
14:20–14:30 Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring
Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka and Eneko Agirre
14:30–14:40 CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web

Holger Schwenk, Guillaume Wenzek, Sergey Edunov, Edouard Grave, Armand
Joulin and Angela Fan
14:40–14:50 EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints
Weijia Xu and Marine Carpuat
14:50–15:00 Gender Bias in Machine Translation

Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri and Marco Turchi
14:00–14:10 Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with
Search
Gyuwan Kim and Kyunghyun Cho
14:10–14:20 GhostBERT: Generate More Features with Cheap Operations for BERT
Zhiqi Huang, Lu Hou, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu
14:20–14:30 Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang, Simiao Zuo, Minshuo Chen, Haoming Jiang, Xiaodong Liu,
Pengcheng He, Tuo Zhao and Weizhu Chen
14:30–14:40 A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations
Pierre Colombo, Pablo Piantanida and Chloé Clavel
14:40–14:50 Determinantal Beam Search

Clara Meister, Martina Forster and Ryan Cotterell
14:50–15:00 Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning
Shuoran Jiang, Qingcai Chen, Xin Liu, Baotian Hu and Lisai Zhang
14:00–14:10 Accelerating Text Communication via Abbreviated Sentence Input

Jiban Adhikary, Jamie Berger and Keith Vertanen
14:10–14:20 Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates
Yuqing Xie, Yi-An Lai, Yuanjun Xiong, Yi Zhang and Stefano Soatto
14:20–14:30 Detecting Propaganda Techniques in Memes

Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri,
Hamed Firooz, Preslav Nakov and Giovanni Da San Martino
14:30–14:37 Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders
Irene Li, Vanessa Yan, Tianxiao Li, Rihao Qu and Dragomir Radev
14:37–14:44 Attentive Multiview Text Representation for Differential Diagnosis

Hadi Amiri, Mitra Mohtarami and Isaac Kohane
14:44–14:51 MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical Domain
Christine Herlihy and Rachel Rudinger
Session 14E: Question Answering 4
14:00–14:10 On the Efficacy of Adversarial Data Collection for Question Answering: Results
from a Large-Scale Randomized Study
Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton and Wen-tau Yih
14:10–14:20 Learning Dense Representations of Phrases at Scale

Jinhyuk Lee, Mujeen Sung, Jaewoo Kang and Danqi Chen
14:20–14:30 End-to-End Training of Neural Retrievers for Open-Domain Question Answering

Devendra Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping,
William L. Hamilton and Bryan Catanzaro
14:30–14:40 Question Answering Over Temporal Knowledge Graphs

Apoorv Saxena, Soumen Chakrabarti and Partha Talukdar
14:40–14:47 Towards a more Robust Evaluation for Conversational Question Answering

Wissam Siblini, Baris Sayil and Yacine Kessaci
14:47–14:54 VAULT: VAriable Unified Long Text Representation for Machine Reading Comprehension
Haoyang Wen, Anthony Ferritto, Heng Ji, Radu Florian and Avi Sil
Session 15A: Language Generation 3
15:00–15:10 Language Model Augmented Relevance Score

Ruibo Liu, Jason Wei and Soroush Vosoughi
15:10–15:20 DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts
Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula,
Noah A. Smith and Yejin Choi
15:20–15:30 Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer and Daniel Weld
15:30–15:40 Metaphor Generation with Conceptual Mappings

Kevin Stowe, Tuhin Chakrabarty, Nanyun Peng, Smaranda Muresan and Iryna
Gurevych
15:40–15:50 Computational Framework for Slang Generation

Zhewei Sun, Richard Zemel and Yang Xu
15:50–15:57 Avoiding Overlap in Data Augmentation for AMR-to-Text Generation

Wenchao Du and Jeffrey Flanigan
Session 15B: NLP Applications 5
15:00–15:10 Learning Latent Structures for Cross Action Phrase Relations in Wet Lab Protocols
Chaitanya Kulkarni, Jany Chan, Eric Fosler-Lussier and Raghu Machiraju
15:10–15:20 Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task,
Dataset, and Neural Baselines
Ramit Sawhney, Mihir Goyal, Prakhar Goel, Puneet Mathur and Rajiv Ratn Shah
15:20–15:30 Mid-Air Hand Gestures for Post-Editing of Machine Translation

Rashad Albo Jamara, Nico Herbig, Antonio Krüger and Josef van Genabith
15:30–15:40 Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and
Symbolic Reasoning
Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang and
Song-Chun Zhu
15:40–15:50 Joint Verification and Reranking for Open Fact Checking Over Tables
Michael Sejr Schlichtkrull, Vladimir Karpukhin, Barlas Oguz, Mike Lewis, Wen-tau Yih and Sebastian Riedel
15:50–15:57 Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains
Chenghao Yang, Yudong Zhang and Smaranda Muresan
Session 15C: Resources and Evaluation 5
15:00–15:10 Evaluation of Thematic Coherence in Microblogs

Iman Munire Bilal, Bo Wang, Maria Liakata, Rob Procter and Adam Tsakalidis
15:10–15:20 Neural semi-Markov CRF for Monolingual Word Alignment

Wuwei Lan, Chao Jiang and Wei Xu
15:20–15:30 Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies
Mukund Srinath, Shomir Wilson and C Lee Giles
15:30–15:40 The statistical advantage of automatic NLG metrics at the system level
Johnny Wei and Robin Jia
15:40–15:50 Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph
Completion
Yixin Cao, Xiang Ji, Xin Lv, Juanzi Li, Yonggang Wen and Hanwang Zhang
15:50–15:57 Can Transformer Models Measure Coherence In Text: Re-Thinking the Shuffle Test
Philippe Laban, Luke Dai, Lucas Bandarkar and Marti A. Hearst
Session 15D: Summarization 2
15:00–15:10 ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining
Alexander Fabbri, Faiaz Rahman, Imad Rizvi, Borui Wang, Haoran Li, Yashar
Mehdad and Dragomir Radev
15:10–15:20 Improving Factual Consistency of Abstractive Summarization via Question Answering
Feng Nan, Cicero Nogueira dos Santos, Henghui Zhu, Patrick Ng, Kathleen McKeown, Ramesh Nallapati, Dejiao Zhang, Zhiguo Wang, Andrew O. Arnold and Bing Xiang
15:20–15:30 EmailSum: Abstractive Email Thread Summarization

Shiyue Zhang, Asli Celikyilmaz, Jianfeng Gao and Mohit Bansal
15:30–15:40 Cross-Lingual Abstractive Summarization with Limited Parallel Resources

Yu Bai, Yang Gao and Heyan Huang
15:40–15:50 Dissecting Generation Modes for Abstractive Summarization Models via Ablation
and Attribution
Jiacheng Xu and Greg Durrett
15:50–15:57 SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization
Yixin Liu and Pengfei Liu
Session 15E: Semantics: Lexical Semantics 2
15:00–15:10 Learning Prototypical Functions for Physical Artifacts

Tianyu Jiang and Ellen Riloff
15:10–15:20 Verb Knowledge Injection for Multilingual Event Processing

Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo Maria Ponti and Anna Korhonen
15:20–15:30 Dynamic Contextualized Word Embeddings

Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze
15:30–15:40 Lexical Semantic Change Discovery

Sinan Kurtyigit, Maike Park, Dominik Schlechtweg, Jonas Kuhn and Sabine Schulte
im Walde
15:40–15:50 Analysis and Evaluation of Language Models for Word Sense Disambiguation
Daniel Loureiro, Kiamehr Rezaee, Mohammad Taher Pilehvar and Jose Camacho-Collados
15:50–16:00 Let’s Play mono-poly: BERT Can Reveal Words’ Degree of Polysemy
Aina Garí Soler and Marianna Apidianaki
16:00–16:10 Pretraining the Noisy Channel Model for Task-Oriented Dialogue

Qi Liu, Lei Yu, Laura Rimell and Phil Blunsom
16:10–16:20 The R-U-A-Robot Dataset: Helping Avoid Chatbot Deception by Detecting User
Questions About Human or Non-Human Identity
David Gros, Yu Li and Zhou Yu
16:20–16:30 Conversation Graph: Data Augmentation, Training and Evaluation for Non-Deterministic Dialogue Management
Milan Gritta, Gerasimos Lampouras and Ignacio Iacobacci
16:30–16:40 Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems
Claudio Pinhanez, Paulo Cavalin, Victor Henrique Alves Ribeiro, Ana Appel,
Heloisa Candello, Julio Nogima, Mauro Pichiliani, Melina Guerra, Maira de Bayser,
Gabriel Malfatti and Henrique Ferreira
16:40–16:50 Space Efficient Context Encoding for Non-Task-Oriented Dialogue Generation with
Graph Attention Transformer
Fabian Galetzka, Jewgeni Rose, David Schlangen and Jens Lehmann
16:50–17:00 DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations
Dou Hu, Lingwei Wei and Xiaoyong Huai
16:00–16:10 Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability
Ka Wong, Praveen Paritosh and Lora Aroyo
16:10–16:20 TIMEDIAL: Temporal Commonsense Reasoning in Dialog

Lianhui Qin, Aditya Gupta, Shyam Upadhyay, Luheng He, Yejin Choi and Manaal
Faruqui
16:20–16:30 RAW-C: Relatedness of Ambiguous Words in Context (A New Lexical Resource for
English)
Sean Trott and Benjamin Bergen
16:30–16:40 ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic

Muhammad Abdul-Mageed, AbdelRahim Elmadany and El Moatez Billah Nagoudi
16:40–16:47 SaRoCo: Detecting Satire in a Novel Romanian Corpus of News Articles

Ana-Cristina Rogoz, Gaman Mihaela and Radu Tudor Ionescu
16:47–16:54 Bringing Structure into Summaries: a Faceted Summarization Dataset for Long
Scientific Documents
Rui Meng, Khushboo Thaker, Lei Zhang, Yue Dong, Xingdi Yuan, Tong Wang and
Daqing He
Session 16C: Semantics: Sentence-level Semantics, Textual Inference and Other areas 4
16:00–16:10 Improving Paraphrase Detection with the Adversarial Paraphrasing Task

Animesh Nighojkar and John Licato
16:10–16:20 ADEPT: An Adjective-Dependent Plausibility Task

Ali Emami, Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler and
Jackie Chi Kit Cheung
16:20–16:30 ReadOnce Transformers: Reusable Representations of Text for Transformers

Shih-Ting Lin, Ashish Sabharwal and Tushar Khot
16:30–16:40 Conditional Generation of Temporally-ordered Event Sequences

Shih-Ting Lin, Nathanael Chambers and Greg Durrett
16:40–16:50 Hate Speech Detection Based on Sentiment Knowledge Sharing

Xianbing Zhou, Yang Yong, Xiaochao Fan, Ge Ren, Yunfeng Song, Yufeng Diao,
Liang Yang and Hongfei Lin
Session 16D: Syntax: Tagging, Chunking, and Parsing 2
16:00–16:10 Transition-based Bubble Parsing: Improvements on Coordination Structure Prediction
Tianze Shi and Lillian Lee
16:10–16:20 SpanNER: Named Entity Re-/Recognition as Span Prediction

Jinlan Fu, Xuanjing Huang and Pengfei Liu
16:20–16:30 Strong Equivalence of TAG and CCG

Lena Katharina Schiffer and Andreas Maletti
16:30–16:40 StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler and Aaron Courville
16:40–16:47 Replicating and Extending “Because Their Treebanks Leak”: Graph Isomorphism,
Covariants, and Parser Performance
Mark Anderson, Anders Søgaard and Carlos Gómez-Rodríguez
Session 16E: Machine Translation and Multilinguality 10
16:00–16:10 Language Embeddings for Typology and Cross-lingual Transfer Learning

Dian Yu, Taiqi He and Kenji Sagae
16:10–16:20 Can Sequence-to-Sequence Models Crack Substitution Ciphers?

Nada Aldarrab and Jonathan May
16:20–16:30 Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation
Eleftheria Briakou and Marine Carpuat
16:30–16:40 Revisiting Negation in Neural Machine Translation

Gongbo Tang, Philipp Rönchen, Rico Sennrich and Joakim Nivre
16:40–16:50 Discriminative Reranking for Neural Machine Translation

Ann Lee, Michael Auli and Marc’Aurelio Ranzato
16:50–16:57 Don’t Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine
Translation Data
Rajat Bhatnagar, Ananya Ganesh and Katharina Kann
Best Paper Session
23:00–23:03 EXPLAINABOARD: An Explainable Leaderboard for NLP

Pengfei Liu, Jinlan Fu, Yang Xiao, Weizhe Yuan, Shuaichen Chang, Junqi Dai,
Yixin Liu, Zihuiwen Ye and Graham Neubig
23:03–23:16 Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering
Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei and Christopher Manning
23:16–23:29 All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text
Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan
and Noah A. Smith
23:29–23:42 Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers
Benjamin Marie, Atsushi Fujita and Raphael Rubino
23:42–23:55 Neural Machine Translation with Monolingual Translation Memory

Deng Cai, Yan Wang, Huayang Li, Wai Lam and Lemao Liu
23:55–00:08 Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

Armen Aghajanyan, Sonal Gupta and Luke Zettlemoyer
00:08–00:21 UnNatural Language Inference

Koustuv Sinha, Prasanna Parthasarathi, Joelle Pineau and Adina Williams
00:21–00:39 Including Signed Languages in Natural Language Processing

Kayo Yin, Amit Moryossef, Julie Hochgesang, Yoav Goldberg and Malihe Alikhani
00:39–00:57 Vocabulary Learning via Optimal Transport for Neural Machine Translation
Jingjing Xu, Hao Zhou, Chun Gan, Zaixiang Zheng and Lei Li
Thursday, August 5, 2021 (all times UTC+0)
01:00–01:30 Distinguished Service and Test-Of-Time Awards session
01:30–02:00 Closing and Future Conferences


