BEA Newsletter #12

Hi all,

The BEA11 workshop is quickly approaching and we thought it would be a good time to send out a newsletter. This one contains the following:

  • BEA11 Information
  • EduNLP Publications
  • Upcoming EduNLP conferences and workshops

I’m actually amazed at how many education-related papers are being published this year in major conferences such as ACL, IJCAI, NAACL, LREC, etc. This has to be a banner year for research in this field! Check out the publications below.

As always, I’d like to thank Ekaterina Kochmar, Ildiko Pilan, and Sowmya V. B. for once again helping to put together another information-packed newsletter. And as always, if you know of someone who would want to be on the email list, please have them fill out the form. I’ve also added the authors of accepted BEA11 papers to this email; if you are interested in receiving the bi-monthly newsletter, please fill out the form as well.

Hope you have a good start to the new year and look forward to seeing you in 2016!

Best, Joel & BEA Organization

BEA11 Announcements

The BEA11 workshop takes place this coming Thursday, June 16. We hope that many of you will be able to attend. The program can be found here, and you can download papers from this temporary site. Registration is still open and can be accessed here. We are also proud to announce that we are hosting the first shared task on Automated Evaluation of Scientific Writing.

As in the last few years, we will have free BEA T-shirts for all who register. These shirts can be picked up at the workshop. And as always, we will be having our world-famous post-workshop dinner following the conclusion of the workshop Thursday night. We hope you can join us for dinner then; details to be announced during the workshop.

Finally, if you have any announcements on relevant events, resources, job postings, etc. and would like them advertised at the workshop, please let me know before Wednesday 17:00 PST.

Recent EduNLP Publications


  • CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence, Alessia Barbagli, Pietro Lucisano, Felice Dell’Orletta, Simonetta Montemagni and Giulia Venturi.

  • The COPLE2 corpus: a learner corpus for Portuguese, Amália Mendes, Sandra Antunes, Maarten Janssen and Anabela Gonçalves.

  • Compilation of an Arabic Children’s Corpus, Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell and Ayman Alghamdi.

  • SubCo: A Learner Translation Corpus of Human and Machine Subtitles, José Manuel Martínez Martínez and Mihaela Vela.

  • Design and Development of the MERLIN Learner Corpus Platform, Verena Lyding and Karin Schöne.

  • French Learners Audio Corpus of German Speech (FLACGS), Jane Wottawa and Martine Adda-Decker.

  • The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis, Beata Megyesi, Jesper Näsman and Anne Palmér.

  • OSMAN ― A Novel Arabic Readability Metric, Mahmoud El-Haj and Paul Rayson.

  • MADAD: A Readability Annotation Tool for Arabic Text. Nora Al-Twairesh, Abeer Al-Dayel, Hend Al-Khalifa, Maha Al-Yahya, Sinaa Alageel, Nora Abanmy and Nouf Al-Shenaifi.

  • Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities. Victoria Yaneva, Irina Temnikova and Ruslan Mitkov.

  • Analysis of English Spelling Errors in a Word-Typing Game, Ryuichi Tachibana and Mamoru Komachi.

  • Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource, Anaïs Tack, Thomas Francois, Anne-Laure Ligozat and Cédrick Fairon.

  • B2SG: a TOEFL-like Task for Portuguese, Rodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira and Aline Villavicencio.

  • What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets, Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang.

  • Palabras: Crowdsourcing Transcriptions of L2 Speech, Eric Sanders, Pepi Burgos, Catia Cucchiarini and Roeland van Hout.

  • The Validation of MRCPD Cross-language Expansions on Imageability Ratings, Ting Liu, Kit Cho, Tomek Strzalkowski, Samira Shaikh and Mehrdad Mirzaei.

  • Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages, Stefan Ecker, Andrea Horbach and Stefan Thater.

  • Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario, Lena Keiper, Andrea Horbach and Stefan Thater.

  • SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies, Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg and Monica Sandell.

  • SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners, Thomas Francois, Elena Volodina, Ildikó Pilán and Anaïs Tack.

  • Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language, Yow-Ting Shiue and Hsin-Hsi Chen.

  • Leveraging Native Data to Correct Preposition Errors in Learners’ Dutch, Lennart Kloppenburg and Malvina Nissim.

  • Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for Irish, Neasa Ní Chiaráin and Ailbhe Ní Chasaide.

  • A Shared Task for Spoken CALL?, Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell and Helmer Strik.

  • DALILA: The Dialectal Arabic Linguistic Learning Assistant, Salam Khalifa, Houda Bouamor and Nizar Habash.

  • Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers, Marie Garnier and Patrick Saint-Dizier.

  • Joining-in-type Humanoid Robot Assisted Language Learning System, AlBara Khalifa, Tsuneo Kato and Seiichi Yamamoto.

  • Predicting Reading Difficulty for Readers with Autism Spectrum Disorder, Victoria Yaneva, Richard Evans and Irina Temnikova. ISI-NLP (Improving Social Inclusion using NLP) LREC Workshop.

  • SimplexEduReading: Simplification of Natural Language for Reading Comprehension Improvement in Education, Estela Saquete, Ruben Izquierdo Bevia and Sonia Vazquez. ISI-NLP (Improving Social Inclusion using NLP) LREC Workshop.


  • Mapping Verbs in Different Languages to Knowledge Base Relations using Web Text as Interlingua. Derry Tanti Wijaya and Tom Mitchell

  • Predicting Student Essay Structure. Isaac Persing

  • Cross-lingual Wikification Using Multilingual Embeddings. Chen-Tse Tsai and Dan Roth

  • Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning. Yulia Tsvetkov, Sunayana Sitaram, Manaal Faruqui, Guillaume Lample, Patrick Littell, David R. Mortensen, Alan W Black, Lori Levin and Chris Dyer

  • Automatic Summarization of Student Course Feedback. Wencan Luo, Fei Liu, Zitao Liu and Diane Litman

  • Bootstrapping Translation Detection and Sentence Extraction from Comparable Corpora. Kriste Krstovski and David Smith

  • Discriminative Reranking for Grammatical Error Correction with Statistical Machine Translation. Tomoya Mizumoto and Yuji Matsumoto

  • Fast and Easy Short Answer Grading with High Accuracy. Md Arafat Sultan, Cristobal Salazar and Tamara Sumner

  • Grammatical error correction using neural machine translation. Zheng Yuan and Ted Briscoe

  • Interpretese vs. Translationese: The Uniqueness of Human Strategies in Simultaneous Interpretation. He He, Jordan Boyd-Graber and Hal Daumé III

  • Multimodal Semantic Learning from Child-Directed Input. Angeliki Lazaridou, Grzegorz Chrupała, Raquel Fernandez and Marco Baroni

  • Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks. Ji Young Lee and Franck Dernoncourt

  • This is how we do it: Answer Reranking for Open-domain How Questions with Paragraph Vectors and Minimal Feature Engineering. Dasha Bogdanova and Jennifer Foster

  • Using Context to Predict the Purpose of Argumentative Writing Revisions. Fan Zhang and Diane Litman

NAACL Workshop on Multilingual and Cross-lingual Methods in NLP

  • Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages. Markus Saers and Dekai Wu

  • Comparing Fifty Natural Languages and Twelve Genetic Languages Using Word Embedding Language Divergence (WELD) as a Quantitative Measure of Language Distance. Ehsaneddin Asgari and Mohammad R.K. Mofrad


  • A CALL system for learning preposition usage. John Lee

  • A Thorough Examination of the CNN / Daily Mail Reading Comprehension Task. Danqi Chen, Jason Bolton and Christopher D. Manning

  • Automatic Stance Classification of Argumentative Essays. Isaac Persing

  • Automatic Text Scoring Using Neural Networks. Dimitrios Alikaniotis, Helen Yannakoudakis and Marek Rei

  • Compositional Sequence Labeling Models for Error Detection in Learner Writing. Marek Rei and Helen Yannakoudakis

  • Constrained Multi-Task Learning for Automated Essay Scoring. Ronan Cummins, Meng Zhang and Ted Briscoe

  • Cross-Lingual Lexico-Semantic Transfer in Language Learning. Ekaterina Kochmar and Ekaterina Shutova

  • Easy Questions First? Curriculum Learning for Question Answering. Mrinmaya Sachan and Eric Xing

  • Grammatical Error Correction: Machine Translation and Classifiers. Alla Rozovskaya and Dan Roth

  • Learning Language Games through Interaction. Sida I. Wang, Percy Liang and Christopher D. Manning

  • Off-topic Response Detection for Spontaneous Spoken English Assessment. Andrey Malinin, Rogier van Dalen, Kate Knill, Yu Wang and Mark Gales

  • Phrase Structure Annotation and Parsing for Learner English. Ryo Nagata and Keisuke Sakaguchi

  • Universal Dependencies for Learner English. Yevgeni Berzak, Jessica Kenney, Carolyn Spadine, Jing Xian Wang, Lucia Lam, Keiko Sophie Mori, Sebastian Garza and Boris Katz

  • User Modeling in Language Learning with Macaronic Texts. Adithya Renduchintala, Rebecca Knowles, Philipp Koehn and Jason Eisner

  • Science Question Answering using Instructional Materials. Mrinmaya Sachan, Kumar Dubey and Eric Xing

ACL System Demonstrations

  • Creating Interactive Macaronic Interfaces For Language Learning. Adithya Renduchintala, Rebecca Knowles, Philipp Koehn and Jason Eisner

  • Language Muse: Automated Linguistic Activity Generation For English language Learners. Nitin Madnani, Jill Burstein, John Sabatini, Kietha Biggers and Slava Andreyev

  • My Science Tutor - Learning Science With A Conversational Virtual Tutor. Sameer Pradhan, Ron Cole and Wayne Ward

  • Online Information Retrieval For Language Learning. Maria Chinkina, Madeeswaran Kannan and Detmar Meurers

  • Personalized Exercises For Preposition Learning. John Lee and Mengqi Luo


  • Neural Network Translation Models for Grammatical Error Correction. Shamil Chollampatt, Kaveh Taghipour, Hwee Tou Ng

  • Exploiting N-Best Hypotheses to Improve an SMT Approach to Grammatical Error Correction. Duc Tam Hoang, Shamil Chollampatt, Hwee Tou Ng

  • Parse Tree Fragmentation of Ungrammatical Sentences. Homa B. Hashemi and Rebecca Hwa

Other venues

  • Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction. Allen Schmaltz, Yoon Kim, Alexander M. Rush, Stuart M. Shieber

  • Readability-based Sentence Ranking for Evaluating Text Simplification. Sowmya Vajjala, Detmar Meurers

  • Reassessing the Goals of Grammatical Error Correction: Fluency Instead of Grammaticality. Keisuke Sakaguchi, Courtney Napoles, Matt Post, and Joel Tetreault

  • Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction. Marcin Junczys-Dowmunt, Roman Grundkiewicz

Upcoming EduNLP Events

  • Joint 5th Workshop on NLP4CALL (NLP for Computer-Assisted Language Learning) and 1st Workshop on NLP4LA (NLP for Research in Language Acquisition) (deadline: September 27, date: November 16, 2016, location: Umeå, Sweden) [Link]

  • Traitement Automatique des Langues (TAL) Special Issue on NLP for Learning and Teaching (deadline: September 30, date: June 30, 2017) [Link]

  • AIED Special Issue on MARWIDE: Multidisciplinary Approaches to Reading and Writing Integrated with Disciplinary Education (to be published Q1 2017)

  • 12th Teaching and Language Corpora Conference (Gießen, Germany; July 20-23, 2016)

  • 2nd Workshop on Language Teaching, Learning and Technology (LTLT) (San Francisco, CA, USA; September 06-07, 2016)

  • 5th Workshop on Child Computer Interaction (WOCCI) (San Francisco, CA, USA; September 06-07, 2016)

  • CSL special issue on Language and Interaction Technologies for Children (target publication date: January 01, 2017)