BEA Newsletter #9

12 minute read

Hi all, if you’re new to this list, the BEA newsletter is something we’ve created to complement the BEA workshop: every month or two we’ll send out an email with updates not only about the workshop, but on other EduNLP events and resources that you might want to know about. Also, if you know of someone who should be added to the mailing list or if you wish to be removed, please send a note.

In this newsletter we’ll cover the following:

  • BEA Information form (must read)
  • BEA10 Announcements
  • Upcoming EduNLP Conferences / Workshops
  • EduNLP Publications from the last three months (big list!)

As always, I’d like to thank our new BEA Newsletter volunteers: Ekaterina Kochmar, Ildiko Pilan, Somwya V. B. and Helen Yannakoudakis for assisting in the writing of this newsletter. Thanks!

And as always, if you know of any corpora, resources, tools, pubs, conferences, job postings, etc. that would be good to have on the newsletter, please let us know and they’ll go in the next one.

Joel & BEA Friends

BEA Information Form

To date, this mailing list and the one used for reviewing has been manually constructed. As the field has grown considerably over the last few years, it is sometimes hard to keep track of everyone’s affiliations but also research interests, which helps us best match papers with appropriate reviewers. To address this we’ve created a google form where you can submit your information and then it will be saved to a google spreadsheet which the BEA organizers can use for this newsletter but also for the workshop. If you could please take 90 seconds to fill out the form, this will help us (and the field) considerably.

If you’ve done this a year or two before, please update this again. We ask a few different questions and of course email addresses, affiliations, interests change. If you know of others who would like to be added to the mailing list, please send them the above link. Thanks!

BEA10 Announcements

The BEA10 workshop took place in Denver almost three months ago and it was a success! We had one of the highest BEA participation numbers ever (58, the only other highest one was 63 but we also a hosted a shared task, so this is technically the largest BEA-only) and I think we were once again one of the largest one-day workshops at ACL. This says a lot about the growth and strength of our field! The workshop featured a large list of sponsors, free t-shirts, and of course an energetic day of oral and poster presentations! Further statistics and information can be found on the website and more specifically on the opening remarks slides.

You’ll also note that in the Workshop Program section, we’ve listed links to the papers as well as the slides. If you’re an author and haven’t sent me your materials, or have auxiliary materials you’d like me to post, just send an email. I know a few people sent in their posters to me, I’m putting those up this week.

I also took a few photos at the BEA10 poster session.

Upcoming EduNLP Conferences and Workshops

Recent EduNLP Publications

EMNLP 2015

  • A Graph-based Readability Assessment Method using Word Coupling. Zhiwei Jiang, Gang Sun, Qing Gu and Daoxu Chen

  • Flexible Domain Adaptation for Automated Essay Scoring Using Correlated Linear Regression. Peter Phandi, Kian Ming A. Chai and Hwee Tou Ng

  • Spelling Correction of User Search Queries through Statistical Machine Translation. Saša Hasan, Carmen Heger and Saab Mansour

  • Human Evaluation of Grammatical Error Correction Systems. Roman Grundkiewicz, Marcin Junczys-Dowmunt and Edward Gillian

  • Noise or additional information? Using crowdsource annotation item agreement for natural language tasks. Emily Jamison and Iryna Gurevych

  • Discourse Element Identification in Student Essays based on Global and Local Sentence Chains. Wei Song

  • Summarizing Student Responses to Reflection Prompts. Wencan Luo and Diane Litman

  • Combining Geometric, Textual and Visual Features for Generating Prepositions in Image Descriptions. Arnau Ramisa, Josiah Wang, Ying Lu, Emmanuel Dellandrea, Francesc Moreno-Noguer and Robert Gaizauskas

TACL papers at EMNLP

  • Problems in Current Text Simplification Research: New Data Can Help. Wei Xu, Chris Callison-Burch, Courtney Napoles

Workshop on Vision and Language at EMNLP 2015

The full list of accepted papers is online.

Relevant papers:

  • Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis. Angeliki Lazaridou, Dat Tien Nguyen and Marco Baroni

  • Describing Spatial Relationships between Objects in Images in English and French. Anja Belz, Adrian Muscat, Maxime Aberton and Sami Benjelloun

  • Towards Reliable Automatic Multimodal Content Analysis. Olli Philippe Lautenbacher, Liisa Tiittula, Maija Hirvonen, Jorma Laaksonen and Mikko Kurimo

  • Visual Classifier Prediction by Distributional Semantic Embedding of Text Descriptions. Mohamed Elhoseiny and Ahmed Elgammal

CogACLL 2015 (Cognitive Aspects of Computational Language Learning)

The full list of accepted papers is online.

Relevant papers:

  • A Computational Study of Cross-situational Lexical Learning of Brazilian Portuguese. Pablo Faria

  • An Agent-based Model of a Historical Word Order Change. Jelke Bloem, Arjen Versloot and Fred Weerman

  • Evaluating Models of Computation and Storage in Human Sentence Processing. Thang Luong, Timothy O’Donnell and Noah Goodman

  • Perceptual, Conceptual, and Frequency Effects on Error Patterns in English Color Term Acquisition. Barend Beekhuizen and Suzanne Stevenson

  • Reading Metrics for Estimating Task Efficiency with SMT Output. Sigrid Klerke, Sheila Castilho and Anders Søgaard

  • Using Reading Behavior to Predict Grammatical Functions. Maria Barrett

  • Which Distributional Cues Help the Most? Unsupervised Contexts Selection for Lexical Category Acquisition. Giovanni Cassani, Robert Grimm, Walter Daelemans and Steven Gillis

CoNLL 2015

  • Contrastive Analysis with Predictive Power: Typology Driven Estimation of Grammatical Error Distributions in ESL. Yevgeni Berzak, Roi Reichart and Boris Katz

ACL 2015 System Demonstrations

  • A System Demonstration of a Framework for Computer Assisted Pronunciation Training. Renlong Ai and Feiyu Xu

  • LEXenstein: A Framework for Lexical Simplification. Gustavo Paetzold and Lucia Specia

  • WriteAhead: Mining Grammar Patterns in Corpora for Assisted Writing. Tzu-Hsi Yen, Jian-Cheng Wu, Jim Chang, Joanne Boisson and Jason Chang


The proceedings are available online.

Relevant Papers:

  • Sentence selection for automatic scoring of Mandarin proficiency. Jiahong Yuan; Xiaoying Xu; Wei Lai; Weiping Ye; Xinru Zhao; Mark Liberman

  • Introduction to SIGHAN 2015 Bake-off for Chinese Spelling Check. Yuen-Hsien Tseng; Lung-Hao Lee; Li-Ping Chang; Hsin-Hsi Chen

  • HANSpeller++: A Unified Framework for Chinese Spelling Correction. Shuiyuan Zhang; Jinhua Xiong; Jianpeng Hou; Qiao Zhang; Xueqi Cheng

  • Word Vector/Conditional Random Field-based Chinese Spelling Error Detection for SIGHAN-2015 Evaluation. Yih-Ru Wang; Yuan-Fu Liao

  • Introduction to a Proofreading Tool for Chinese Spelling Check Task of SIGHAN-8. Tao-Hsing Chang; Hsueh-Chih Chen; Cheng-Han Yang

  • Chinese Spelling Check System Based on N-gram Model. Weijian Xie; Peijie Huang; Xinrui Zhang; Kaiduo Hong; Qiang Huang; Bingzhou Chen; Lei Huang

  • NTOU Chinese Spelling Check System in Sighan-8 Bake-off. Wei-Cheng Chu; Chuan-Jie Lin

ACL Workshop on Arabic Natural Language Processing

The program and procedings are available online.

Relevant papers:

  • A Conventional Orthography for Algerian Arabic. Houda Saadane and Nizar Habash

  • Best Practices for Crowdsourcing Dialectal Arabic Speech Transcription. Samantha Wray, Hamdy Mubarak and Ahmed Ali

  • Multi-Reference Evaluation for Dialectal Speech Recognition System: A Study for Egyptian ASR. Ahmed Ali, Walid Magdy and Steve Renals

  • The Second QALB Shared Task on Automatic Text Correction for Arabic. Alla Rozovskaya, Houda Bouamor, Nizar Habash, Wajdi Zaghouani, Ossama Obeid and Behrang Mohit

  • QCRI@QALB-2015 Shared Task: Correction of Arabic Text for Native and Non-Native Speakers’ Errors. Hamdy Mubarak, Kareem Darwish and Ahmed Abdelali

  • Arib@QALB-2015 Shared Task: A Hybrid Cascade Model for Arabic Spelling Error Detection and Correction. Nouf AlShenaifi, Rehab AlNefie, Maha Al-Yahya and Hend Al-Khalifa

  • SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers’ Errors. Wajdi Zaghouani, Taha Zerrouki and Amar Balla

  • GWU-HASP-2015@QALB‐2015 Shared Task: Priming Spelling Candidates with Probability. Mohammed Attia, Mohamed Al-Badrashiny and Mona Diab

  • QCMUQ@QALB-2015 Shared Task: Combining Character level MT and Error-tolerant Finite-State Recognition for Arabic Spelling Correction. Houda Bouamor, Hassan Sajjad, Nadir Durrani and Kemal Oflazer

  • UMMU@QALB-2015 Shared Task: Character and Word level SMT pipeline for Automatic Error Correction of Arabic Text. Fethi Bougares and Houda Bouamor

  • TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System. Djamel MOSTEFA, Jaber ABUALASAL, Omar ASBAYOU, Mahmoud GZAWI and Ramzi Abbès

  • CUFE@QALB-2015 Shared Task: Arabic Error Correction System. Michael Nawar Ibrahim and Moheb M. Ragheb

ACL Workshop on Building and Using Comparable Corpora

The program and proceedings are online.

  • Application of a Corpus to Identify Gaps between English Learners and Native Speakers. Katsunori Kotani and Takehiko Yoshimi


The 2nd Workshop on Natural Language Processing Techniques for Educational Applications with a Shared Task on Chinese Grammatical Error Diagnosis. The program and proceedings are online.

  • Collocation Assistant for Learners of Japanese as a Second Language. Lis Pereira, and Yuji Matsumoto

  • Semi-automatic Generation of Multiple-Choice Tests from Mentions of Semantic Relations. Renlong Ai, Sebastian Krause, Walter Kasper, Feiyu Xum, and Hans Uszkoreit

  • Interactive Second Language Learning from News Websites. Tao Chen, Naijia Zheng, Yue Zhao, Muthu Kumar Chandrasekaran, and Min-Yen Kan

  • Bilingual Keyword Extraction and its Educational Application. Chung-Chi Huang, Mei-Hua Chen, and Ping-Che Yang

  • Annotating Entailment Relations for Shortanswer Questions. Simon Ostermann, Andrea Horbach, and Manfred Pinkal

  • An Automated Scoring Tool for Korean Supply-type Items Based on Semi-Supervised Learning. Minah Cheon, Hyeong-Won Seo, Jae-Hoon Kim, Eun-Hee Noh, Kyung-Hee Sung, and EunYong Lim

  • A System for Generating Multiple Choice Questions: With a Novel Approach for Sentence Selection. Mukta Majumder, and Sujan Kumar Saha

  • The “News Web Easy’’ news service as a resource for teaching and learning Japanese: An assessment of the comprehension difficulty of Japanese sentence-end expressions. Hideki Tanaka, Tadashi Kumano, and Isao Goto

  • Grammatical Error Correction Considering Multi-word Expressions. Tomoya Mizumoto, Masato Mita, and Yuji Matsumoto

  • Salinlahi III: An Intelligent Tutoring System for Filipino Heritage Language Learners. Ralph Vincent Regalado, Michael Louie Boñon, Nadine Chua, Rene Rose Piñera, and Shannen Rose Dela Cruz

  • Using Finite State Transducers for Helping Foreign Language Learning. Hasan Kaya and Gülşen Eryiğit

  • Condition Random Fields-based Grammatical Error Detection for Chinese as Second Language. Jui-Feng Yeh, Chan Kun Yeh, Kai-Hsiang Yu, Ya-Ting Li, and Wan-Ling Tsai

  • Overview of the NLP-TEA 2015 Shared Task for Chinese Grammatical Error Diagnosis. Lung-Hao Lee, Liang-Chih Yu, and Li-Ping Chang

  • Chinese Grammatical Error Diagnosis by Conditional Random Fields. Shih-Hung Wu, Po-Lin Chen, Liang-Pu Chen, Ping-Che Yang, and Ren-Dar Yang

  • NTOU Chinese Grammar Checker for CGED Shared Task. Chuan-Jie Lin, and Shao-Heng Chen

  • Chinese Grammatical Error Diagnosis Using Ensemble Learning. Yang Xiang, Xiaolong Wang, Wenying Han, and qinghua Hong

  • Improving Chinese Grammatical Error Correction with Corpus Augmentation and Hierarchical Phrase-based Statistical Machine Translation. Yinchen Zhao, Mamoru Komachi and Hiroshi Ishikawa

  • Chinese Grammatical Error Diagnosis System Based on Hybrid Model. Xiupeng Wu, Peijie Huang, Jundong Wang, Qingwen Guo, Yuhong Xu and Chuping Chen

AIED 2015

  • Contextual Recommendation of Educational Contents. Nidhi Saraswat, Hiranmay Ghosh, Mohit Agrawal and Uma Narayanan

  • Leveraging Multiple Views of Text for Automatic Question Generation. Karen Mazidi and Rodney D. Nielsen

  • Machine Learning for Holistic Evaluation of Scientific Essays. Simon Hughes, Dylan Blaum, Mary Anne Britt, Peter Hastings and Patricia Wallace

  • Evaluating Human and Automated Generation of Distractors for Diagnostic Multiple-Choice Cloze Questions to Assess Children’s Reading Comprehension. Yi-Ting Huang and Jack Mostow

  • Distractor quality evaluation in Multiple Choice Questions. Van-Minh Pho, Anne-Laure Ligozat and Brigitte Grau

  • A framework for automated generation of questions across formal domains. Rahul Singhal, Martin Henz and Shubham Goyal

EDM 2015

  • You are your words: Modeling Students’ Vocabulary Knowledge with Natural Language Processing Techniques. Laura K.Allen and Danielle McNamara

  • Automatic Grading of Short Answers for MOOC via Semi-supervised Document Clustering. Shumin Jing

  • Data-Driven Analyses of Electronic Text Books. Ahcène Boubekki, Ulf Kröhne, Frank Goldhammer, Waltraud Schreiber and Ulf Brefeld.

  • Integrating Process and Product Data: The Case of an Automated Writing Evaluation System. Chaitanya Ramineni, Tiago Caliço and Chen Li

  • Your Model Is Predictive — but Is It Useful? Theoretical and Empirical Considerations of a New Paradigm for Adaptive Tutoring Evaluation. José P. González-Brenes and Yun Huang

  • Tutorial on Using Natural Language Processing Tools in Educational Data Mining by Scott A.Crossley.

RANLP 2015

  • Norwegian Native Language Identification. Shervin Malmasi and Irina Temnikova

  • DanProof: Pedagogical Spell and Grammar Checking for Danish. Eckhard Bick

  • Automatic Text Simplification for Spanish: Comparative Evaluation of Various Simplification Strategies. Sanja Štajner, Iacer Calixto and Horacio Saggion

  • A new approach to automated text readability classification based on concept indexing with integrated part-of-speech n-gram features. Abigail Razon and John Barnden

  • Feature Extraction for Native Language Identification Using Language Modeling. Vincent Kríž, Martin Holub and Pavel Pecina

  • Classification of Lexical Collocation Errors in the Writings of Learners of Spanish. Sara Rodríguez-Fernández, Roberto Carlini and Leo Wanner

  • Automatic Acquisition of Artifact Nouns in French. Xiaoqin Hu and Pierre-André Buvet

  • Readability Assessment of Translated Texts. Alina Maria Ciobanu, Liviu P. Dinu and Flaviu Pepelea

  • Translating from Original to Simplified Sentences using Moses: When does it Actually Work?. Sanja Štajner and Horacio Saggion

SLaTE 2015

The program for both days and the proceedings are online.

Eurocall Conference 2015

The full program is available online.

Other Venues

  • Non-native text analysis with Syntactic Diff, a general comparative text mining framework. Sean Alexander Massung, MS Thesis, UIUC, 2015

  • Automated Analysis of Text in Graduate School Recommendations. Michael Heilman, F. Jay Breyer, Frank Williams, David Klieger and Michael Flor. ETS Research Report Series, 2015

  • Using Natural Language Processing Technology to Analyze Teachers’ Written Feedback on Chinese Students’ English Essays. Ming Liu. International Journal of Learning, Teaching and Educational Research, 2015

  • Automatic Coding of Short Text Responses via Clustering in Educational Assessment. Fabian Zehner, Christine Sälzer and Frank Goldhammer. Educational and Psychological Measurement Journal, 2015

  • Comparing the Linguistic Complexity in Receptive and Productive Modes. Jessie Saraza Barrot. GEMA Online Journal of Language Studies

  • Construct validity in TOEFL iBT speaking tasks: Insights from Natural Language Processing. Kristopher Kyle, Scott A.Crossley and Danielle McNamara Language Testing journal, 2015

  • Leveling L2 Texts Through Readability: Combining Multilevel Linguistic Features with the CEFR. Yao-Ting Sung, Wei-Chun lin, Scott Benjamin Dyson, Kuo-En Chang and Yu-Chia Chen. The Modern Language Journal, 99(2), 371-391, 2015.

  • Syntactic complexity in college-level English writing: Differences among writers with diverse L1 backgrounds. Xiaofei Lu and Haiyang Ai. Journal of Second Language Writing, 2015.

  • Building a Lexicon of Formulaic Language for Language Learners. Brooke, 11th Multi Word Expressions Workshop, 2015

  • Automatically Detecting Syntactic Errors in Sentences Written by Learners of Chinese as a Foreign Language. Tao-Hsing Chang, Yao-Ting Sung and Jia-Fei Hong. Computational Linguistics and Chinese Language Processing, 2015

  • InkWell: A Creative Writer’s Creative Assistant. Richard P. Gabriel, Jeffrey Nichols, Jilin Chen. In Proceedings of the 2015 ACM SIGCHI Conference on Creativity and Cognition