BEA Newsletter #9

12 minute read

Hi all, if you’re new to this list, the BEA newsletter is something we’ve created to complement the BEA workshop: every month or two we’ll send out an email with updates not only about the workshop, but on other EduNLP events and resources that you might want to know about. Also, if you know of someone who should be added to the mailing list or if you wish to be removed, please send a note.

In this newsletter we’ll cover the following:

BEA Information form (must read)
BEA10 Announcements
Upcoming EduNLP Conferences / Workshops
EduNLP Publications from the last three months (big list!)

As always, I’d like to thank our new BEA Newsletter volunteers: Ekaterina Kochmar, Ildiko Pilan, Somwya V. B. and Helen Yannakoudakis for assisting in the writing of this newsletter. Thanks!

And as always, if you know of any corpora, resources, tools, pubs, conferences, job postings, etc. that would be good to have on the newsletter, please let us know and they’ll go in the next one.

Joel & BEA Friends

BEA Information Form

To date, this mailing list and the one used for reviewing has been manually constructed. As the field has grown considerably over the last few years, it is sometimes hard to keep track of everyone’s affiliations but also research interests, which helps us best match papers with appropriate reviewers. To address this we’ve created a google form where you can submit your information and then it will be saved to a google spreadsheet which the BEA organizers can use for this newsletter but also for the workshop. If you could please take 90 seconds to fill out the form, this will help us (and the field) considerably.

If you’ve done this a year or two before, please update this again. We ask a few different questions and of course email addresses, affiliations, interests change. If you know of others who would like to be added to the mailing list, please send them the above link. Thanks!

BEA10 Announcements

The BEA10 workshop took place in Denver almost three months ago and it was a success! We had one of the highest BEA participation numbers ever (58, the only other highest one was 63 but we also a hosted a shared task, so this is technically the largest BEA-only) and I think we were once again one of the largest one-day workshops at ACL. This says a lot about the growth and strength of our field! The workshop featured a large list of sponsors, free t-shirts, and of course an energetic day of oral and poster presentations! Further statistics and information can be found on the website and more specifically on the opening remarks slides.

You’ll also note that in the Workshop Program section, we’ve listed links to the papers as well as the slides. If you’re an author and haven’t sent me your materials, or have auxiliary materials you’d like me to post, just send an email. I know a few people sent in their posters to me, I’m putting those up this week.

I also took a few photos at the BEA10 poster session.

Upcoming EduNLP Conferences and Workshops

Recent Advances in Natural Language Processing (RANLP) http://lml.bas.bg/ranlp2015/start.php (abstract deadline: April 27; paper deadline: May 04; dates: September 5-11, location: Hissar, Bulgaria)
6th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT) http://www.slpat.org/slpat2015/ (deadline: June 08; date: September 11; location: Dresden, Germany )
Conference of the European Association for Computer-Assisted Language Learning (EUROCALL) http://www.eurocall2015.it/en/sistemacongressi/eurocall-2015/website/home/ (abstract deadline: February 15; submission deadline: June 30; dates: August 26-29; location: Padova, Italy)

Recent EduNLP Publications

EMNLP 2015

A Graph-based Readability Assessment Method using Word Coupling. Zhiwei Jiang, Gang Sun, Qing Gu and Daoxu Chen
Flexible Domain Adaptation for Automated Essay Scoring Using Correlated Linear Regression. Peter Phandi, Kian Ming A. Chai and Hwee Tou Ng
Spelling Correction of User Search Queries through Statistical Machine Translation. Saša Hasan, Carmen Heger and Saab Mansour
Human Evaluation of Grammatical Error Correction Systems. Roman Grundkiewicz, Marcin Junczys-Dowmunt and Edward Gillian
Noise or additional information? Using crowdsource annotation item agreement for natural language tasks. Emily Jamison and Iryna Gurevych
Discourse Element Identification in Student Essays based on Global and Local Sentence Chains. Wei Song
Summarizing Student Responses to Reflection Prompts. Wencan Luo and Diane Litman
Combining Geometric, Textual and Visual Features for Generating Prepositions in Image Descriptions. Arnau Ramisa, Josiah Wang, Ying Lu, Emmanuel Dellandrea, Francesc Moreno-Noguer and Robert Gaizauskas

TACL papers at EMNLP

Problems in Current Text Simplification Research: New Data Can Help. Wei Xu, Chris Callison-Burch, Courtney Napoles

Workshop on Vision and Language at EMNLP 2015

The full list of accepted papers is online.

Relevant papers:

Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis. Angeliki Lazaridou, Dat Tien Nguyen and Marco Baroni
Describing Spatial Relationships between Objects in Images in English and French. Anja Belz, Adrian Muscat, Maxime Aberton and Sami Benjelloun
Towards Reliable Automatic Multimodal Content Analysis. Olli Philippe Lautenbacher, Liisa Tiittula, Maija Hirvonen, Jorma Laaksonen and Mikko Kurimo
Visual Classifier Prediction by Distributional Semantic Embedding of Text Descriptions. Mohamed Elhoseiny and Ahmed Elgammal

CogACLL 2015 (Cognitive Aspects of Computational Language Learning)

The full list of accepted papers is online.

Relevant papers:

A Computational Study of Cross-situational Lexical Learning of Brazilian Portuguese. Pablo Faria
An Agent-based Model of a Historical Word Order Change. Jelke Bloem, Arjen Versloot and Fred Weerman
Evaluating Models of Computation and Storage in Human Sentence Processing. Thang Luong, Timothy O’Donnell and Noah Goodman
Perceptual, Conceptual, and Frequency Effects on Error Patterns in English Color Term Acquisition. Barend Beekhuizen and Suzanne Stevenson
Reading Metrics for Estimating Task Efficiency with SMT Output. Sigrid Klerke, Sheila Castilho and Anders Søgaard
Using Reading Behavior to Predict Grammatical Functions. Maria Barrett
Which Distributional Cues Help the Most? Unsupervised Contexts Selection for Lexical Category Acquisition. Giovanni Cassani, Robert Grimm, Walter Daelemans and Steven Gillis

CoNLL 2015

Contrastive Analysis with Predictive Power: Typology Driven Estimation of Grammatical Error Distributions in ESL. Yevgeni Berzak, Roi Reichart and Boris Katz

ACL 2015 System Demonstrations

A System Demonstration of a Framework for Computer Assisted Pronunciation Training. Renlong Ai and Feiyu Xu
LEXenstein: A Framework for Lexical Simplification. Gustavo Paetzold and Lucia Specia
WriteAhead: Mining Grammar Patterns in Corpora for Assisted Writing. Tzu-Hsi Yen, Jian-Cheng Wu, Jim Chang, Joanne Boisson and Jason Chang

ACL SIGHAN Workshop

The proceedings are available online.

Relevant Papers:

Sentence selection for automatic scoring of Mandarin proficiency. Jiahong Yuan; Xiaoying Xu; Wei Lai; Weiping Ye; Xinru Zhao; Mark Liberman
Introduction to SIGHAN 2015 Bake-off for Chinese Spelling Check. Yuen-Hsien Tseng; Lung-Hao Lee; Li-Ping Chang; Hsin-Hsi Chen
HANSpeller++: A Unified Framework for Chinese Spelling Correction. Shuiyuan Zhang; Jinhua Xiong; Jianpeng Hou; Qiao Zhang; Xueqi Cheng
Word Vector/Conditional Random Field-based Chinese Spelling Error Detection for SIGHAN-2015 Evaluation. Yih-Ru Wang; Yuan-Fu Liao
Introduction to a Proofreading Tool for Chinese Spelling Check Task of SIGHAN-8. Tao-Hsing Chang; Hsueh-Chih Chen; Cheng-Han Yang
Chinese Spelling Check System Based on N-gram Model. Weijian Xie; Peijie Huang; Xinrui Zhang; Kaiduo Hong; Qiang Huang; Bingzhou Chen; Lei Huang
NTOU Chinese Spelling Check System in Sighan-8 Bake-off. Wei-Cheng Chu; Chuan-Jie Lin

ACL Workshop on Arabic Natural Language Processing

The program and procedings are available online.

Relevant papers:

A Conventional Orthography for Algerian Arabic. Houda Saadane and Nizar Habash
Best Practices for Crowdsourcing Dialectal Arabic Speech Transcription. Samantha Wray, Hamdy Mubarak and Ahmed Ali
Multi-Reference Evaluation for Dialectal Speech Recognition System: A Study for Egyptian ASR. Ahmed Ali, Walid Magdy and Steve Renals
The Second QALB Shared Task on Automatic Text Correction for Arabic. Alla Rozovskaya, Houda Bouamor, Nizar Habash, Wajdi Zaghouani, Ossama Obeid and Behrang Mohit
QCRI@QALB-2015 Shared Task: Correction of Arabic Text for Native and Non-Native Speakers’ Errors. Hamdy Mubarak, Kareem Darwish and Ahmed Abdelali
Arib@QALB-2015 Shared Task: A Hybrid Cascade Model for Arabic Spelling Error Detection and Correction. Nouf AlShenaifi, Rehab AlNefie, Maha Al-Yahya and Hend Al-Khalifa
SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers’ Errors. Wajdi Zaghouani, Taha Zerrouki and Amar Balla
GWU-HASP-2015@QALB‐2015 Shared Task: Priming Spelling Candidates with Probability. Mohammed Attia, Mohamed Al-Badrashiny and Mona Diab
QCMUQ@QALB-2015 Shared Task: Combining Character level MT and Error-tolerant Finite-State Recognition for Arabic Spelling Correction. Houda Bouamor, Hassan Sajjad, Nadir Durrani and Kemal Oflazer
UMMU@QALB-2015 Shared Task: Character and Word level SMT pipeline for Automatic Error Correction of Arabic Text. Fethi Bougares and Houda Bouamor
TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System. Djamel MOSTEFA, Jaber ABUALASAL, Omar ASBAYOU, Mahmoud GZAWI and Ramzi Abbès
CUFE@QALB-2015 Shared Task: Arabic Error Correction System. Michael Nawar Ibrahim and Moheb M. Ragheb

ACL Workshop on Building and Using Comparable Corpora

The program and proceedings are online.

Application of a Corpus to Identify Gaps between English Learners and Native Speakers. Katsunori Kotani and Takehiko Yoshimi http://aclweb.org/anthology/W/W15/W15-3406.pdf

NLP-TEA-2 & CGED

The 2nd Workshop on Natural Language Processing Techniques for Educational Applications with a Shared Task on Chinese Grammatical Error Diagnosis. The program and proceedings are online.

Collocation Assistant for Learners of Japanese as a Second Language. Lis Pereira, and Yuji Matsumoto
Semi-automatic Generation of Multiple-Choice Tests from Mentions of Semantic Relations. Renlong Ai, Sebastian Krause, Walter Kasper, Feiyu Xum, and Hans Uszkoreit
Interactive Second Language Learning from News Websites. Tao Chen, Naijia Zheng, Yue Zhao, Muthu Kumar Chandrasekaran, and Min-Yen Kan
Bilingual Keyword Extraction and its Educational Application. Chung-Chi Huang, Mei-Hua Chen, and Ping-Che Yang
Annotating Entailment Relations for Shortanswer Questions. Simon Ostermann, Andrea Horbach, and Manfred Pinkal
An Automated Scoring Tool for Korean Supply-type Items Based on Semi-Supervised Learning. Minah Cheon, Hyeong-Won Seo, Jae-Hoon Kim, Eun-Hee Noh, Kyung-Hee Sung, and EunYong Lim
A System for Generating Multiple Choice Questions: With a Novel Approach for Sentence Selection. Mukta Majumder, and Sujan Kumar Saha
The “News Web Easy’’ news service as a resource for teaching and learning Japanese: An assessment of the comprehension difficulty of Japanese sentence-end expressions. Hideki Tanaka, Tadashi Kumano, and Isao Goto
Grammatical Error Correction Considering Multi-word Expressions. Tomoya Mizumoto, Masato Mita, and Yuji Matsumoto
Salinlahi III: An Intelligent Tutoring System for Filipino Heritage Language Learners. Ralph Vincent Regalado, Michael Louie Boñon, Nadine Chua, Rene Rose Piñera, and Shannen Rose Dela Cruz
Using Finite State Transducers for Helping Foreign Language Learning. Hasan Kaya and Gülşen Eryiğit
Condition Random Fields-based Grammatical Error Detection for Chinese as Second Language. Jui-Feng Yeh, Chan Kun Yeh, Kai-Hsiang Yu, Ya-Ting Li, and Wan-Ling Tsai
Overview of the NLP-TEA 2015 Shared Task for Chinese Grammatical Error Diagnosis. Lung-Hao Lee, Liang-Chih Yu, and Li-Ping Chang
Chinese Grammatical Error Diagnosis by Conditional Random Fields. Shih-Hung Wu, Po-Lin Chen, Liang-Pu Chen, Ping-Che Yang, and Ren-Dar Yang
NTOU Chinese Grammar Checker for CGED Shared Task. Chuan-Jie Lin, and Shao-Heng Chen
Chinese Grammatical Error Diagnosis Using Ensemble Learning. Yang Xiang, Xiaolong Wang, Wenying Han, and qinghua Hong
Improving Chinese Grammatical Error Correction with Corpus Augmentation and Hierarchical Phrase-based Statistical Machine Translation. Yinchen Zhao, Mamoru Komachi and Hiroshi Ishikawa
Chinese Grammatical Error Diagnosis System Based on Hybrid Model. Xiupeng Wu, Peijie Huang, Jundong Wang, Qingwen Guo, Yuhong Xu and Chuping Chen

AIED 2015

Contextual Recommendation of Educational Contents. Nidhi Saraswat, Hiranmay Ghosh, Mohit Agrawal and Uma Narayanan
Leveraging Multiple Views of Text for Automatic Question Generation. Karen Mazidi and Rodney D. Nielsen
Machine Learning for Holistic Evaluation of Scientific Essays. Simon Hughes, Dylan Blaum, Mary Anne Britt, Peter Hastings and Patricia Wallace
Evaluating Human and Automated Generation of Distractors for Diagnostic Multiple-Choice Cloze Questions to Assess Children’s Reading Comprehension. Yi-Ting Huang and Jack Mostow
Distractor quality evaluation in Multiple Choice Questions. Van-Minh Pho, Anne-Laure Ligozat and Brigitte Grau
A framework for automated generation of questions across formal domains. Rahul Singhal, Martin Henz and Shubham Goyal

EDM 2015

You are your words: Modeling Students’ Vocabulary Knowledge with Natural Language Processing Techniques. Laura K.Allen and Danielle McNamara
Automatic Grading of Short Answers for MOOC via Semi-supervised Document Clustering. Shumin Jing
Data-Driven Analyses of Electronic Text Books. Ahcène Boubekki, Ulf Kröhne, Frank Goldhammer, Waltraud Schreiber and Ulf Brefeld.
Integrating Process and Product Data: The Case of an Automated Writing Evaluation System. Chaitanya Ramineni, Tiago Caliço and Chen Li
Your Model Is Predictive — but Is It Useful? Theoretical and Empirical Considerations of a New Paradigm for Adaptive Tutoring Evaluation. José P. González-Brenes and Yun Huang
Tutorial on Using Natural Language Processing Tools in Educational Data Mining by Scott A.Crossley.

RANLP 2015

Norwegian Native Language Identification. Shervin Malmasi and Irina Temnikova
DanProof: Pedagogical Spell and Grammar Checking for Danish. Eckhard Bick
Automatic Text Simplification for Spanish: Comparative Evaluation of Various Simplification Strategies. Sanja Štajner, Iacer Calixto and Horacio Saggion
A new approach to automated text readability classification based on concept indexing with integrated part-of-speech n-gram features. Abigail Razon and John Barnden
Feature Extraction for Native Language Identification Using Language Modeling. Vincent Kríž, Martin Holub and Pavel Pecina
Classification of Lexical Collocation Errors in the Writings of Learners of Spanish. Sara Rodríguez-Fernández, Roberto Carlini and Leo Wanner
Automatic Acquisition of Artifact Nouns in French. Xiaoqin Hu and Pierre-André Buvet
Readability Assessment of Translated Texts. Alina Maria Ciobanu, Liviu P. Dinu and Flaviu Pepelea
Translating from Original to Simplified Sentences using Moses: When does it Actually Work?. Sanja Štajner and Horacio Saggion

SLaTE 2015

The program for both days and the proceedings are online.

Eurocall Conference 2015

The full program is available online.

Other Venues

Non-native text analysis with Syntactic Diff, a general comparative text mining framework. Sean Alexander Massung, MS Thesis, UIUC, 2015
Automated Analysis of Text in Graduate School Recommendations. Michael Heilman, F. Jay Breyer, Frank Williams, David Klieger and Michael Flor. ETS Research Report Series, 2015
Using Natural Language Processing Technology to Analyze Teachers’ Written Feedback on Chinese Students’ English Essays. Ming Liu. International Journal of Learning, Teaching and Educational Research, 2015
Automatic Coding of Short Text Responses via Clustering in Educational Assessment. Fabian Zehner, Christine Sälzer and Frank Goldhammer. Educational and Psychological Measurement Journal, 2015
Comparing the Linguistic Complexity in Receptive and Productive Modes. Jessie Saraza Barrot. GEMA Online Journal of Language Studies
Construct validity in TOEFL iBT speaking tasks: Insights from Natural Language Processing. Kristopher Kyle, Scott A.Crossley and Danielle McNamara Language Testing journal, 2015
Leveling L2 Texts Through Readability: Combining Multilevel Linguistic Features with the CEFR. Yao-Ting Sung, Wei-Chun lin, Scott Benjamin Dyson, Kuo-En Chang and Yu-Chia Chen. The Modern Language Journal, 99(2), 371-391, 2015.
Syntactic complexity in college-level English writing: Differences among writers with diverse L1 backgrounds. Xiaofei Lu and Haiyang Ai. Journal of Second Language Writing, 2015.
Building a Lexicon of Formulaic Language for Language Learners. Brooke et.al, 11th Multi Word Expressions Workshop, 2015
Automatically Detecting Syntactic Errors in Sentences Written by Learners of Chinese as a Foreign Language. Tao-Hsing Chang, Yao-Ting Sung and Jia-Fei Hong. Computational Linguistics and Chinese Language Processing, 2015
InkWell: A Creative Writer’s Creative Assistant. Richard P. Gabriel, Jeffrey Nichols, Jilin Chen. In Proceedings of the 2015 ACM SIGCHI Conference on Creativity and Cognition

Share on

Twitter Facebook LinkedIn

BEA Team

BEA Information Form

BEA10 Announcements

Upcoming EduNLP Conferences and Workshops

Recent EduNLP Publications

EMNLP 2015

TACL papers at EMNLP

Workshop on Vision and Language at EMNLP 2015

CogACLL 2015 (Cognitive Aspects of Computational Language Learning)

CoNLL 2015

ACL 2015 System Demonstrations

ACL SIGHAN Workshop

ACL Workshop on Arabic Natural Language Processing

ACL Workshop on Building and Using Comparable Corpora

NLP-TEA-2 & CGED

AIED 2015

EDM 2015

RANLP 2015

SLaTE 2015

Eurocall Conference 2015

Other Venues

Share on