Emily M. Bender

Jun 202012
 

An interview with Sean Wallis, author of http://corplingstats.wordpress.com/:

What led you to set up the blog?

The blog comes from several sources. My research background is in cognitive science and AI, and in particular machine learning applied to scientific research, and statistics is a key component of that. I have been involved in regular debates about the role of statistical evidence in corpus linguistics over the years, so (for example) you will find some of the same experimental design themes about choice in our 2002 book, http://www.ucl.ac.uk/english-usage/projects/ice-gb/book.htm. I am not a linguist "by trade" but a methodologist, so I can only work by collaborating with and learning from others.

read more

Nov 102011
 

This post over at Language Log is highly recommended. A quick excerpt:


“reproducible research” [...] requires three things: (1) the data sets that serve as input; (2) the programs needed to run the experiment; and (3) a comprehensible account of what the experiment does, why it matters, and what the results are.

This is from Mark Liberman's abstract for his talk at the Berlin 9 Open Access Conference taking place in Maryland (not Berlin).

Oct 142011
 

From the call for papers:

Online media have brought about numerous changes in scholarly practices, including, but not limited to gathering data, finding relevant literature, making research and results accessible, organising collaboration, communicating with colleagues and students as well as creating fruitful learning environments.

read more

May 182011
 

Claire Bowern has started a discussion on her blog, Anggarrgoon, about access to aggregated lexical data: how to protect the rights of the various stake holders while encouraging as much sharing as possible. I enjoyed her tongue-in-cheek suggestion that linguist-contributors should, in game-theoretic fashion, get access to data in proportion to the data they share.

read more

Apr 272011
 

We have had serious issues with spam comments on this blog, and so the spam filters are tuned to be fairly aggressive. If you find that your (legitimate) comments are being rejected as spam, please contact me and I'll try to make sure you can be heard!

Mar 112011
 

FLaReNet, Language Grid and META-SHARE are co-hosting the Workshop on Language Resources, Technology and Services in the Sharing Paradigm at IJCNLP 2011. From the call for papers:

The Workshop aims at addressing (some of the) technological, market and policy challenges posed by the “sharing and openness paradigm”, the major role that language resources can play and the consequences of this paradigm on language resources themselves.