Members of the Cyberling community might be interested in the Free Science Blog, for discussion of open access publishing.
Interview: New blog for experimental statistics in corpus linguistics
An interview with Sean Wallis, author of http://corplingstats.wordpress.com/:
What led you to set up the blog?
The blog comes from several sources. My research background is in cognitive science and AI, and in particular machine learning applied to scientific research, and statistics is a key component of that. I have been involved in regular debates about the role of statistical evidence in corpus linguistics over the years, so (for example) you will find some of the same experimental design themes about choice in our 2002 book, http://www.ucl.ac.uk/english-usage/projects/ice-gb/book.htm. I am not a linguist "by trade" but a methodologist, so I can only work by collaborating with and learning from others.
Liberman on Open Access and the three-legged stool
This post over at Language Log is highly recommended. A quick excerpt:
“reproducible research” [...] requires three things: (1) the data sets that serve as input; (2) the programs needed to run the experiment; and (3) a comprehensible account of what the experiment does, why it matters, and what the results are.
This is from Mark Liberman's abstract for his talk at the Berlin 9 Open Access Conference taking place in Maryland (not Berlin).
Conference on Science and the Internet 2012
From the call for papers:
Online media have brought about numerous changes in scholarly practices, including, but not limited to gathering data, finding relevant literature, making research and results accessible, organising collaboration, communicating with colleagues and students as well as creating fruitful learning environments.
Claire Bowern has started a discussion on her blog, Anggarrgoon, about access to aggregated lexical data: how to protect the rights of the various stake holders while encouraging as much sharing as possible. I enjoyed her tongue-in-cheek suggestion that linguist-contributors should, in game-theoretic fashion, get access to data in proportion to the data they share.
We have had serious issues with spam comments on this blog, and so the spam filters are tuned to be fairly aggressive. If you find that your (legitimate) comments are being rejected as spam, please contact me and I'll try to make sure you can be heard!
From the workshop webpage:
Tomorrow Kimberly Christen (WSU) will give a talk at the University of Washington on the Mukurtu Indigenous Archive Tool and repatriation of indigenous knowledge in digital form.
Abstract, from the UW Simpson Center for the Humanities calendar:
FLaReNet, Language Grid and META-SHARE are co-hosting the Workshop on Language Resources, Technology and Services in the Sharing Paradigm at IJCNLP 2011. From the call for papers:
The Workshop aims at addressing (some of the) technological, market and policy challenges posed by the “sharing and openness paradigm”, the major role that language resources can play and the consequences of this paradigm on language resources themselves.
The Open Knowledge Foundation (cf. the next post on OKCon) has a working group on Linguistic data, known as Open Linguistics. That website has links to various resources, including linguistics-related posts on the Open Knowledge Foundation blog.