[Sigwac] Final Call for Papers: 4th Web as Corpus Workshop
(LREC 2008, Marrakech)
Pavel Pecina
pecina at ufal.mff.cuni.cz
Wed Feb 27 10:20:12 CET 2008
Hi Stefan, is there any chance for a deadline extension?
Couple of days would be great.
Thank you,
-Pavel
Stefan Evert píše v Pá 22. 02. 2008 v 17:48 +0100:
> Dear friends,
>
> please forward this call to interested colleagues and students.
> Paper submission has been rather slow so far, and we would like to
> have a sufficient number of submissions so we can be selective and
> put together and interesting and exciting workshop programme.
>
> Best wishes,
> Stefan
>
> ===== Second Call for Papers =====
>
> The 4th Web as Corpus workshop: Can we beat Google?
>
> Marrakech, Morocco (post-LREC workshop)
> 1 June 2008
>
> http://webascorpus.sf.net/WAC4/
>
> ==================================
>
> Submission deadline: 29 February 2008
>
> PAPER SUBMISSION: http://www.easychair.org/conferences/?conf=wac4
>
> ==================================
>
> DESCRIPTION
>
> Commercial Web search engines offer fast search on huge amounts of
> text, combined with increasingly clever ranking and data analysis
> algorithms, but their content-centric services do not cater to the
> needs of the computational linguistics and NLP communities. The
> leading theme of this workshop, the fourth in a row of highly
> successful Web as Corpus meetings, is to find out how to combine the
> power and scalability of modern search engine technology with
> sophisticated linguistic annotation and query processing.
>
> We invite papers on various topics concerning the use of Web
> resources for corpus research and NLP applications, including (but
> not limited to) the following:
>
> * linguistic Web crawler technology and Web corpus collection
> projects
> * applications of Web-derived corpora and other kinds of Web data
> * how far does the "easy way" get you? (using search engines, or
> Google's n-gram lists; we are particularly interested in a critical
> discussion of the usefulness and limitations of such approaches)
> * methods and tools for "cleaning" Web pages to turn them into a
> corpus (contributors to this topic will be encouraged to participate
> in the second CLEANEVAL competition to be held in 2009)
> * automatic linguistic annotation of Web data: tokenisation, POS
> tagging, lemmatisation, semantic tagging, etc. (established tools
> often perform very poorly on Web data)
> * search engine architectures for linguists: bringing linguistics
> to commercial search engines, or high-performance search technology
> to linguistics?
> * search engine-related topics such as result ranking (e.g. how
> to identify "typical" uses rather than returning 50 very similar
> matches on the first page)
> * duplicate detection, interactive query refinement, etc.
> * reviews and clever uses of search engine APIs (Google, Yahoo,
> Altavista, and in particular Microsoft's current generous LiveSearch
> API)
>
> This workshop is endorsed by the Special Interest Group on the Web as
> Corpus (SIGWAC) of the Association for Computational Linguistics (ACL).
>
> SUBMISSION INFORMATION
>
> Authors are invited to submit full papers on original, unpublished
> work in the topic area of this workshop. Submissions should follow
> the format of LREC proceedings and should not exceed eight (8) pages,
> including references. We strongly recommend the use of LREC LaTeX or
> Microsoft Word style files tailored for this year's conference.
>
> Submissions are managed via EasyChair.org. In order to submit a
> paper, go to:
>
> http://www.easychair.org/conferences/?conf=wac4
>
> and login (or register an account with EasyChair if you don't have
> one yet). After logging in, click 'New Submission' and fill in the
> standard fields.
>
> PROGRAMME COMMITTEE
>
> Silvia Bernardini, U of Bologna, Italy
> Massimiliano Ciaramita, Yahoo! Research Barcelona, Spain
> Jesse de Does, INL, Netherlands
> Katrien Depuydt, INL, Netherlands
> Stefan Evert, U of Osnabrück, Germany
> Cédrick Fairon, UCLouvain, Belgium
> William Fletcher, U.S. Naval Academy, USA
> Gregory Grefenstette, Commissariat à l'Énergie Atomique, France
> Péter Halácsy, Budapest U of Technology and Economics, Hungary
> Katja Hofmann, U of Amsterdam, Netherlands
> Adam Kilgarriff, Lexical Computing Ltd, UK
> Igor Leturia, Elhuyar Fundazioa, Basque Country, Spain
> Phil Resnik, U of Maryland, College Park, USA
> Kevin Scannell, Saint Louis U, USA
> Gilles-Maurice de Schryver, U Gent, Belgium
> Klaus Schulz, LMU München, Germany
> Serge Sharoff, U of Leeds, UK
> Eros Zanchetta, U of Bologna, Italy
>
> ORGANISING COMMITTEE
>
> Stefan Evert, University of Osnabrück
> Adam Kilgarriff, Lexical Computing
> Serge Sharoff, University of Leeds
>
> _______________________________________________
> Sigwac mailing list
> Sigwac at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/sigwac
More information about the Sigwac
mailing list