What's New!
Background and Motivation
A long-standing problem in Natural Language Processing has been a lack
of large-scale knowledge for computers. The emergence of the Web and
the rapid increase of information on the Web brought us to what could
be called the "information explosion era," and drastically changed the
environment of NLP. The Web is not only a marvelous target for NLP,
but also a valuable resource from which knowledge could be extracted
for computers. Motivated by the desire to have a very first
opportunity to discuss early approaches to those issues and to share
the state-of-the-art technologies at that time, the first
International Workshop on NLP Challenges in the Information Explosion
Era (NLPIX 2008) was successfully held in conjunction with WWW 2008 in
Beijing.
Since the discussion of the first workshop, research and development
activities on large-scale text processing and large-scale knowledge
acquisition become much more popular these days. The large-scale NLP
naturally requires large-scale infrastructures, such as
neatly-prepared huge corpora, robust morpho-syntactic tools, and
high-performance computing environments. However, such infrastructures
can not be prepared by individual researchers nor research groups
alone in general, although of course we know some exceptions. Based on
this motivation, towards much larger-scale NLP, activities aiming at
constructing and sharing the infrastructures have continued. Although
we have found many publications presented in recent
conferences/workshops including the above mentioned workshop, we still
do not have opportunities to compare latest approaches, share analysis
on advantages/disadvantages, and discuss possible directions towards
further improvement and innovation.
Furthermore, beyond the success of large-scale NLP and knowledge
acquisition, we are starting to face a new problem: how to manage and
use the automatically acquired knowledge (AAK in short). We are still
not confident that those large-scale AAK can actually solve real world
problems. How to incorporate the AAK into existing NLP frameworks and
how to manage them are yet unsolved issues. One approach could be some
bootstrapping of extracting knowledge and enhancing NLP based on the
knowledge. The representation and standardization of AAK are also
emerging important issues. One of the most highly demanded
applications for AAK-based NLP is a semantic search to cope with the
information explosion on the Web. Though our daily life heavily
depends on the Web information, our diversified needs have not been
sufficiently satisfied by the existing search engines. AAK-based NLP
can be a key technology to realize a new-generation semantic search,
which incorporates enhanced information access, analysis and
organization.
Theme and Topics
The aim of the second workshop of the series of International Workshop on NLP Challenges in the
Information Explosion Era (NLPIX) is to bring researchers and practitioners together in order to
discuss large-scale and sharable NLP infrastructures, and furthermore to discuss emerging NEW
issues beyond them. Possible topics of the paper submissions include, but are not limited to:
- Construction of large corpora (crawling, preprocessing)
- Sharable large resources (e.g., Google N-gram statistics, etc.)
- Standard for a linguistic annotation framework
- Knowledge acquisition from very large corpora
- Bootstrapping approach for knowledge acquisition
- Large scale text mining based on shallow/deep NLP
- Managing and sharing acquired knowledge
- Exploitation of acquired knowledge for real applications
- Knowledge-based information access, analysis, and organization
- High performance/parallel computing environment for NLP
- Cloud computing for NLP
In particular, we solicit the papers that aim at fulfilling a NOVEL type of
needs in Web access and that can provide a new insight into future directions
of Web access research.
Invited Talks
- Hang Li (Microsoft Research Asia)
Query Understanding in Web Search - by Large Scale Log Data Mining and Statistical Learning [Slides]
- Hoifung Poon (University of Washington)
Statistical Relational Learning for Knowledge Extraction from the Web [Slides]
Accepted Papers
- Even Unassociated Features Can Improve Lexical Distributional Similarity
Kazuhide Yamamoto and Takeshi Asakura
- Summarizing Search Results using PLSI
Jun Harashima and Sadao Kurohashi
- A look inside the distributionally similar terms
Kow Kuroda, Jun'ichi Kazama and Kentaro Torisawa
- Utilizing Citations of Foreign Words in Corpus-Based Dictionary Generation
Reinhard Rapp and Michael Zock
- Large Corpus-based Semantic Feature Extraction for Pronoun Coreference
Shasha Liao and Ralph Grishman
- Mining coreference relations between formulas and texts usingWikipedia
Minh Nghiem Quoc, Keisuke Yokoi and Akiko Aizawa
- Exploiting Term Importance Categories and Dependency Relations for Natural Language Search
Keiji Shinzato and Sadao Kurohashi
- Automatic Classification of Semantic Relations between Facts and Opinions
Koji Murakami, Eric Nichols, Kentaro Inui, Junta Mizuno, Hayato Goto, Megumi Ohki, Suguru Matsuyoshi and Yuji Matsumoto
- Adverse-Effect Relations Extraction from Massive Clinical Records
Yasuhide Miura, Eiji Aramaki, Tomoko Ohkuma, Masatsugu Tonoike, Daigo Sugihara, Hiroshi Masuichi and Kazuhiko Ohe
Workshop Schedule / Important Dates
- Submission deadline: May 30, 2010 by 23:59 PDT (GMT -7 hours)
- Notification of acceptance: June 30, 2010
- Camera ready due: July 10, 2010
- Workshop date: August 28, 2010
Registration
Workshop Organizers
- Sadao Kurohashi, Kyoto University, Japan
- Takehito Utsuro, University of Tsukuba, Japan
Program Committee
- Pushpak Bhattacharyya, IIT, India
- Thorsten Brants, Google, USA
- Eric Villemonte de la Clergerie, INRIA, France
- Atsushi Fujii, Tokyo Institute of Technology, Japan
- Julio Gonzalo, UNED, Spain
- Kentaro Inui, Tohoku University, Japan
- Noriko Kando, NII, Japan
- Daisuke Kawahara, NICT, Japan
- Jun'ichi Kazama, NICT, Japan
- Adam Kilgarriff, Lexical Computing Ltd., UK
- Gary Geunbae Lee, POSTECH, Korea
- Hang Li, Microsoft, China
- Dekang Lin, Google, USA
- Tatsunori Mori, Yokohama National University, Japan
- Satoshi Sekine, New York University, USA
- Kenjiro Taura, University of Tokyo, Japan
- Kentaro Torisawa, NICT, Japan
- Marco Turchi, European Commission - Joint Research Centre, Italy
- Yunqing Xia, Tsinghua University, China
Previous NLPIX Workshop
NLP Challenges in the Information Explosion Era (NLPIX 2008), at WWW2008 in Beijing, China.
Contact Us