Other Workshops and Events (1997)



bib (full) Fifth Workshop on Very Large Corpora

pdf bib
Fifth Workshop on Very Large Corpora

pdf bib
Summary of Invited Speech
Mitch Marcus

pdf bib
Commercial Implementation of Text Recognition Tools for VLC
John Rausch

pdf bib
Commercial Impact of VLC Research
Howard Turtle

pdf bib
A Statistics-Based Chinese Parser
Qiang Zhou

pdf bib
Probabilistic Parsing of Unrestricted English Text, With a Highly-Detailed Grammar
Ezra Black | Stephen Eubank | Hideki Kashioka | David Magerman

pdf bib
Grammar Acquisition Based on Clustering Analysis and Its Application to Statistical Parsing
Thanaruk Theeramunkong | Manabu Okumura

pdf bib
Reestimation and Best-First Parsing Algorithm for Probabilistic Dependency Grammars
Seungmi Lee | Key-Sun Choi

pdf bib
Domain-Specific Semantic Class Disambiguation Using WordNet
Li Shiuau Peh

pdf bib
Corpus Based PP Attachment Ambiguity Resolution with a Semantic Dictionary
Jiri Stetina | Makoto Nagao

pdf bib
Corpus Based Statistical Generalization Tree in Rule Optimization
Joyce Yue Chai | Alan W. Biermann

pdf bib
Clustering Co-occurrence Graph based on Transitivity
Kumiko Tanaka-Ishii

pdf bib
Knowledge Acquisition: Classification of Terms in a Thesaurus from a Corpus
Jean-David Sta

pdf bib
Data Reliability and Its Effects on Automatic Abstracting
Tadashi Nomoto | Yuji Matsumoto

pdf bib
Automatic Identification of Zero Pronouns and their Antecedents within Aligned Sentence Pairs
Hiromi Nakaiwa

pdf bib
Statistical Acquisition of Terminology Dictionary
Xuan-jing Huang | Li-de Wu | Wen-xin Wang

pdf bib
Acquiring German Prepositional Subcategorization Frames from Corpora
Erika F. de Lima

pdf bib
A Natural Language Correction Model for Continuous Speech Recognition
Tomek Strzalkowski | Ronald Brandow

pdf bib
The Effects of Corpus Size and Homogeneity on Language Model Quality
Tony G. Rose

pdf bib
Finding Terminology Translations from Non-parallel Corpora
Pascale Fung

pdf bib
A Self-Organizing Japanese Word Segmenter using Heuristic Word Identification and Re-estimation
Masaaki Nagata

pdf bib
Collocation Lattices and Maximum Entropy Models
Andrei Mikheev

pdf bib
Using Word Frequency Lists to Measure Corpus Homogeneity and Similarity between Corpora
Adam Kilgarriff

pdf bib
Maximum Entropy Model Learning of Subcategorization Preference
Takehito Utsuro | Takashi Miyata

pdf bib
Analysis of Unknown Lexical Items using Morphological and Syntactic Information with the TIMIT Corpus
Scott M. Thede | Mary Harper

pdf bib
A Local Grammar-based Approach to Recognizing of Proper Names in Korean Texts
Jee-Sun Nam | Key-Sun Choi

pdf bib
A Statistical Approach to Thai Morphological Analyzer
Asanee Kawtrakul | Chalathip Thumkanon

pdf bib
Probabilistic Word Classification Based on Context-Sensitive Binary Tree Method
Jun Gao | XiXian Chen



bib (full) Tagging Text with Lexical Semantics: Why, What, and How?

pdf bib
Tagging Text with Lexical Semantics: Why, What, and How?

pdf bib
Getting Serious about Word Sense Disambiguation
Hwee Tou Ng

pdf bib
Experience in WordNet Sense Tagging in the Wall Street Journal
Janyce Wiebe | Julie Maples | Lei Duan | Rebecca Bruce

pdf bib
Desiderata for Tagging with WordNet Synsets or MCCA Categories
Kenneth C. Litkowski

pdf bib
A Frame-Semantic Approach to Semantic Annotation
John B. Lowe

pdf bib
A Lexicon for Underspecified Semantic Tagging
Paul Buitelaar

pdf bib
Analysis of a Hand-Tagging Task
Christiane Fellbaum | Joachim Grabowski | Shari Land

pdf bib
Measuring Semantic Entropy
I. Dan Melamed

pdf bib
Sense Tagging: Semantic Tagging with a Lexicon
Yorick Wilks | Mark Stevenson

pdf bib
Selectional Preference and Sense Disambiguation
Philip Resnik

pdf bib
Investigating Complementary Methods for Verb Sense Pruning
Hongyan Jing | Vasileios Hatzivassiloglou | Rebecca Passonneau | Kathleen McKeown

pdf bib
Towards a Bootstrapping Framework for Corpus Semantic Tagging
Roberto Basili | Michelangelo Della Rocca | Maria Teresa Pazienza

pdf bib
Sense Tagging in Action: Combining Different Tests with Additive Weightings
Andrew Harley | Dominic Glennon

pdf bib
A Perspective on Word Sense Disambiguation Methods and Their Evaluation
Philip Resnik

pdf bib
Writing Annotation Instructions
Janyce Wiebe

pdf bib
Combining Knowledge Sources for Automatic Semantic Tagging
Douglas Jones | Boyan Onyshkevych

pdf bib
Semantic Tagging and NLP Applications
Thierry Declerck | Judith Klein

pdf bib
Evaluating Automatic Semantic Taggers
Philip Resnik

pdf bib
Constructing Semantic Tagsets
Alain Polguere

pdf bib
Structured Lexicons and Semantic Tagging
Bonnie J. Dorr | Mari Broman Olsen



bib (full) Second Conference on Empirical Methods in Natural Language Processing

pdf bib
Second Conference on Empirical Methods in Natural Language Processing

pdf bib
A Linear Observed Time Statistical Parser Based on Maximum Entropy Models
Adwait Ratnaparkhi

pdf bib
Global Thresholding and Multiple-Pass Parsing
Joshua Goodman

pdf bib
An Efficient Distribution of Labor in a Two Stage Robust Interpretation Process
Carolyn Penstein Rose | Alon Lavie

pdf bib
Text Segmentation Using Exponential Models
Doug Beeferman | Adam Berger | John Lafferty

pdf bib
Detecting Subject Boundaries Within Text: A Language Independent Statistical Approach
Korin Richmond | Andrew Smith | Einat Amitay

pdf bib
Mistake-Driven Learning in Text Categorization
Ido Dagan | Yael Karov | Dan Roth

pdf bib
Tagging Grammatical Functions
Thorsten Brants | Wojciech Skut | Brigitte Krenn

pdf bib
On aligning trees
Jo Calder

pdf bib
Aggregate and mixed-order Markov models for statistical language processing
Lawrence Saul | Fernando Pereira

pdf bib
Assigning Grammatical Relations with a Back-off Model
Erika F. de Lima

pdf bib
Automatic Discovery of Non-Compositional Compounds in Parallel Data
I. Dan Melamed

pdf bib
Learning to Tag Multilingual Texts Through Observation
Scott W. Bennett | Chinatsu Aone

pdf bib
A Corpus-Based Approach for Building Semantic Lexicons
Ellen Riloff | Jessica Shepherd

pdf bib
Inducing Terminology for Lexical Acquisition
Roberto Basili | Gianluca De Rossi | Maria Teresa Pazienza

pdf bib
Name Searching and Information Retrieval
Paul Thompson | Christopher C. Dozier

pdf bib
Lexicon Effects on Chinese Information Retrieval
K.L. Kwok

pdf bib
Attaching Multiple Prepositional Phrases: Backed-off Estimation Generalized
Paola Merlo

pdf bib
Learning Methods for Combining Linguistic Indicators to Classify Verbs
Eric V. Siegel

pdf bib
Probabilistic Coreference in Information Extraction
Andrew Kehler

pdf bib
An Empirical Approach to Temporal Reference Resolution
Janyce Wiebe | Tom O’Hara | Kenneth McKeever | Thorsten Ohrstrom-Sandgren

pdf bib
Word Sense Disambiguation Based on Structured Semantic Space
Donghong Ji | Changning Huang

pdf bib
Distinguishing Word Senses in Untagged Text
Ted Pedersen | Rebecca Bruce

pdf bib
Exemplar-Based Word Sense Disambiguation: Some Recent Improvements
Hwee Tou Ng



bib (full) Spoken Language Translation

pdf bib
Spoken Language Translation

pdf bib
Discourse particles and routine formulas in spoken language translation
Manfred Stede | Birte Schmitz

pdf bib
A Dialogue Analysis Model with Statistical Speech Act Processing for Dialogue Machine Translation
Jae-won Lee | Gil Chang Kim

pdf bib
Towards Translating Spoken Language Pragmatics in an Analogical Framework
Keiko Horiguchi

pdf bib
Correct parts extraction from speech recognition results using semantic distance calculation, and its application to speech translation
Yumi Wakita | Jun Kawai

pdf bib
A Formal Basis for Spoken Language Translation by Analogy
Keiko Horiguchi

pdf bib
Dealing with Multilinguality in a Spoken Language Query Translator
Pascale Fung | Bertram Shi | Dekai Wu | Lain Wai Bun | Wong Shuen Kong

pdf bib
Using Categories in the EUTRANS System
J. C. Amengual | J. M. Benedí | F. Casacuberta | A. Castaño | A. Castellanos | D. Llorens | A. Marzal | F. Prat | E. Vidal | J. M. Vilar

pdf bib
English-to-Mandarin Speech Translation with Head Transducers
Hiyan Alshawi

pdf bib
Interactive Speech Translation in the DIPLOMAT Project
Robert Frederking | Alexander Rudnicky | Christopher Hogan

pdf bib
Expanding the Domain of a Multi-lingual Speech-to-Speech Translation System
Alon Lavie | Lori Levin | Puming Zhan | Maite Taboada | Donna Gates | Mirella Lapata | Cortis Clark | Matthew Broadhead | Alex Waibel

pdf bib
Translation Methodology in the Spoken Language Translator: An Evaluation
David Carter | Ralph Becket | Manny Rayner | Robert Eklund | Catriona MacDermid | Mats Wirén | Sabine Kirchmeier-Andersen | Christina Philp

pdf bib
Six Issues in Speech Translation
Mark Seligman

pdf bib
Combining Situated Reasoning with Semantic Transfer Minimally
Tsutomu Fujinami

pdf bib
A Polish-to-English Text-to-text Translation System Based on an Electronic Dictionary
Krzysztof Jassem

pdf bib
Spoken Language Translation with the ITSVox System
Eric Wehrli | Jean-Luc Cochard




bib (full) Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications

pdf bib
Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications

pdf bib
Evaluating Interactive Dialogue Systems: Extending Component Evaluation to Integrated System Evaluation
Marilyn A. Walker | Diane J. Litman | Candace A. Kamm | Alicia Abella

pdf bib
A Generic Template to evaluate integrated components in spoken dialogue systems
Gavin E. Churcher | Eric S. Atwell | Clive Souter

pdf bib
Generality and Objectivity: Central Issues in Putting a Dialogue Evaluation Tool into Practical Use
Laila Dybkjaer | Niels Ole Bernsen | Hans Dybkjaer

pdf bib
An Object-Oriented Model for the Design of Cross-Domain Dialogue Systems
Ian M. O’Neill | Michael F. McTear

pdf bib
Automatic Lexicon Enhancement by Means of Corpus Tagging
Frederic Bechet | Thierry Spriet | Marc El-Beze

pdf bib
Clarification Dialogues as Measure to Increase Robustness in a Spoken Dialogue System
Elisabeth Maier | Norbert Reithinger | Jan Alexandersson

pdf bib
Performance Measures for the Next Generation of Spoken Natural Language Dialog Systems
Ronnie W. Smith

pdf bib
A practical Message-to-Speech strategy for dialogue systems
P. Spyns | F. Deprez | L. Van Tichelen | B. Van Coile

pdf bib
Turn off the Radio and Call Again: How Acoustic Clues can Improve Dialogue Management
Christian Lieske

pdf bib
Context Modeling for Language and Speech Generation
Kees van Deemter

pdf bib
Planning Efficient Mixed Initiative Dialogue
Eli Hagen

pdf bib
A Robust Dialogue System with Spontaneous Speech Understanding and Cooperative Response
Toshihiko Itoh | Akihiro Denda | Satoru Kogure | Seiichi Nakagawa

pdf bib
The “Casual Cashmere Diaper Bag”: Constraining Speech Recognition Using Examples
Paul Martin

pdf bib
Grammatical analysis in the OVIS spoken-dialogue system
Mark-Jan Nederhof | Gosse Bouma | Rob Koeling | Gertjan van Noord

pdf bib
Filtering Errors and Repairing Linguistic Anomalies for Spoken Dialogue Systems
David Roussel | Ariane Halber

pdf bib
How to obey the 7 commandments for spoken dialogue?
Emiel Krahmer | Jan Landsbergen | Xavier Pouteau

pdf bib
Towards a PURE Spoken Dialogue System for Information Access
Rajeev Agarwal

pdf bib
A Programmable Multi-Blackboard Architecture for Dialogue Processing Systems
Matthias Denecke

pdf bib
Corpus-Based Information Presentation for a Spoken Public Transport Information System
M.M.M. Rats | R.J. van Vark | J.P.M. de Vreught

pdf bib
Dialogue Strategies for Improving the Usability of Telephone Human-Machine Communication
Morena Danieli | Elisabetta Gerbino | Loreta M. Moisa

pdf bib
Speech-Graphics Dialogue Systems
Alan W. Biermann | Michael S. Fulkerson | Greg A. Keim



bib (full) Intelligent Scalable Text Summarization

pdf bib
Intelligent Scalable Text Summarization

pdf bib
Summarising: Where are we now? Where should we go?
Karen Sparck Jones

pdf bib
Salience-based Content Characterisation of Text Documents
Branimir Boguraev

pdf bib
Using Lexical Chains for Text Summarization
Regina Barzilay | Michael Elhadad

pdf bib
Automated Text Summarization in SUMMARIST
Eduard Hovy | ChinYew Lin

pdf bib
How to Appreciate the Quality of Automatic Text Summarization? Examples of FAN and MLUCE Protocols and their Results on SERAPHIN
Jean-Luc Minel | Sylvaine Nugier | Gerald Piat

pdf bib
A Proposal for Task-based Evaluation of Text Summarization Systems
Therese Firmin Hand

pdf bib
Automatic Text Summarization by Paragraph Extraction
Mandar Mitra | Amit Singhal | Chris Buckley

pdf bib
Goal-Directed Approach for Text Summarization
Ryo Ochitani | Yoshio Nakao | Fumihito Nishino

pdf bib
Statistical methods for retrieving most significant paragraphs in newspaper articles
Jose Abracos | Gabriel Pereira Lopes

pdf bib
Sentence extraction as a classification task
Simone Teufel

pdf bib
A Scalable Summarization System Using Robust NLP
Chinatsu Aone | Mary Ellen Okurowski | James Gorlinsky | Bjornar Larsen

pdf bib
COSY-MATS: An Intelligent and Scalable Summarisation Shell
Maria Aretoulaki

pdf bib
From discourse structures to text summaries
Daniel Marcu

pdf bib
SimSum: Simulation of summarizing
Brigitte Endres-Niggemeyer

pdf bib
A Formal Model of Text Summarization Based on Condensation Operators of a Terminological Logic
Ulrich Reimer | Udo Hahn



bib (full) Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications

pdf bib
Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications

pdf bib
Multilingual design of EuroWordNet
Piek Vossen | Pedro Diez-Orzas | Wim Peters

pdf bib
GermaNet - a Lexical-Semantic Net for German
Birgit Hamp | Helmut Feldweg

pdf bib
Extending a thesaurus by classifying words
Takenobu Tokunaga | Atsushi Fujii | Naoyuki Sakurai | Hozumi Tanaka

pdf bib
Formal redundancy and consistency checking rules for the lexical database WordNet 1.5
Dietrich H. Fischer

pdf bib
Lexical Discrimination with the Italian Version of WordNet
Alessandro Artale | Bernardo Magnini | Carlo Strapparava

pdf bib
Integrating a Lexical Database and a Training Collection for Text Categorization
Jose Maria Gomez-Hidalgo | Manuel de Buenaga Rodriguez

pdf bib
Integration of Hand-Crafted and Statistical Resources in Measuring Word Similarity
Atsushi Fujii | Toshihiro Hasegawa | Takenobu Tokunaga

pdf bib
Word Sense Disambiguation for Acquisition of Selectional Preferences
Diana McCarthy

pdf bib
The Use of Lexical Semantics in Information Extraction
Joyce Yue Chai | Alan W. Biermann

pdf bib
Subject and Object Dependency Extraction Using Finite-State Transducers
Salah Ait-Mokhtar | Jean-Pierre Chanod

pdf bib
An Experiment in Semantic Tagging using Hidden Markov Model Tagging
Frederique Segond | Anne Schiller | Gregory Grefenstette | Jean-Pierre Chanod

pdf bib
Using Semantic Similarity to Acquire Cooccurrence Restrictions from Corpora
Antonio Sanfilippo

pdf bib
Inferring Semantic Similarity from Distributional Evidence: an Analogy-based Approach to Word Sense Disambiguation
Stefano Federici | Simonetta Montemagni | Vito Pirrelli



bib (full) From Research to Commercial Applications: Making NLP Work in Practice

pdf bib
From Research to Commercial Applications: Making NLP Work in Practice

pdf bib
Reuse of a Proper Noun Recognition System in Commercial and Operational NLP Applications
Chinatsu Aone | John Maloney

pdf bib
Developing a new grammar checker for English as a second language
Cornelia Tschichold | Franck Bodmer | Etienne Cornu | Francois Grosjean | Lysiane Grosjean | Natalie Kubler | Nicolas Lewy | Corinne Tschumi

pdf bib
Software Re-Use and Evolution in Text Generation Applications
Karen Kukich | Rebecca Passonneau | Kathleen McKeown | Dragomir Radev | Vasileios Hatzivassiloglou | Hongyan Jing

pdf bib
Estimating the True Performance of Classification-Based NLP Technology
James R. Nolan

pdf bib
Tailored Patient Information: Some Issues and Questions
Ehud Reiter

pdf bib
Practical Considerations in Building a Multi-Lingual Authoring System for Business Letters
John Tait

pdf bib
A Language Identification Application Built on the Java Client / Server Platform
Gary Adams | Philip Resnik

pdf bib
Practical Experience with Grammar Sharing in Multilingual NLP
Michael Gamon | Carmen Lozano | Jessie Pinkham | Tom Reutter

pdf bib
NLP and Industry: Transfer and Reuse of Technologies
Leo Obrst | Krishna Nanda Jha

pdf bib
Recycling Lingware in a Multilingual MT System
Manny Rayner | David Carter | Ivan Bretan | Robert Eklund | Mats Wiren | Steffen Leo Hansen | Sabine Kirchmeier-Andersen | Christina Philp | Finn Sorensen | Hanne Erdman Thomsen



bib (full) CoNLL97: Computational Natural Language Learning

pdf bib
CoNLL97: Computational Natural Language Learning

pdf bib
A Trainable Message Understanding System
Amit Bagga | Joyce Yue Chai

pdf bib
Relational Learning of Pattern-Match Rules for Information Extraction
Mary Elaine Califf | Raymond J. Mooney

pdf bib
A Preliminary Study of Word Clustering Based on Syntactic Behavior
Wide R. Hogenhout | Yuji Matsumoto

pdf bib
Learning New Compositions from Given Ones
Donghong Ji | Jun He | Changning Huang

pdf bib
A Statistical Decision Making Method: A Case Study on Prepositional Phrase Attachment
Mehmet Kayaalp | Ted Pedersen | Rebecca Bruce

pdf bib
Method for Improving Automatic Word Categorization
Emin Erkan Korkmaz | Gokturk Ucoluk

pdf bib
From Psycholinguistic Modelling of Interlanguage in Second Language Acquisition to a Computational Model
Montse Maritxalar | Arantza Diaz de Ilarraza | Maite Oronoz

pdf bib
What makes a word: Learning base units in Japanese for speech recognition
Laura Mayfield Tomokiyo | Klaus Ries

pdf bib
Evolution of a Rapidly Learned Representation for Speech
Ramin Charles Nakisa | Kim Plunkett

pdf bib
Learning Stochastic Categorial Grammars
Miles Osborne | Ted Briscoe

pdf bib
Learning and Application of Differential Grammars
David M. W. Powers

pdf bib
Recurrent Neural-Network Learning of Phonological Regularities in Turkish
Jennifer Rodd

pdf bib
Explanation-Based Learning of Data-Oriented Parsing
K. Sima’an

pdf bib
Word Triggers and the EM Algorithm
Christoph Tillmann | Hermann Ney

pdf bib
A Comparative Study of the Application of Different Learning Techniques to Natural Language Interfaces
Werner Winiwarter | Yahiko Kambayashi

pdf bib
Resolving PP attachment Ambiguities with Memory-Based Learning
Jakub Zavrel | Walter Daelemans | Jorn Veenstra





bib (full) Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts

pdf bib
Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts

pdf bib
Resolving bridging references in unrestricted text
Massimo Poesio | Renata Vieira | Simone Teufel

pdf bib
Constraints and defaults of zero pronouns in Japanese instruction manuals
Tatsunori Mori | Mamoru Matsuo | Hiroshi Nakagawa

pdf bib
Factors in anaphora resolution: they are not the only things that matter. A case study based on two different approaches
Ruslan Mitkov

pdf bib
Automatic extraction of rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs
Hiromi Nakaiwa

pdf bib
Resolving anaphoric references on deficient syntactic descriptions
Roland Stuckardt

pdf bib
CogNIAC: high precision coreference with limited knowledge and linguistic resources
Breck Baldwin

pdf bib
Recognizing referential links: an information extraction perspective
Megumi Kameyama

pdf bib
Supporting anaphor resolution in dialogues with a corpus-based probabilistic model
Marco Rocha

pdf bib
Towards reliable partial anaphora resolution
Sabine Bergler

pdf bib
Corpus annotation and reference resolution
Tony McEnery | Izumi Tanaka | Simon Botley

pdf bib
Event coreference for information extraction
Kevin Humphreys | Robert Gaizauskas | Saliha Azzam

pdf bib
How far are we from (semi-)automatic annotation of anaphoric links in corpora?
Ruslan Mitkov

pdf bib
What brings a higher-order entity into focus of attention? Sentential pronouns in English and Norwegian.
Kaja Borthen | Thorstein Fretheim

pdf bib
Cooperation between pronoun and reference resolution for unrestricted texts
Andrei Popescu-Belis | Isabelle Robba



bib (full) Referring Phenomena in a Multimedia Context and their Computational Treatment

pdf bib
Referring Phenomena in a Multimedia Context and their Computational Treatment

pdf bib
Integration and Synchronization of Input Modes during Multimodal Human-Computer Interaction
Sharon Oviatt | Antonella DeAngeli | Karen Kuhn

pdf bib
Referring in Multimodal Systems: The Importance of User Expertise and System Features
Daniela Petrelli | Antonella De Angeli | Walter Gerbino | Giulia Cassano

pdf bib
Towards Generation of Fluent Referring Action in Multimodal Situations
Tsuneaki Kato | Yukiko I. Nakano

pdf bib
Hypertext and Deixis
Dan Loehr

pdf bib
Multimodal References in GEORAL TACTILE
Jacques Siroux | Marc Guyomard | Franck Multon | Christophe Remondeau

pdf bib
Active and Passive Gestures - Problems with the Resolution of Deictic and Elliptic Expressions in a Multimodal System
Michael Streit

pdf bib
Scene Direction Based Reference In Drama Scenes
Hiroshi Nakagawa | Yoshitomo Yaginuma | Masao Sakauchi

pdf bib
Generating Referential Descriptions in Multimedia Environments
Helmut Horacek

pdf bib
Planning Referential Acts for Animated Presentation Agents
Elisabeth Andre | Thomas Rist

pdf bib
Exploiting Image Descriptions for the Generation of Referring Expressions
Knut Hartmann | Jochen Schopp

pdf bib
Referring to Displays in Multimodal Interfaces
Daqing He | Graeme Ritchie | John Lee

pdf bib
A Syndetic Approach to Referring Phenomena in Multimodal Interaction
Giorgio P. Faconti | Mieke Massink

pdf bib
Constraints on the Use of Language, Gesture and Speech for Multimodal Dialogues
Bertrand Gaiffe | Laurent Romary

pdf bib
A Model for Multimodal Reference Resolution
Luis A. Pineda | E. Gabriela Garza

pdf bib
The CARTOON project: Towards Integration of Multimodal and Linguistic Analysis for Cartographic Applications
J.C. Martin | X. Briffault | M.R. Goncalves | J. Vapillon

pdf bib
Recognition of referring expressions
Stig W. Jorgensen

pdf bib
Information on the ACL Special Interest Group on Multimedia Language Processing



bib (full) Computational Environments for Grammar Development and Linguistic Engineering

pdf bib
Computational Environments for Grammar Development and Linguistic Engineering

pdf bib
Some apparently disjoint aims and requirements for grammar development environments: the case of natural language generation
John Bateman

pdf bib
The TreeBanker: a Tool for Supervised Training of Parsed Corpora
David Carter

pdf bib
Participatory Design for Linguistic Engineering: the Case of the GEPPETTO Development Environment
Fabio Ciravegna | Alberto Lavelli | Daniela Petrelli | Fabio Pianesi

pdf bib
Hypertextual Grammar Development
Luca Dini | Giampaolo Mazzini

pdf bib
Maintaining the Forest and Burning out the Underbrush in XTAG
Christine Doran | Beth Hockey | Philip Hopely | Joseph Rosenzweig | Anoop Sarkar | B. Srinivas | Fei Xia

pdf bib
The ConTroll System as Large Grammar Development Platform
Thilo Gotz | Walt Detmar Meurers

pdf bib
Application-driven automatic subgrammar extraction
Renate Henschel

pdf bib
Lexical Resource Reconciliation in the Xerox Linguistic Environment
Ronald M. Kaplan

pdf bib
Head-Driven Generation and Indexing in ALE
Gerald Penn

pdf bib
EFLUF - an Implementation of a FLexible Unification Formalism
Lena Stromback

pdf bib
Exploiting Contextual Information in Hypothesis Selection for Grammar Refinement
Thanaruk Theeramunkong | Yasunobu Kawaguchi | Manabu Okumura

pdf bib
ALEP-Based Distributed Grammar Engineering
Axel Theofilidis

pdf bib
Hdrug. A Flexible and Extendible Development Environment for Natural Language Processing.
Gertjan van Noord | Gosse Bouma

pdf bib
An Object-Oriented Linguistic Engineering Environment using LFG (Lexical Functional Grammar) and CG (Conceptual Graphs)
Jerome Vapillon | Xavier Briffault | Gerard Sabah | Karim Chibout

pdf bib
Experiences with the GTU grammar development environment
Martin Volk



pdf (full)
bib (full)
Proceedings of the Fifth International Workshop on Parsing Technologies

bib

pdf bib
The Computation of Movement
Sandiway Fong

A central goal of parsing is to recover linguistic structure for interpretation. One property of language that seems to be prevalent is the so-called displacement property. That is, syntactic items commonly appear in places other than where we would normally expect for interpretation. Some examples of phenomena involving displacement include Wh-movement, raising, passivization, scrambling, topicalization and focus. As Chomsky (1995) points out, displacement is an irreducible fact about human language that every contemporary theory of language has to address. In the principles-and-parameters framework, it is customary to posit a general movement operation, Move-α, that in concert with conditions on its application serves to link displaced elements with their base positions. In terms of parsing, the task is to decode or unravel the effects of Move-α from the surface order. More specifically, for each element, we have to determine whether that element has been displaced or not, and, if so, determine the original position it was displaced from and reconstruct the path it took, including any intermediate positions or landing sites. In general, each displaced element is said to head a (non-trivial) chain with one or more empty categories known as traces occupying the positions that it passed through. Note that in such theories, empty categories are not just simple placeholders, but elements with much of the same type and range of syntactic properties displayed by their overt counterparts. For example, empty categories in argument positions, like anaphors and pronouns, participate in binding theory and theta role discharge. Hence, the well-formedness of a given sentence will depend, in general, on recovering both the visible and non-visible parts of syntactic structure. In this talk, we will describe how PAPPI, a multi-lingual parser for theories in the principles-and-parameters framework, deals with the computation of movement chains and empty categories in general.
Drawing from implemented examples across a variety of languages, we will discuss the mechanism used to handle standard cases of phrasal movement commonly discussed in the literature such as Wh-movement, passivization, raising and verb second (V2) phenomena. We will also describe how this mechanism is adapted to handle instances of argument scrambling in languages like Korean and Japanese. We will also focus our attention on head movement. Here, following Pollock (1989), we will discuss the mechanism used to handle the surface differences in the behaviour of verbal inflection in English and French. Following Pesetsky (1995), we will also discuss the implementation of a theory of double object constructions involving the incorporation of both overt and non-overt prepositions into verbal heads. Finally, we will describe two recent additions to the movement mechanism in the PAPPI system. First, moving towards a theory of goal-driven movement, as opposed to the free movement system implied by Move-α, we will discuss an implementation of Case-driven movement within the VP-shell to handle examples involving focus, backgrounding and topicalization in Turkish. Second, using examples from English and Turkish, we will discuss the necessity of a mechanism of reconstruction that optionally “undoes” or reverses the effects of movement to handle facts involving binding and scope.

pdf bib
Parsing Technology and RNA Folding: a Promising Start
Fabrice Lefebvre

The determination of the secondary structure of RNAs is a problem which has been tackled by distantly related methods ranging from comparative analysis to thermodynamic energy optimization or stochastic context-free grammars (SCFGs). Because of its very nature (properly nested pairs of bases of a single stranded sequence) the secondary structure of RNAs is well modeled by context-free grammars (CFGs). This fact was recognized several years ago by people who used context-free grammars as a tool to discover some combinatorial properties of secondary structures. More recently SCFGs were used by several teams (esp. David Haussler’s team at UC Santa Cruz) as an effective tool to fold RNAs through Cocke-Younger-Kasami-like parsers. Until 1996, and in the context of RNA folding, CFGs and their derivatives were still considered theoretical tools, barely usable outside the computer scientist's lab. The exception of SCFGs seemed promising, with all the hype around Hidden Markov Models and other stochastic methods, but it remained to be confirmed for RNAs longer than 200 bases. The main obstacle to the use of context-free grammars and parsing technology for RNA folding and other closely related problems is the following: suitable grammars are exponentially ambiguous, and sentences to parse (i.e. RNA or DNA sequences) typically have more than 200 words, and sometimes more than 4000 words. These figures are rather unusual for ordinary parsers or parser generators, because they are mostly used in the context of natural language parsing, and thus do not have to face the same computation problems. In fact, most people dealing with RNA folding problems were manually writing dynamic programming based tools. This was the case for folding models popularized by Michael Zuker, and based on free energy minimization. This was also the case for folding models based on SCFGs. This was in effect the case for just about every computer method available to fold or align sequences.
Parsing sequences was not considered an option because it simply seemed too slow, too memory hungry and even unrelated. In 1995, I showed that S-attribute grammars were perfectly able to handle both the thermodynamic model and the stochastic model of RNA folding. I then introduced a parser generator which was able, given a proper S-attribute grammar, to automatically write an efficient parser based on suitable optimizations of Earley’s parsing algorithm. All generated parsers turned out to be faster and less memory hungry than other available parsers for the same exponentially ambiguous grammars and the same sequences. More surprisingly, these parsers also turned out to be faster than hand-written programs based on dynamic programming equations. This was the first proof that improvements in parsing technology can be put to good use in biocomputing problems, and that they should lead to better algorithms and tools. While trying to overcome some limitations of SCFGs, I generalized S-attribute grammars to multi-tape S-attribute grammars (MTSAGs). The automata theory counterpart of an MTSAG would be a non-deterministic push-down automaton with several one-way reading heads, instead of a single one-way reading head as is the case for a CFG. Given these MTSAGs, a generalization of the previous single-tape parser generator was the obvious way forward. Thanks to this new parser generator, I was able to show that most biocomputing models previously based on dynamic programming equations were unified by MTSAGs, and that they were better handled by automatically generated parsers than by handwritten programs. It did not matter whether these models were trying to align sequences, fold RNAs, align folded RNAs, align folded and unfolded RNAs, simultaneously align and fold RNAs, etc.
It also turned out that the way SCFGs and HMMs are currently used may be better pictured, thanks to 2-tape MTSAGs, as the simultaneous alignment and folding of a first special tape, representing the target model, against a second tape, containing the actual sequence. This representation may lead to algorithms which will efficiently learn SCFGs from initially unaligned sequences. While the current parser generator for MTSAGs is a usable proof of concept, which nevertheless required several months of work, I am quite convinced that there should be better ways than the current algorithm to parse several tapes. There should also exist other generalizations of CFGs which may prove fruitful. Current results are only promising starting points. The irony of the story is that HMMs and SCFGs were borrowed by biocomputing people from other fields such as signal or speech analysis. It may very well be the time for these fields to retrofit their own models with current advances in biocomputing such as MTSAGs.
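As an illustration of the link the abstract draws between context-free parsing and the dynamic programming used for RNA folding, here is a minimal sketch in Python. It is not the S-attribute grammar parser generator described above. It is a simple pair-counting recurrence in the style usually attributed to Nussinov, filled in bottom-up like a Cocke-Younger-Kasami table for the ambiguous grammar S -> x S y S | x S | empty, where x and y are complementary bases. The names `PAIRS` and `max_pairs`, the inclusion of G-U pairs, and the minimum loop length of 3 are assumptions made for this example.

```python
# Watson-Crick pairs plus the G-U wobble pair (an illustrative assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_pairs(seq: str, min_loop: int = 3) -> int:
    """Return the maximum number of properly nested base pairs in seq.

    table[i][j] holds the best score for the half-open span seq[i:j],
    filled by increasing span length, as in a CYK-style chart.
    """
    n = len(seq)
    table = [[0] * (n + 1) for _ in range(n + 1)]
    # Spans shorter than min_loop + 2 cannot contain a pair and keep score 0.
    for length in range(min_loop + 2, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            # Rule S -> x S: base i stays unpaired.
            score = table[i + 1][j]
            # Rule S -> x S y S: base i pairs with base k,
            # leaving at least min_loop bases between them.
            for k in range(i + min_loop + 1, j):
                if (seq[i], seq[k]) in PAIRS:
                    score = max(score, 1 + table[i + 1][k] + table[k + 1][j])
            table[i][j] = score
    return table[0][n]


if __name__ == "__main__":
    print(max_pairs("GGGAAAUCC"))
```

The triple loop makes the cubic cost of the table fill explicit, which is the scale problem the abstract raises for sequences of several thousand bases. A realistic folding model would replace the pair count with thermodynamic energies or rule probabilities.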

pdf bib
Intelligent Multimedia Information Access
Mark T. Maybury

The expansion of the information highway has generated requirements for more effective access to global and corporate information repositories. These repositories are increasingly multimedia, including text, audio (e.g., spoken language, music), graphics, imagery, and video. The advent of large, multimedia digital libraries has turned attention toward the problem of processing and managing multiple and heterogeneous media in a principled manner, including their creation, storage, indexing, browsing, search, visualization, and summarization. Intelligent multimedia information access is a multidisciplinary area that lies at the intersection of artificial intelligence, information retrieval, human computer interaction, and multimedia computing. Intelligent multimedia information access includes those systems which go beyond traditional hypermedia or hypertext environments and analyze media, generate media, or support intelligent interaction with or via multiple media using knowledge of the user, discourse, domain, world, or the media itself. Providing machines with the ability to interpret, generate, and support interaction with multimedia artifacts (e.g., documents, broadcasts, hypermedia) will be a valuable facility for a number of key applications such as videoteleconference archiving, custom on-line news, and briefing assistants. These media facilities, in turn, may support a variety of tasks ranging from training to information analysis to decision support. In this talk I will describe our group’s efforts to provide content based access to broadcast news sources, including our use of corpus-based processing techniques to the problems of video indexing, segmentation, and summarization. In addition to better access to content, we also need to concern ourselves with enabling more effective, efficient and natural human computer or computer mediated human-human interaction. 
This will require automated understanding and generation of multimedia and demand explicit representation of and reasoning about the user, discourse, task and context (Maybury 1993). To this end, I will describe our work in progress that aims to fully instrument the interface and build (automatically and semi-automatically) annotated corpora of human-machine interaction. We believe this will yield deeper and more comprehensive models of interaction which should ultimately enable more principled interface design.

pdf bib
Making Use of Intonation in Interactive Dialogue Translation
Mark Steedman

Intonational information is frequently discarded in speech recognition, and assigned by default heuristics in text-to-speech generation. However, in many applications involving dialogue and interactive discourse, intonation conveys significant information, and we ignore it at our peril. Translating telephones and personal assistants are an interesting test case, in which the salience of rapidly shifting discourse topics and the fact that sentences are machine-generated, rather than written by humans, combine to make the application particularly vulnerable to our poor theoretical grasp of intonation and its functions. I will discuss a number of approaches to the problem for such applications, ranging from cheap tricks to a combinatory grammar-based theory of the semantics involved and a syntax-phonology interface for building and generating from interpretations.

pdf bib
Disambiguating with Controlled Disjunctions
Philippe Blache

In this paper, we propose a disambiguating technique called controlled disjunctions. This extension of the so-called named disjunctions relies on the relations existing between feature values (covariation, control, etc.). We show that controlled disjunctions can implement different kinds of ambiguity in a consistent and homogeneous way. We describe the integration of controlled disjunctions into an HPSG feature structure representation. Finally, we present a direct implementation by means of delayed evaluation and we develop an example within the functional programming paradigm.

pdf bib
Encoding Frequency Information in Lexicalized Grammars
John Carroll | David Weir

We address the issue of how to associate frequency information with lexicalized grammar formalisms, using Lexicalized Tree Adjoining Grammar as a representative framework. We consider systematically a number of alternative probabilistic frameworks, evaluating their adequacy from both a theoretical and empirical perspective using data from existing large treebanks. We also propose three orthogonal approaches for backing off probability estimates to cope with the large number of parameters involved.

pdf bib
Towards a Reduced Commitment, D-Theory Style TAG Parser
John Chen | K. Vijay-Shankar

Many traditional TAG parsers handle ambiguity by considering all of the possible choices as they unfold during parsing. In contrast, D-theory parsers cope with ambiguity by using underspecified descriptions of trees. This paper introduces a novel approach to parsing TAG, namely one that explores how D-theoretic notions may be applied to TAG parsing. Combining the D-theoretic approach with TAG parsing as we do here raises new issues and problems. D-theoretic underspecification is used as a novel approach in the context of TAG parsing for delaying attachment decisions. Conversely, the use of TAG reveals the need for additional types of underspecification that have not been considered so far in the D-theoretic framework. These include combining sets of trees into their underspecified equivalents as well as underspecifying combinations of trees. In this paper, we examine various issues that arise in this new approach to TAG parsing and present solutions to some of the problems. We also describe other issues which need to be resolved for this method of parsing to be implemented.

pdf bib
Controlling Bottom-Up Chart Parsers through Text Chunking
Fabio Ciravegna | Alberto Lavelli

In this paper we propose to use text chunking for controlling a bottom-up parser. As is well known, during analysis such parsers produce many constituents that do not contribute to the final solution(s). Most of these constituents are introduced because of the parser's inability to check the input context around them. Preliminary text chunking makes it possible to focus directly on the constituents that seem more likely and to prune the search space in case some satisfactory solutions are found. Preliminary experiments show that a CYK-like parser controlled through chunking is definitely more efficient than a traditional parser without significantly losing in correctness. Moreover the quality of possible partial results produced by the controlled parser is high. The strategy is particularly suited for tasks like Information Extraction from text (IE) where sentences are often long and complex and it is very difficult to have a complete coverage. Hence, there is a strong necessity of focusing on the most likely solutions; furthermore, in IE the quality of partial results is important.

pdf bib
Pruning Search Space for Parsing Free Coordination in Categorial Grammar
Crit Cremers

The standard resource sensitive invariants of categorial grammar are not suited to prune search space in the presence of coordination. We propose a weaker variant of count invariancy in order to prune the search space for parsing coordinated sentences at a stage prior to proper parsing. This Coordinative Count Invariant is argued to be the strongest possible instrument to prune search space for parsing coordination in categorial grammar. Its mode of operation is explained, and its effect at pruning search space is exemplified.
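As a minimal illustration of the kind of resource-sensitive check the abstract refers to, the sketch below implements the standard count invariant for categorial sequents, not the weaker Coordinative Count Invariant proposed in the paper, whose definition the abstract does not give. The tuple encoding of categories and the function names are assumptions made for this example only.

```python
from collections import Counter

# Hypothetical encoding (an assumption of this sketch):
#   an atomic category is a string, e.g. "np" or "s";
#   ("/", X, Y)  stands for X/Y  (result X, argument Y);
#   ("\\", Y, X) stands for Y\X  (argument Y, result X).

def count(cat):
    """Signed atom counts: result positions add, argument positions subtract."""
    if isinstance(cat, str):
        return Counter({cat: 1})
    op, left, right = cat
    result, argument = (left, right) if op == "/" else (right, left)
    total = Counter(count(result))
    total.subtract(count(argument))
    return total

def passes_count_check(premises, goal):
    """Necessary (not sufficient) condition for deriving goal from premises."""
    balance = Counter()
    for cat in premises:
        balance.update(count(cat))
    balance.subtract(count(goal))
    return all(value == 0 for value in balance.values())

# A transitive verb (np\s)/np between two noun phrases balances to s.
TV = ("/", ("\\", "np", "s"), "np")
print(passes_count_check(["np", TV, "np"], "s"))  # True
print(passes_count_check(["np", TV], "s"))        # False
```

A sequence that fails the check can be discarded before any parsing is attempted, which is the sense in which such invariants prune the search space.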

pdf bib
Bilexical Grammars and a Cubic-time Probabilistic Parser
Jason Eisner

pdf bib
Automaton-based Parsing for Lexicalised Grammars
Roger Evans | David Weir

In wide-coverage lexicalized grammars many of the elementary structures have substructures in common. This means that during parsing some of the computation associated with different structures is duplicated. This paper explores ways in which the grammar can be precompiled into finite state automata so that some of this shared structure results in shared computation at run-time.

pdf bib
From Part of Speech Tagging to Memory-based Deep Syntactic Analysis
Emmanuel Giguet | Jacques Vergne

This paper presents a robust system for deep syntactic parsing of unrestricted French. This system uses techniques from Part-of-Speech tagging in order to build a constituent structure and uses other techniques from dependency grammar in an original framework of memories in order to build a functional structure. The two structures are built simultaneously by two interacting processes. The processes share the same aim, that is, to recover efficiently and reliably syntactic information with no explicit expectation on text structure.

pdf bib
Probabilistic Feature Grammars
Joshua Goodman

We present a new formalism, probabilistic feature grammar (PFG). PFGs combine most of the best properties of several other formalisms, including those of Collins, Magerman, and Charniak, and in experiments have comparable or better performance. PFGs generate features one at a time, probabilistically, conditioning the probabilities of each feature on other features in a local context. Because the conditioning is local, efficient polynomial time parsing algorithms exist for computing inside, outside, and Viterbi parses. PFGs can produce probabilities of strings, making them potentially useful for language modeling. Precision and recall results are comparable to the state of the art with words, and the best reported without words.

pdf bib
Message-passing Protocols for Real-world Parsing - An Object-oriented Model and its Preliminary Evaluation
Udo Hahn | Peter Neuhaus | Norbert Broeker

We argue for a performance-based design of natural language grammars and their associated parsers in order to meet the constraints imposed by real-world NLP. Our approach incorporates declarative and procedural knowledge about language and language use within an object-oriented specification framework. We discuss several message-passing protocols for parsing and provide reasons for sacrificing completeness of the parse in favor of efficiency based on a preliminary empirical evaluation.

pdf bib
Probabilistic Parse Selection based on Semantic Cooccurrences
Eirik Hektoen

This paper presents a new technique for selecting the correct parse of ambiguous sentences based on a probabilistic analysis of lexical cooccurrences in semantic forms. The method is called “Semco” (for semantic cooccurrence analysis) and is specifically targeted at the differential distribution of such cooccurrences in correct and incorrect parses. It uses Bayesian Estimation for the cooccurrence probabilities to achieve higher accuracy for sparse data than the more common Maximum Likelihood Estimation would. It has been tested on the Wall Street Journal corpus (in the PENN Treebank) and shown to find the correct parse of 60.9% of parseable sentences of 6-20 words.
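The contrast between Bayesian and Maximum Likelihood Estimation on sparse counts can be sketched generically as follows. This is not the Semco estimator, which the abstract does not specify; it assumes a simple binomial model with a Beta prior, and the function names and the uniform prior (alpha = beta = 1) are choices made for this illustration.

```python
def mle(successes, trials):
    """Maximum likelihood estimate of a binomial probability."""
    return successes / trials

def bayes_posterior_mean(successes, trials, alpha=1.0, beta=1.0):
    """Posterior mean under a Beta(alpha, beta) prior (assumed, not from the paper)."""
    return (successes + alpha) / (trials + alpha + beta)

# A cooccurrence seen in 0 of 2 parses: the MLE commits to probability zero,
# while the Bayesian estimate stays away from the extreme.
print(mle(0, 2))                   # 0.0
print(bayes_posterior_mean(0, 2))  # 0.25
```

With only a handful of observations the two estimates differ sharply; as the number of trials grows they converge, which is why the choice matters mainly for sparse data.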

pdf bib
A New Formalization of Probabilistic GLR Parsing
Kentaro Unui | Virach Sornlertlamvanich | Hozumi Tanaka | Takenobu Tokunaga

This paper presents a new formalization of probabilistic GLR language modeling for statistical parsing. Our model inherits its essential features from Briscoe and Carroll’s generalized probabilistic LR model, which obtains context-sensitivity by assigning a probability to each LR parsing action according to its left and right context. Briscoe and Carroll’s model, however, has a drawback in that it is not formalized in any probabilistically well-founded way, which may degrade its parsing performance. Our formulation overcomes this drawback with a few significant refinements, while maintaining all the advantages of Briscoe and Carroll’s modeling.

pdf bib
Efficient Parsing for CCGs with Generalized Type-raised Categories
Nobo Komagata

A type of ‘non-traditional constituents’ motivates an extended class of Combinatory Categorial Grammars, CCGs with Generalized Type-Raised Categories (CCG-GTRC) involving variables. Although the class of standard CCGs is known to be polynomially parsable, unrestricted use of variables can destroy this essential requirement for a practical parser. This paper argues for polynomial parsability of CCG-GTRC from practical and theoretical points of view. First, we show that an experimental parser runs polynomially in practice on a realistic fragment of Japanese by eliminating spurious ambiguity and excluding genuine ambiguities. Then, we present a worst-case polynomial recognition algorithm for CCG-GTRC by extending the polynomial algorithm for the standard CCGs.

pdf bib
Probabilistic Parsing using Left Corner Language Models
Christopher D. Manning | Bob Carpenter

We introduce a novel parser based on a probabilistic version of a left-corner parser. The left-corner strategy is attractive because rule probabilities can be conditioned on both top-down goals and bottom-up derivations. We develop the underlying theory and explain how a grammar can be induced from analyzed data. We show that the left-corner approach provides an advantage over simple top-down probabilistic context-free grammars in parsing the Wall Street Journal using a grammar induced from the Penn Treebank. We also conclude that the Penn Treebank provides a fairly weak test bed due to the flatness of its bracketings and to the obvious overgeneration and undergeneration of its induced grammar.

pdf bib
Regular Approximations of CFLs: A Grammatical View
Mark-Jan Nederhof

We show that for each context-free grammar a new grammar can be constructed that generates a regular language. This construction differs from existing methods of approximation in that use of a pushdown automaton is avoided. This allows better insight into how the generated language is affected. The new method is also more attractive from a computational viewpoint.

pdf bib
A Left-to-right Tagger for Word Graphs
Christer Samuelsson

An algorithm is presented for tagging input word graphs and producing output tag graphs that are to be subjected to further syntactic processing. It is based on an extension of the basic HMM equations for tagging an input word string that allows it to handle word-graph input, where each arc has been assigned a probability. The scenario is that of some word-graph source, e.g., an acoustic speech recognizer, producing the arcs of a word graph, and the tagger will in turn produce output arcs, labelled with tags and assigned probabilities. The processing is done entirely left-to-right, and the output tag graph is constructed using a minimum of lookahead, facilitating real-time processing.

pdf bib
Parsing by Successive Approximation
Helmut Schmid

It is proposed to parse feature structure-based grammars in several steps. Each step aims to eliminate as many invalid analyses as possible as efficiently as possible. To this end the set of feature constraints is divided into three subsets, a set of context-free constraints, a set of filtering constraints and a set of structure-building constraints, which are solved in that order. The best processing strategy differs: context-free constraints are solved efficiently with one of the well-known algorithms for context-free parsing. Filtering constraints can be solved using unification algorithms for non-disjunctive feature structures, whereas structure-building constraints require special techniques to represent feature structures with embedded disjunctions efficiently. A compilation method and an efficient processing strategy for filtering constraints are presented.

pdf bib
Performance Evaluation of Supertagging for Partial Parsing
B. Srinivas

In previous work we introduced the idea of supertagging as a means of improving the efficiency of a lexicalized grammar parser. In this paper, we present supertagging in conjunction with a lightweight dependency analyzer as a robust and efficient partial parser. The present work is significant for two reasons. First, we have vastly improved our results: 92% accuracy for supertag disambiguation, using lexical information, a larger training corpus and smoothing techniques. Second, we show how supertagging can be used for partial parsing and provide detailed evaluation results for detecting noun chunks, verb chunks, preposition phrase attachment and a variety of other linguistic constructions. Using supertag representation, we achieve a recall rate of 93.0% and a precision rate of 91.8% for noun chunking, improving on the best known result for noun chunking.
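For readers unfamiliar with how recall and precision figures of this kind are obtained, the sketch below scores predicted chunks against gold chunks. It assumes exact-match scoring of (start, end) token spans, which may differ from the evaluation protocol used in the paper; the function name and the toy spans are invented for this illustration.

```python
def precision_recall(predicted, gold):
    """Exact-match precision and recall over sets of (start, end) spans."""
    predicted, gold = set(predicted), set(gold)
    correct = len(predicted & gold)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall

# Toy data (hypothetical): three gold noun chunks, four predicted ones,
# two of which match a gold chunk exactly.
gold_chunks = [(0, 2), (3, 5), (6, 7)]
predicted_chunks = [(0, 2), (3, 4), (6, 7), (8, 9)]
precision, recall = precision_recall(predicted_chunks, gold_chunks)
print(precision)  # 0.5
```

Precision penalizes spurious chunks and recall penalizes missed ones, so a chunker has to be reported on both to be comparable with other systems.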

pdf bib
An Earley Algorithm for Generic Attribute Augmented Grammars and Applications
Frederic Tendeau

We describe an extension of Earley’s algorithm which computes the decoration of a shared forest in a generic domain. Attribute computations are defined by a morphism from leftmost derivations to the generic domain, which leaves the computations independent from (even if guided by) the parsing strategy. The approach is illustrated by the example of a definite clause grammar, seen as CF-grammars decorated by attributes.

pdf bib
A Case Study in Optimizing Parsing Schemata by Disambiguation Filters
Eelco Visser

Disambiguation methods for context-free grammars enable concise specification of programming languages by ambiguous grammars. A disambiguation filter is a function that selects a subset from a set of parse trees, namely the possible parse trees for an ambiguous sentence. The framework of filters provides a declarative description of disambiguation methods independent of parsing. Although filters can be implemented straightforwardly as functions that prune the parse forest produced by some generalized parser, this can be too inefficient for practical applications. In this paper the optimization of parsing schemata, a framework for high-level description of parsing algorithms, by disambiguation filters is considered in order to find efficient parsing algorithms for declaratively specified disambiguation methods. As a case study the optimization of the parsing schema of Earley’s parsing algorithm by two filters is investigated. The main result is a technique for generation of efficient LR-like parsers for ambiguous grammars disambiguated by means of priorities.

pdf bib
New Parsing Method using Global Association Table
Juntae Yoon | Seonho Kim | Mansuk Song

This paper presents a new parsing method using statistical information extracted from a corpus, especially for Korean. Structural ambiguities occur in deciding the dependency relation between words in Korean. In figuring out the correct dependency, lexical associations play an important role in resolving the ambiguities. Our parser uses statistical cooccurrence data to compute the lexical associations. In addition, it can be shown that sentences are parsed deterministically by the global management of the association. In this paper, the global association table (GAT) is defined and the association between words is recorded in the GAT. The system is a hybrid semi-deterministic parser and is controlled not by condition-action rules but by the association value between phrases. Whenever the expectation of the parser fails, it chooses the alternatives using a chart to remove the backtracking.

pdf bib
Constraint-driven Concurrent Parsing Applied to Romanian VP
Liviu Ciortuz

We show that LP constraints (together with language specific constraints) could be interpreted as meta-rules in (an extended) head-corner parsing algorithm using weakened ID rule schemata from the theory of HPSG [Pollard and Sag, 1994].

pdf bib
Robustness and Efficiency in AGFL
Caspar Derksen | Cornelis H. A. Koster | Erik Oltmans

pdf bib
Language Analysis in SCHISMA
Danny Lie | Joris Hulstijn | Hugo ter Doest | Anton Nijholt

pdf bib
Reducing the Complexity of Parsing by a Method of Decomposition
Caroline Lyon | Bob Dickerson

pdf bib
Formal Tools for Separating Syntactically Correct and Incorrect Structures
Martin Plátek | Vladislav Kuboň | Tomáš Holan

In this paper we introduce a class of formal grammars with special measures capable of describing typical syntactic inconsistencies in free word order languages. By means of these measures it is possible to characterize more precisely the problems connected with the task of building a robust parser or a grammar checker of Czech.

pdf bib
Parsers Optimization for Wide-coverage Unification-based Grammars using the Restriction Technique
Nora La Serna | Arantxa Díaz | Horacio Rodríguez

This article describes the methodology we have followed in order to improve the efficiency of a parsing algorithm for wide coverage unification-based grammars. The technique used is the restriction technique (Shieber 85), which has been recognized as an important operation to obtain efficient parsers for unification-based grammars. The main objective of the research is to determine how to choose appropriate restrictors for the restriction technique. We have developed a statistical model for selecting restrictors. Several experiments have been done in order to characterise those restrictors.