Research Projects
TITLE OF THE PROJECT: EUROTERM: Extending the EuroWordNet with Public Sector Terminology.
REFERENCE: EuroTerm EDC-2214
FINANCE: eContent Programme
TIME: FROM: 01/01/2001 TO: 30/06/2002
PROJECT LEADER: Dr. Manuel Palomar Sanz
MEMBERS: Dr. Antonio Ferrández, Armando Suárez, Dr. Patricio Martínez-Barco, Dr. Jesús Peral, Dr. Rafael Muñoz, Maximiliano Saiz-Noeda, Andrés Montoyo, José Luis Vicedo, Fernando Llopis
PARTNERS:
- Databases Laboratory of the Computer Engineering & Informatics Department, Patras University (Greece)
- CentER Applied Research, Tilburg University (Netherlands)
- Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante
Current Information about the project.
TITLE OF THE PROJECT: TUSIR: Desarrollo de un sistema de comprensión de textos aplicado a la recuperación de información (Development of a Text Understanding System Applied to Information Retrieval)
REFERENCE: TIC2000-0664-C02-01/02
FINANCE: Comisión Interministerial de Ciencia y Tecnología (CICYT).
TIME: FROM: 28/12/2000 TO: 27/12/2003
PROJECT LEADER: Dr. Manuel Palomar Sanz
MEMBERS: Dr. Antonio Ferrández, Armando Suárez, Dr. Patricio Martínez-Barco, Dr. Jesús Peral, Dr. Rafael Muñoz, Maximiliano Saiz-Noeda, Andrés Montoyo, José Luis Vicedo, Fernando Llopis, Rafael Romero
PARTNERS: Universidad de Alicante and Universidad Politécnica de Valencia
TITLE OF THE PROJECT: Construcción de Analizadores Híbridos de Lenguajes Naturales (Construction of Hybrid Analyzers for Natural Languages Defined in Constrained Semantic Domains)
REFERENCE: TIC97-0671-C02-02
FINANCE: Comisión Interministerial de Ciencia y Tecnología (CICYT).
TIME: FROM: 01/08/1997 TO: 31/07/2000
PROJECT LEADER: Dr. Manuel Palomar Sanz
MEMBERS: Dr. Antonio Ferrández, Armando Suárez, Jaime Gómez, Patricio Martínez-Barco, Jesús Peral, Rafael Muñoz, Maximiliano Saiz-Noeda.
SUMMARY OF THE PROJECT:
Developing Natural Language processing systems poses specific difficulties when wide coverage of the language is the goal. Some well-characterised linguistic phenomena (anaphora, ellipsis, etc.) must be considered, as must the phenomena characteristic of spontaneous speech, which can produce syntactically incorrect sentences. These problems can be approached deductively, by designing a language model from linguistic knowledge, or inductively, by applying learning techniques to data. As for the output the system must supply, one of the most interesting possibilities is for the system to give the meaning of the input sentences in terms of a semantic language. This kind of semantic output could be used in later processes (translation, execution of actions, etc.). The main objective of this project is the development of language understanding systems for semantically constrained tasks. We will develop different deductive analysis methodologies, as is customary in the Natural Language area, together with some inductive learning techniques that are used successfully in the Automatic Speech Recognition area. The co-operation of these two approaches will allow us to develop increasingly complex Natural Language processing systems. The main aspects to be studied are the estimation of statistical regular models and the construction of context-free grammars to represent the characteristics of the task language, and the development of analyzers that supply the semantic messages of the input sentences.
TITLE OF THE PROJECT: Development of a Corpus-based Integrated Anaphora Resolution System for Spanish and English.
REFERENCE: HB1998-0068 (Collaborative research project between Spain and the United Kingdom)
FINANCE: Ministerio de Educación y Cultura. Dir. Gral. de Enseñanza Superior e Investigación Científica.
TIME: FROM: 01/04/1999 TO: 1/03/2001
PROJECT LEADER: Dr. Manuel Palomar Sanz
MEMBERS: Dr. Antonio Ferrández, Dr. Lidia Moreno, Patricio Martínez-Barco, Jesús Peral, Rafael Muñoz, Maximiliano Saiz-Noeda.
SUMMARY OF THE PROJECT:
The project will focus on the construction of a bilingual (Spanish, English) coreferentially annotated corpus and its exploitation for the development and improvement of an integrated corpus-based anaphora resolution system for Spanish and English. The new system will be built by combining key components of the anaphora resolution systems already developed by each of the partners (which have demonstrated high success rates; see Ferrández et al. 1998a; Mitkov 1998b) and will benefit from the bilingual corpus with a view to achieving maximum performance. The project will naturally combine the anaphora resolution expertise of the British and Spanish sides, each of which has solid experience in the field. The main contributions of the project are (i) the development of a new, enhanced multilingual (English and Spanish) anaphora resolution system and (ii) the construction of a coreferentially marked-up corpus for both Spanish and English, which will be a novel resource