Aleš Horák


2015

pdf bib
Increasing Coverage of Translation Memories with Linguistically Motivated Segment Combination Methods
Vít Baisa | Aleš Horák | Marek Medveď
Proceedings of the Workshop Natural Language Processing for Translation Memories

2012

pdf bib
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification
Jan Rygl | Aleš Horák
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

In the authorship identification task, examples of short writings of N authors and an anonymous document written by one of these N authors are given. The task is to determine the authorship of the anonymous text. Practically all approaches solved this problem with machine learning methods. The input attributes for the machine learning process are usually formed by stylistic or grammatical properties of individual documents or a defined similarity between a document and an author. In this paper, we present the results of an experiment to extend the machine learning attributes by ranking the similarity between a document and an author: we transform the similarity between an unknown document and one of the N authors to the order in which the author is the most similar to the document in the set of N authors. The comparison of similarity probability and similarity ranking was made using the Support Vector Machines algorithm. The results show that machine learning methods perform slightly better with attributes based on the ranking of similarity than with previously used similarity between an author and a document.

2007

pdf bib
Verb Valency Semantic Representation for Deep Linguistic Processing
Aleš Horák | Karel Pala | Marie Duží | Pavel Materna
ACL 2007 Workshop on Deep Linguistic Processing

2006

pdf bib
Platform for Full-Syntax Grammar Development Using Meta-grammar Constructs
Aleš Horák | Vladimír Kadlec
Proceedings of the 20th Pacific Asia Conference on Language, Information and Computation

2002

pdf bib
Best Analysis Selection in Inflectional Languages
Aleš Horák | Pavel Smrž
COLING 2002: The 19th International Conference on Computational Linguistics

2001

pdf bib
Efficient Sentence Parsing with Language Specific Features: A Case Study of Czech
Aleš Horák | Pavel Smrž
Proceedings of the Seventh International Workshop on Parsing Technologies

2000

pdf bib
Large Scale Parsing of Czech
Pavel Smrž | Aleš Horák
Proceedings of the COLING-2000 Workshop on Efficiency In Large-Scale Parsing Systems