Monday, February 17, 2014

NLP - Arabic Parsing Overview (1/6)

Arabic Natural Language Parsing - overview

In the coming posts a selected set of grammar formalisms are briefed. These formalisms have been applied to the Arabic language. Projects that are claimed to have wide coverage over the Arabic grammar by their writers are listed. Most of the rule based and some of the corpus based projects with their formalisms are selected to be briefed.

2.1. Overview

The most complex task that a natural language parser has to perform is syntactic analysis. The two main parts of the syntactic analysis component of a parser are the grammar and the parsing algorithm. The grammar encodes the linguistic rules and specifies how each sentence is constructed from its parts. The parsing algorithm applies the defined rules by the grammar to a given input. The output scheme defines the format of the parser’s output.
Grammar can be characterized and compared in many different ways. Two characterizing factors are presented in the next section. The first depends on linguistic properties and the other comes from processing perspective. Briefs for the most well-known formalisms that have been successfully applied are presented in the coming posts.

No comments:

Post a Comment