Project Preferences

The first GiDoc entry, 0: Preferences:, opens a window as that shown in Fig. 2, by which GiDoc preferences can be set on a transcription task or project basis. There are two buttons in the top part for project creation or load. The main window area, in the middle part, is divided into four tabs: Project, Preprocessing, Training and Recognition.

Figure 2: Project tab.
Image preferences0

As shown in Fig. 2, the Project tab consists of three items:

As can be seen in Fig. 3, the Preprocessing tab includes preferences for both, document and line preprocessing. Document preferences comprises two items:

Figure 3: Preprocessing tab.
Image preferences1

Line preferences refers to preprocessing and feature extraction for HMM modelling of text line images. Each text line image is first preprocessed and then transformed into a sequence of (fixed-dimension) feature vectors in accordance with the following preferences:

As it name indicates, the Training tab groups all options related to model training. As shown in Fig. 4, this tab is divided in two parts: HMMs and language model.

Figure 4: Training Options
Image preferences2

Options included in the HMMs part are:

The language model part includes the following options:

The last tab, Recognition, includes options for both, recognition and verification (see Fig. 5):

Figure 5: Recognition Options
Image preferences3

giDoc Team