Contents
1 Scope
2 Normative references
3 Terms and definitions
4 Key standards used by MAF
4.1 ISO 12620 Data Category Registry (DCR)
4.2 ISO 24610 Feature Structures (FSR and FSD)
4.3 OLAC Metadata
4.4 Unified Modeling Language (UML)
5 General characteristics of MAF
5.1 Overview
5.2 MAF Meta-Model
6 Segmenting with tokens
6.1 Standoff notation
6.2 Embedding notation
6.3 Informative attributes
6.4 Completing the embedding token notation
6.4.1 Joining tokens
6.4.2 Overlapping tokens
6.5 Formal description: token
7 Word Forms as linguistic units
7.1 Token attachment
7.1.1 One token; one word form
7.1.2 Several contiguous tokens; one word form
7.1.3 Several discontinuous tokens; one word form
7.1.4 Zero token; one word form
7.1.5 One token; several word forms
7.2 Referring lexicon entries
7.3 Compound word forms
7.4 Formal description: wordForm
8 Morpho-syntactic content
8.1 Using feature structures
8.2 Compact morpho-syntactic tags
8.2.1 FSR libraries
8.3 Designing tagsets
8.4 Formal description: tagset
9 Handling ambiguities
9.1 Word form Content Ambiguities
9.2 Lexical Ambiguities
9.3 Structural Ambiguities
9.3.1 Structural ambiguities over word forms
9.3.2 Structural ambiguities over tokens
9.4 Simplified structuring variants
9.4.1 Non ambiguous linear representation
9.4.2 Mixed linear and lattice representation
9.5 Expanding the simplified variants
9.5.1 Separating tokens and word forms
9.5.2 Wrapping into local lattices
9.5.3 Merging local lattices
9.5.4 Removing wfAlt
9.6 Formal description: wfAlt and fsm
10 Header and metadata
10.1 Formal description
A (informative) RELAX NG compact schema
A.1 Validating MAF documents
B (informative) DTD
C (informative) Illustrative examples
C.1 Tagsets
C.2 Demonstrator
D (illustrative) Morpho-syntactic Data Categories
E (informative) UML notions used within MAF
E.1 Introduction
E.2 The notion of class
E.3 The notion of attribute
E.4 The notion of relationship
E.5 The notion of association
E.6 The notion of aggregation
E.7 The notion of generalization
E.8 The notion of instance
E.9 The notion of package
E.10 Graphical notations