Splitting+and+joining+words

=Splitting and joining words=

 In general, words are split and joined according to the guidelines described for the [|Penn Historical Corpora]. Some special categories are noted below; conventions regarding specific words can be found under individual words.


 * **DAZU, DAHIN, HERBEI, etc.** are split into ADV and P. This is unlike the guidelines for the Penn Parsed Corpora of English.


 * **Separable verbal particles** (RP) are always split from the verb.


 * **Inseparable verbal particles** (ver-, zur-, be-, etc.) are not annotated as RP, and are joined to the verb if written separately, i.e. zur=treten.


 * **WO+P** (WOMIT, etc.) are joined into WADV+P.


 * **YM/YHM, ZUM, VOM, YNß (INS), etc.** are split into P and D. For simplicity, the split D is always written as a definite ("dem," "das," etc.) even when the interpretation is more likely to be indefinite.