ENPC Home Page
Documentation
English-Norwegian Parallel Corpus (ENPC): Manual
3rd draft, June -95
Stig Johansson/Jarle Ebeling
Table of Contents
1 Introduction
1.1 Aim
1.2 Structure of the corpus
1.2.1 Core corpus
1.2.2 Supplementary corpus
1.3 Text selection
1.4 Definition of text
1.4.1 Core corpus
1.4.2 Supplementary corpus
1.5 Methodology
1.5.1 Core corpus
1.5.2 Supplementary corpus
1.6 Availability
2 Coding
2.1 General principles
2.2 Markup levels
2.3 The header
2.3.1 File description
2.3.2 Encoding description
2.3.3 Profile description
2.3.4 Revision description
2.4 Text units
2.4.1 Text
2.4.2 Divisions
2.4.3 Paragraphs
2.4.4 S-units
2.4.5 Words
2.5 Headings and other openers
2.6 Punctuation
2.6.1 Full stop
2.6.2 Hyphen
2.6.3 Dash
2.6.4 Quotation marks
2.6.5 Apostrophe
2.7 Highlighting and quotation
2.7.1 Typographical highlighting
2.7.2 Foreign words and expressions
2.7.3 Language mention
2.7.4 Terms
2.7.5 Titles
2.7.6 Names
2.7.7 Quotations
2.7.8 Use of single (') vs double quotation marks (")
2.8 Linguistically distinct material
2.9 Notes
2.10 Lists
2.11 Figures, diagrams, and tables
2.12 Embedded texts
2.13 Editorial comment
2.13.1 Correction and regularization
2.13.2 Addition, deletion, and omission
2.14 Special characters
2.15 Page breaks
2.16 Reference system
2.17 Links
2.18 Analytic coding