Skip to content
GitLab
Menu
Projects
Groups
Snippets
/
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
Krista Liin
TÜveakorpus
Commits
b0c2dd23
Commit
b0c2dd23
authored
May 23, 2019
by
rabauti
Browse files
DTD korpuse valideerimiseks
parent
e4d14ff0
Changes
2
Hide whitespace changes
Inline
Side-by-side
korpus_public/COMMANDS
View file @
b0c2dd23
#korpuse üheks failiks kokku koondamine
python joinXml.py > korpus_public/oppijakeel.xml
#korpuse faili formaalse korrektsuse valideerimine
#korpuse faili formaalse korrektsuse valideerimine
DTD abil
xmllint --noout korpus_public/oppijakeel.xml
...
...
@@ -39,4 +39,6 @@ cat korpus_public/oppijakeel.xml | grep -o 'tyyp="[^"]*"' | sort | uniq -c | sor
209 tyyp="tõlge"
42 tyyp="referaat"
25 tyyp="referering"
15 tyyp=""
\ No newline at end of file
15 tyyp=""
\ No newline at end of file
korpus_public/oppijakeel.dtd
0 → 100644
View file @
b0c2dd23
<!ELEMENT korpus ( header, tekst ) >
<!ELEMENT header ( #PCDATA | parandajad)* >
<!ELEMENT parandajad (#PCDATA)* >
<!ELEMENT tekst ( eksimus+ ) >
<!ELEMENT eksimus ( algne, parandus+, kommentaar )+ >
<!ATTLIST eksimus emakeel CDATA #REQUIRED >
<!ATTLIST eksimus id CDATA #REQUIRED >
<!ATTLIST eksimus tase CDATA #REQUIRED >
<!ATTLIST eksimus tyyp CDATA #REQUIRED >
<!ELEMENT algne ( #PCDATA ) >
<!ATTLIST algne id CDATA #REQUIRED >
<!ELEMENT parandus ( #PCDATA ) >
<!ATTLIST parandus id CDATA #REQUIRED >
<!ELEMENT kommentaar ( #PCDATA )* >
<!ATTLIST kommentaar id CDATA #REQUIRED >
Write
Preview
Supports
Markdown
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment