Skip to content
GitLab
Menu
Projects
Groups
Snippets
/
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
Krista Liin
TÜveakorpus
Commits
1a2eeae6
Commit
1a2eeae6
authored
Apr 04, 2019
by
rabauti
Browse files
statistika käsud
parent
9e2d75df
Changes
1
Hide whitespace changes
Inline
Side-by-side
korpus_public/COMMANDS
View file @
1a2eeae6
...
...
@@ -4,4 +4,39 @@ python joinXml.py > korpus_public/oppijakeel.xml
#korpuse faili formaalse korrektsuse valideerimine
xmllint --noout korpus_public/oppijakeel.xml
#header.xml ja muud rajad on hardcoded joinXML.py failis
\ No newline at end of file
#statistikaks
cat korpus_public/oppijakeel.xml | grep -o 'tase="[^"]*"' | sort | uniq -c | sort -nr
cat korpus_public/oppijakeel.xml | grep -o 'emakeel="[^"]*"' | sort | uniq -c | sort -nr
cat korpus_public/oppijakeel.xml | grep -o 'emakeel="[^"]*"' | sort | uniq -c | sort -nr
7528 tase="kesk"
1177 tase="määramata"
311 tase="kõrg"
13 tase="alg"
8375 emakeel="vene"
357 emakeel="soome"
152 emakeel="läti"
46 emakeel="hispaania"
44 emakeel="komi"
24 emakeel="bulgaaria"
18 emakeel="hollandi"
9 emakeel="inglise"
4 emakeel="saksa"
5752 tyyp="kirjand"
1819 tyyp="vastused"
653 tyyp="eksamitöö"
514 tyyp="refereering"
209 tyyp="tõlge"
42 tyyp="referaat"
25 tyyp="referering"
15 tyyp=""
\ No newline at end of file
Write
Preview
Supports
Markdown
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment