Skip to content
GitLab
Menu
Projects
Groups
Snippets
/
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
keeleliin
word_cloud_est
Commits
37940a17
Commit
37940a17
authored
Jul 19, 2014
by
Andreas Mueller
Browse files
Merge pull request #19 from petrushev/unicode-parsing
Parse words with unicode letters
parents
a8fbf5bc
05bb5624
Changes
1
Hide whitespace changes
Inline
Side-by-side
wordcloud/__init__.py
View file @
37940a17
...
...
@@ -178,7 +178,7 @@ def process_text(text, max_features=200, stopwords=None):
stopwords
=
STOPWORDS
d
=
{}
for
word
in
re
.
findall
(
r
"\w[\w']*"
,
text
):
for
word
in
re
.
findall
(
r
"\w[\w']*"
,
text
,
flags
=
re
.
UNICODE
):
if
word
.
isdigit
():
continue
...
...
Write
Preview
Supports
Markdown
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment