For full functionality of Sketch Engine it is necessary to
enable JavaScript
Monolingual_yrk
Monolingual_yrk
Reset settings
English
česky
slovensky
简体中文
繁體中文
Gaeilge
slovenščina
hrvatski
العربية
español
français
українська
polski
Search
Word list
Corpus info
User guide
All words
Menu position
This action may take several minutes for large corpora, please wait.
Word list options
Subcorpus:
create new
Search attribute:
word
doc.id
doc.genre
doc.source
doc.gender
doc.dialect
use n-grams
. Value of n: from
2
3
4
5
6
to
2
3
4
5
6
hide/nest sub-n-grams
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt), encoded in UTF-8, with one item per line. The items must correspond to the selected attribute, so, eg, if 'lemma' is selected from the attribute menu, then the list should be a list of lemmas. We use exact matching, not regular-expression matching, for file input.
Include non-words
Output options:
Frequency figures:
Hit counts
Document counts
ARF
Output type:
Simple
Keywords
Reference (sub)corpus
Monolingual_yrk
(whole corpus)
Prefer:
rare words
common words
Change output attribute(s)
---
word
---
word
---
word
You can select one or more output attributes. Please note that this option can be time-consuming.