It will be carried out using contrastive tools:
WordNet is the leading linguistic database worldwide, and every larger linguistic project has to take a position on it. Unfortunately, there are several reasons why WordNet cannot be used directly as a starting point:
Overview of archives
It is described in the files 1_wn.cpp and WN_ALL.000.
File size comparison
versions 1.6 to 3.0
Filename | WordNet 1.6 | WordNet 2.0 | WordNet 2.1 | WordNet 3.0 | Processed |
LEXNAMES | 1 kB | 1 kB | | | No |
ADJ.DAT | 2 887 kB | 2 989 kB | 3 240 kB | 3 155 kB | Yes |
ADV.DAT | 510 kB | 522 kB | 520 kB | 517 kB | Yes |
NOUN.DAT | 10 989 kB | 14 434 kB | 15 100 kB | 15 300 kB | Yes |
VERB.DAT | 1 894 kB | 2 690 kB | 2 747 kB | 2 773 kB | Yes |
ADJ.IDX | 768 kB | 835 kB | | | No |
ADV.IDX | 165 kB | 174 kB | | | No |
GLOSS.IDX | 5 709 kB | does not exist | | | No |
NOUN.IDX | 3 779 kB | 4 752 kB | | | No |
SENSE.IDX | 6 125 kB | 7 375 kB | | | No |
VERB.IDX | 467 kB | 525 kB | | | No |
ADJ.EXC | 20 kB | 24 kB | | | Yes |
ADV.EXC | 720 B | 920 B | | | No |
COUSIN.EXC | 41 kB | does not exist | | | No |
COUSIN.TPS | 4 kB | does not exist | | | No |
NOUN.EXC | 110 kB | 40 kB | | | Yes |
VERB.EXC | 83 kB | 40 kB | | | Yes |
STOPLIST.PL | 9 kB | does not exist | | | No |
SENTIDX.VRB | 59 kB | 76 kB | | | No |
SENTS.VRB | 5 kB | 6 kB | | | No |
CNTLIST | 897 kB | 939 kB | | | No |
CNTLIST.REV | does not exist | 946 kB | | | No |
For version 2.0, the file WNgrind-2.0.tar[1] (16 760 KB) is also available for download; it contains data for lexicographic editing and programs written in Perl.
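The growth of the processed data files between releases can be read off the size table with a little arithmetic. The following is only a minimal sketch: the helper names `parse_kb` and `growth_percent` are hypothetical, and the sizes are copied by hand from the WordNet 1.6 and WordNet 3.0 columns of the table.

```python
def parse_kb(text: str) -> int:
    """Convert a size such as '10 989 kB' into an integer number of kB."""
    digits = text.replace("kB", "").replace("\u00a0", "").replace(" ", "")
    return int(digits)


def growth_percent(old: str, new: str) -> float:
    """Relative growth from the old size to the new size, in percent."""
    old_kb, new_kb = parse_kb(old), parse_kb(new)
    return 100.0 * (new_kb - old_kb) / old_kb


# (WordNet 1.6, WordNet 3.0) sizes of the four *.DAT files from the table.
sizes = {
    "ADJ.DAT": ("2 887 kB", "3 155 kB"),
    "ADV.DAT": ("510 kB", "517 kB"),
    "NOUN.DAT": ("10 989 kB", "15 300 kB"),
    "VERB.DAT": ("1 894 kB", "2 773 kB"),
}

growth = {name: growth_percent(old, new) for name, (old, new) in sizes.items()}

for name, pct in growth.items():
    print(f"{name}: {pct:+.1f} %")
```

Under these figures the noun and verb data grow far more than the adjective and adverb data, which is worth keeping in mind when sizing any structure that loads NOUN.DAT.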
{ entity, physical thing, (that which is perceived or known or inferred to have its own physical existence (living or nonliving)) }
{ psychological_feature, (a feature of the mental life of a living organism) }
{ abstraction, (a general concept formed by extracting common features from specific examples) }
{ state, (the way something is with respect to its main attributes; "the current state of knowledge"; "his state of health"; "in a weak financial state") }
{ event, (something that happens at a given place and time) }
{ act, human_action, human_activity, (something that people do or cause to happen) }
{ group, grouping, (any number of entities (members) considered as a unit) }
{ possession, (anything owned or possessed) }
{ phenomenon, (any state or process known through the senses rather than by intuition or reasoning) }
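Entries of the shape shown above (a brace-delimited list of word forms followed by a parenthesised gloss) can be split into their parts mechanically. The sketch below is an illustration only: `parse_entry` and `ENTRY_RE` are hypothetical names, and the pattern covers just the simplified form printed here, not the full lexicographer file syntax with its pointers and other markers.

```python
import re

# Word forms come first, separated by commas; the gloss is the last
# parenthesised group before the closing brace and may itself contain
# nested parentheses, so it is matched greedily.
ENTRY_RE = re.compile(
    r"\{\s*(?P<words>[^(]*?),?\s*\((?P<gloss>.*)\)\s*\}",
    re.DOTALL,
)


def parse_entry(text: str) -> tuple[list[str], str]:
    """Return (word forms, gloss) for one '{ ... (gloss) }' entry."""
    match = ENTRY_RE.search(text)
    if match is None:
        raise ValueError("not a '{ words, (gloss) }' entry")
    words = [w.strip() for w in match.group("words").split(",") if w.strip()]
    # Collapse line breaks and repeated spaces inside the gloss.
    gloss = " ".join(match.group("gloss").split())
    return words, gloss


sample = """{ group, grouping, (any
number of entities (members) considered as a unit) }"""

words, gloss = parse_entry(sample)
print(words)
print(gloss)
```

The gloss is normalised to a single line because the entries may be wrapped across several lines, as in the sample.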
WordNet lexical categories and sample words
Category | Sample words | | | UNAMBIGUOUS WORDS & PHRASES | TOTAL WORDS & PHRASES |
I. ADVERBS | |||||
All | | | | 4051 | 4667 |
II. ADJECTIVES | |||||
All | | | | 14273 | 17553 |
Pertainyms | | | | 2837 | 4247 |
Participial | | | | 87 | 127 |
III. NOUNS | |||||
Act | | | | 5527 | 9158 |
Animal | | | | 13127 | 14181 |
Artifact | | | | 12374 | 15864 |
Attribute | | | | 2804 | 4581 |
Body | | | | 2892 | 3447 |
Cognition | | | | 2640 | 4225 |
Communication | | | | 4877 | 7847 |
Event | | | | 620 | 1613 |
Feeling | | | | 309 | 744 |
Food | | | | 2344 | 3504 |
Group | | | | 2771 | 3757 |
Location | | | | 3732 | 4588 |
Motive | | | | 47 | 77 |
Object | | | | 1528 | 2156 |
Person | | | | 14836 | 17987 |
Phenomenon | | | | 646 | 960 |
Plant | | | | 16781 | 17701 |
Possession | | | | 1015 | 1483 |
Process | | | | 703 | 1106 |
Quantity | | | | 1261 | 1860 |
Relation | | | | 376 | 635 |
Shape | | | | 229 | 524 |
State | | | | 3957 | 5469 |
Substance | | | | 3703 | 4469 |
Time | | | | 1221 | 1592 |
IV. VERBS | |||||
Body | bungle, disinfect, hibernate | 154 | 759 | 282 | 960 |
Change | amplify, blossom, crystallize | 796 | 2233 | 1049 | 2702 |
Cognition | amaze, brainstorm, diagnose | 196 | 849 | 317 | 1053 |
Communication | applaud, argue, circumstantiate | 465 | 1816 | 711 | 2214 |
Competition | capitulate, equalize, outrival | 77 | 489 | 142 | 609 |
Consumption | cater, gratify, quench | 56 | 305 | 111 | 398 |
Contact | attach, bulldoze, collide | 367 | 1993 | 650 | 2479 |
Creation | annotate, confect, enact | 128 | 732 | 196 | 873 |
Emotion | agonize, condole, exacerbate | 102 | 505 | 191 | 639 |
Motion | clump, deflect, eject | 159 | 1177 | 441 | 1657 |
Perception | cense, conceal, hallucinate | 87 | 543 | 152 | 647 |
Possession | allocate, donate, hospitalize | 155 | 806 | 269 | 1024 |
Social | abolish, celebrate, demote | 248 | 1271 | 492 | 1669 |
Stative | coexist, embody, incriminate | 95 | 746 | 249 | 1014 |
Weather | freeze, ignite, pour | 3 | 94 | 23 | 130 |
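The two counts under UNAMBIGUOUS WORDS & PHRASES and TOTAL WORDS & PHRASES give a rough measure of how ambiguous each category is. The sketch below computes the unambiguous share for a few rows. It assumes that in the verb rows the last two numbers belong to those two headings (the headings of the other two numeric columns are not given in the table); the name `unambiguous_share` is hypothetical.

```python
def unambiguous_share(unambiguous: int, total: int) -> float:
    """Fraction of words and phrases in a category that have a single sense."""
    return unambiguous / total


# (unambiguous, total) pairs copied from the table.
# Noun rows: the only two numbers in the row.
# Verb rows: ASSUMPTION - the last two numbers of the row.
rows = {
    "noun/Plant": (16781, 17701),
    "noun/Event": (620, 1613),
    "noun/Motive": (47, 77),
    "verb/Body": (282, 960),
    "verb/Motion": (441, 1657),
    "verb/Weather": (23, 130),
}

shares = {name: unambiguous_share(u, t) for name, (u, t) in rows.items()}

# List the categories from most ambiguous to least ambiguous.
for name, share in sorted(shares.items(), key=lambda item: item[1]):
    print(f"{name:<14} {share:6.1%} unambiguous")
```

On these sample rows the verb categories come out markedly more ambiguous than a taxonomic noun category such as Plant, where nearly every entry has a single sense.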