The CLAWS 5 tagset is known mainly for its use in the 1994 British National Corpus (BNC).
Tag | Description |
---|---|
AJ0 | Adjective, general or positive (e.g. good, old, beautiful) |
AJC | Comparative adjective (e.g. better, older) |
AJS | Superlative adjective (e.g. best, oldest) |
AT0 | Article (e.g. the, a, an, no) |
AV0 | General adverb: an adverb not subclassified as AVP or AVQ (e.g. often, well, longer (adv.) |
AVP | Adverb particle (e.g. up, off, out) |
AVQ | Wh-adverb (e.g. when, where, how, why, wherever) |
CJC | Coordinating conjunction (e.g. and, or, but) |
CJS | Subordinating conjunction (e.g. although, when) |
CJT | The subordinating conjunction that |
CRD | Cardinal number (e.g. one, 3, fifty-five, 3609) |
DPS | Possessive determiner (e.g. your, their, his) |
DT0 | General determiner: a determiner not subclassified as DTQ (e.g. this, that, all, some, these, any, such, many) |
DTQ | Wh-determiner (e.g. which, what, whose, whichever) |
EX0 | Existential there |
ITJ | Interjection or other isolate (e.g. oh, yes, mhm, wow) |
NN0 | Common noun, neutral for number (e.g. aircraft, data, committee) |
NN1 | Singular common noun (e.g. pencil, goose, time, revelation) |
NN2 | Plural common noun (e.g. pencils, geese, times, revelations) |
NP0 | Proper noun (e.g. London, Michael, Mars, IBM) |
ORD | Ordinal numeral (e.g. first, sixth, 77th, last) |
PNI | Indefinite pronoun (e.g. none, everything, one, nobody) |
PNP | Personal pronoun (e.g. I, you, them, ours) |
PNQ | Wh-pronoun (e.g. who, whoever, whom) |
PNX | Reflexive pronoun (e.g. myself, yourself, itself, ourselves) |
POS | The possessive or genitive marker 's or ' |
PRF | The preposition of |
PRP | Preposition, except for of (e.g. about, at, in, on, with) |
PUL | Punctuation: left bracket |
PUN | Punctuation: general separating mark - i.e. . , ! , : ; - or ? |
PUQ | Punctuation: quotation mark |
PUR | Punctuation: right bracket |
TO0 | Infinitive marker to |
UNC | Unclassified items, e.g. foreign words, special typographical symbols, formulae, and hesitation markers like er and erm |
VBB | The present tense forms of the verb be, except for is, 's |
VBD | The past tense forms of the verb be, i.e. was and were |
VBG | The -ing form of the verb be, i.e. being |
VBI | The infinitive form of the verb be, i.e. be |
VBN | The past participle form of the verb be, i.e. been |
VBZ | The -s form of the verb be, i.e. is, 's |
VDB | The finite base form of the verb be, i.e. do |
VDD | The past tense form of the verb do, i.e. did |
VDG | The -ing form of the verb do, i.e. doing |
VDI | The infinitive form of the verb do, i.e. do |
VDN | The past participle form of the verb do, i.e. done |
VDZ | The -s form of the verb do, i.e. does, 's |
VHB | The finite base form of the verb have, i.e. have, 've |
VHD | The past tense form of the verb have, i.e. had, 'd |
VHG | The -ing form of the verb have, i.e. having |
VHI | The infinitive form of the verb have, i.e. have |
VHN | The past participle form of the verb have, i.e. had |
VHZ | The -s form of the verb have, i.e. has, 's |
VM0 | Modal auxiliary verb (e.g. will, would, can, could, 'll, 'd) |
VVB | The finite base form of lexical verbs (e.g. forget, send, live, return) |
VVD | The past tense form of lexical verbs (e.g. forgot, sent, lived, returned) |
VVG | The -ing form of lexical verbs (e.g. forgetting, sending, living, returning) |
VVI | The infinitive form of lexical verbs (e.g. forget, send, live, return) |
VVN | The past participle form of lexical verbs (e.g. forgotten, sent, lived, returned) |
VVZ | The -s form of lexical verbs (e.g. forgets, sends, lives, returns) |
XX0 | The negative particle, i.e. not or n't |
ZZ0 | Alphabetical symbols (e.g. A, a, B, b, c, d) |