User Tools

Site Tools


cqp:practice-complex-queries-solutions

[ Collection: Introduction to CQP ]

Formulating Complex Queries – Solutions

Exercise 1

[class=“ADJ”][hw=“snow|rain” & class=“SUBST”] (82 matches)

[pos=“AJ.”][hw=“snow|rain” & pos=“NN.”] (63 matches)

Exercise 2

[word=“going”][word=“to”][class=“VERB”] (1583 matches)

Exercise 3

[pos=“XX0”][word=“about”][word=“to”] (6 matches)

Exercise 4

The exact name of the of the value of the structural attribute is W:ac:medicine.

This query should return 61 matches in the BNC-BABY: [hw=“heart”] :: match.text_genre=“W:ac:medicine”

Which common constructions can be seen in the matches?

From a cursory look at the concordance, one could see that “heart failure” is common in medical texts, whereas constructions such has “her heart sank”, “her heart leaped”, “her heart dropped” seem more common in prose.

Exercise 5

There are two ways to match past participles via pos-tags in the BNC-BABY, as the CLAWS-5 tagset differentiates between the past participle of the verbs to have VHN, to be VBN and to do VDN and the participle of “lexical verbs” VVN, i.e. all others.

Depending on whether you included the past participles of non-lexical verbs or not, you might end up with either one of the constructions.

[hw=“have”][pos=“VVN”] (22699 matches)

[hw=“have”][pos=“VVN|VHN|VBN|VDN”] (33527 matches)

To generate a frequency list of lemmas in the participle slot: count Last by word %c on match[1]

The frequency list of the first query:

4541    got  
808     gone  
590     seen  
477     come  
476     made  
441     taken  
391     said  
304     become  
229     given  
225     told 

The frequency list of the second query:

8822    been  
4541    got  
1155    had  
850     done  
808     gone  
590     seen  
477     come  
476     made  
441     taken  
391     said 

Exercise 6

This query may have to be adjusted gradually, these are possible ways to start out:

[hw=“drive”][pos=“PNP|NP0”] (92 matches)

[hw=“drive”][class=“SUBST”][class=“ADJ”] (53 matches)

The previous queries match too many sentences that aren’t instantiations of the constructions. This is a possible way to narrow it down to people in the noun slot. Note that PNP is the pos-tag for personal pronouns and NP0 is the pos-tag for proper nouns, like Michael or NHS:

[hw=“drive”][pos=“PNP|NP0”][class=“ADJ”] (13 matches)

Which of the construction’s slots remain the same, which are variable?

The lemma drive stays the same, the host-class is adjective. Note, however, that this query would not match instantiations like “driving me bananas”.

What is the meaning of the construction?

To cause someone to “lose their mind”, as it were, to be angry or upset.

Exercise 7

Query for venomous: [hw=“venomous”][class=“SUBST”] (91 matches)

Frequency list for venomous:

count Last by hw on match[1]

18	snake  
7 	animal  
5 	attack  
4 	bite  
3 	look  
3 	shot  
3 	spine  
2 	creature  
2 	dart  
2 	glance  

Query and frequency list for toxic (956 matches):

[hw="toxic"][class="SUBST"] 

286	waste  
80	chemical  
46	substance  
45	shock  
43	effect  
28	gas  
26	fume  
16	material  
16	metal  
15	emission  

Query and frequency list for poisonous (290 matches):

[hw="poisonous"][class="SUBST"] 

31	gas  
15	snake  
13	plant  
13	substance  
10	chemical  
9 	waste  
8 	animal  
7 	creature  
6 	fume  
5 	atmosphere  
4 	bite  
4 	secretion  

Compare lists. What differences do you see?

  1. venoumous tends to occur with animals or animal-related words
  2. toxic tends to occur with substances and inanimate objects
  3. poisonous tends to occur with both substances and animals

More exercises

cqp/practice-complex-queries-solutions.txt · Last modified: 2024/06/20 13:53 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki