<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.sarg.dev/index.php?action=history&amp;feed=atom&amp;title=Collocation</id>
	<title>Collocation - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.sarg.dev/index.php?action=history&amp;feed=atom&amp;title=Collocation"/>
	<link rel="alternate" type="text/html" href="https://wiki.sarg.dev/index.php?title=Collocation&amp;action=history"/>
	<updated>2026-04-17T00:35:11Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.44.2</generator>
	<entry>
		<id>https://wiki.sarg.dev/index.php?title=Collocation&amp;diff=271753&amp;oldid=prev</id>
		<title>imported&gt;Just plain Bill: Reverted 1 edit by 62.201.255.253 (talk) to last revision by Bender the Bot</title>
		<link rel="alternate" type="text/html" href="https://wiki.sarg.dev/index.php?title=Collocation&amp;diff=271753&amp;oldid=prev"/>
		<updated>2025-09-28T11:45:40Z</updated>

		<summary type="html">&lt;p&gt;Reverted 1 edit by &lt;a href=&quot;/index.php/Special:Contributions/62.201.255.253&quot; title=&quot;Special:Contributions/62.201.255.253&quot;&gt;62.201.255.253&lt;/a&gt; (&lt;a href=&quot;/index.php?title=User_talk:62.201.255.253&amp;amp;action=edit&amp;amp;redlink=1&quot; class=&quot;new&quot; title=&quot;User talk:62.201.255.253 (page does not exist)&quot;&gt;talk&lt;/a&gt;) to last revision by Bender the Bot&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{Short description|Frequent occurrence of words next to each other}}&lt;br /&gt;
{{About|the corpus linguistics notion||Colocation (disambiguation)}}&lt;br /&gt;
&lt;br /&gt;
In [[corpus linguistics]], a &amp;#039;&amp;#039;&amp;#039;collocation&amp;#039;&amp;#039;&amp;#039; is a series of words or [[terminology|terms]] that [[co-occurrence|co-occur]] more often than would be expected by chance. In [[phraseology]], a &amp;#039;&amp;#039;&amp;#039;collocation&amp;#039;&amp;#039;&amp;#039; is a type of [[principle of compositionality|compositional]] [[phraseme]], meaning that it can be understood from the words that make it up. This contrasts with an [[idiom]], where the meaning of the whole cannot be inferred from its parts, and may be completely unrelated.&lt;br /&gt;
&lt;br /&gt;
There are about seven main types of collocations&amp;lt;!-- in english?  --&amp;gt;: adjective&amp;amp;nbsp;+&amp;amp;nbsp;noun, noun&amp;amp;nbsp;+&amp;amp;nbsp;noun (such as [[collective nouns]]), noun&amp;amp;nbsp;+&amp;amp;nbsp;verb, verb&amp;amp;nbsp;+&amp;amp;nbsp;noun, adverb&amp;amp;nbsp;+&amp;amp;nbsp;adjective, verbs&amp;amp;nbsp;+&amp;amp;nbsp;prepositional phrase ([[phrasal verb]]s), and verb&amp;amp;nbsp;+&amp;amp;nbsp;adverb.&lt;br /&gt;
&lt;br /&gt;
[[Collocation extraction]] is a computational technique that finds collocations in a document or corpus, using various [[computational linguistics]] elements resembling [[data mining]].&lt;br /&gt;
&lt;br /&gt;
==Expanded definition==&lt;br /&gt;
Collocations are partly or fully fixed expressions that become established through repeated context-dependent use. Such terms as &amp;#039;&amp;#039;crystal clear&amp;#039;&amp;#039;, &amp;#039;&amp;#039;middle management&amp;#039;&amp;#039;, &amp;#039;&amp;#039;nuclear family&amp;#039;&amp;#039;, and &amp;#039;&amp;#039;cosmetic surgery&amp;#039;&amp;#039; are examples of collocated pairs of words.&lt;br /&gt;
&lt;br /&gt;
Collocations can be in a [[syntax|syntactic]] relation (such as [[subject–verb–object|verb–object]]: &amp;#039;&amp;#039;make&amp;#039;&amp;#039; and &amp;#039;&amp;#039;decision&amp;#039;&amp;#039;), [[lexicon|lexical]] relation (such as [[antonymy]]), or they can be in no linguistically defined relation.  Knowledge of collocations is vital for the competent use of a language: a [[grammar|grammatically]] correct sentence will stand out as awkward if collocational preferences are violated.  This makes collocation a common focus for language teaching.&lt;br /&gt;
&lt;br /&gt;
Corpus linguists specify a [[Keyword (linguistics)|key word]] in context ([[key Word in Context|KWIC]]) and identify the words immediately surrounding them, to illustrate the way words are used in practice.&lt;br /&gt;
&lt;br /&gt;
The processing of collocations involves a number of parameters, the most important of which is the &amp;#039;&amp;#039;measure of association&amp;#039;&amp;#039;, which evaluates whether the [[co-occurrence]] is purely by chance or statistically [[Statistical significance|significant]].  Due to the non-random nature of language, most collocations are classed as significant, and the association scores are simply used to rank the results.  Commonly used measures of association include [[mutual information]], [[Student&amp;#039;s t-test|t scores]], and [[log-likelihood]].&amp;lt;ref&amp;gt;Dunning, Ted (1993): &amp;quot;[http://aclweb.org/anthology/J/J93/J93-1003.pdf Accurate methods for the statistics of surprise and coincidence] {{Webarchive|url=https://web.archive.org/web/20120805163029/http://www.aclweb.org/anthology/J/J93/J93-1003.pdf |date=2012-08-05 }}&amp;quot;. [[Computational Linguistics (journal)|Computational Linguistics]] 19, 1 (Mar. 1993), 61–74.&amp;lt;/ref&amp;gt;&amp;lt;ref&amp;gt;{{cite web |url=https://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html |title=Surprise and Coincidence |author=Dunning, Ted |date=2008-03-21 |publisher=blogspot.com |access-date=2012-04-09 |archive-date=2012-01-20 |archive-url=https://web.archive.org/web/20120120140321/http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html |url-status=live }}&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Rather than select a single definition, Gledhill&amp;lt;ref&amp;gt;Gledhill C. (2000): [https://books.google.com/books?id=U8FlfunUIOEC Collocations in Science Writing] {{Webarchive|url=https://web.archive.org/web/20230629054533/https://books.google.com/books?id=U8FlfunUIOEC |date=2023-06-29 }}, Narr, Tübingen&amp;lt;/ref&amp;gt; proposes that collocation involves at least three different perspectives: co-occurrence, a statistical view, which sees collocation as the recurrent appearance in a text of a node and its collocates;&amp;lt;ref&amp;gt;Firth J.R. (1957): Papers in Linguistics 1934–1951. Oxford: Oxford University Press.&amp;lt;/ref&amp;gt;&amp;lt;ref&amp;gt;Sinclair J. (1996): &amp;quot;The Search for Units of Meaning&amp;quot;, in Textus, IX, 75–106.&lt;br /&gt;
&amp;lt;/ref&amp;gt;&amp;lt;ref&amp;gt;Smadja F. A &amp;amp; McKeown, K. R. (1990): &amp;quot;[http://aclweb.org/anthology/P/P90/P90-1032.pdf Automatically extracting and representing collocations for language generation] {{Webarchive|url=https://web.archive.org/web/20150906073636/http://www.aclweb.org/anthology/P/P90/P90-1032.pdf |date=2015-09-06 }}&amp;quot;, Proceedings of ACL&amp;#039;90, 252–259, Pittsburgh, Pennsylvania.&amp;lt;/ref&amp;gt; construction, which sees collocation either as a correlation between a lexeme and a lexical-grammatical pattern,&amp;lt;ref&amp;gt;Hunston S. &amp;amp; Francis G. (2000): [https://books.google.com/books?id=nqqh46Q0uVMC&amp;amp;q=collocation Pattern Grammar — A Corpus-Driven Approach to the Lexical Grammar of English] {{Webarchive|url=https://web.archive.org/web/20230629054534/https://books.google.com/books?id=nqqh46Q0uVMC&amp;amp;q=collocation |date=2023-06-29 }}, Amsterdam, John Benjamins&amp;lt;/ref&amp;gt; or as a relation between a base and its collocative partners;&amp;lt;ref&amp;gt;Hausmann F. J. (1989): Le dictionnaire de collocations. In Hausmann F.J., Reichmann O., Wiegand H.E., Zgusta L.(eds), Wörterbücher : ein internationales Handbuch zur Lexikographie. Dictionaries. Dictionnaires. Berlin/New-York : De Gruyter. 1010–1019.&amp;lt;/ref&amp;gt; and expression, a pragmatic view of collocation as a conventional unit of expression, regardless of form.&amp;lt;ref&amp;gt;&lt;br /&gt;
Moon R. (1998): Fixed Expressions and Idioms, a Corpus-Based Approach. Oxford, Oxford University Press.&amp;lt;/ref&amp;gt;&amp;lt;ref&amp;gt;Frath P. &amp;amp; Gledhill C. (2005): &amp;quot;[https://www.academia.edu/download/28949432/Frath__Pierre___Gledhill__Christopher_2005a._Free-Range_Clusters_or_Frozen_Chunks.pdf Free-Range Clusters or Frozen Chunks? Reference as a Defining Criterion for Linguistic Units]{{dead link|date=July 2022|bot=medic}}{{cbignore|bot=medic}}&amp;quot;, in Recherches anglaises et Nord-américaines, vol. 38 :25–43&amp;lt;/ref&amp;gt; These different perspectives contrast with the usual way of presenting collocation in phraseological studies. Traditionally speaking, collocation is explained in terms of all three perspectives at once, in a continuum:&lt;br /&gt;
&lt;br /&gt;
{{block indent|Free combination ↔ bound collocation ↔ frozen idiom}}&lt;br /&gt;
&lt;br /&gt;
==In dictionaries==&lt;br /&gt;
&lt;br /&gt;
In 1933, [[Harold E. Palmer|Harold Palmer]]&amp;#039;s &amp;#039;&amp;#039;Second Interim Report on English Collocations&amp;#039;&amp;#039; highlighted the importance of collocation as a key to producing natural-sounding language, for anyone learning a [[foreign language]].&amp;lt;ref&amp;gt;Cowie, A.P., English Dictionaries for Foreign Learners, Oxford University Press 1999:54–56&amp;lt;/ref&amp;gt; Thus from the 1940s onwards, information about recurrent word combinations became a standard feature of [[Monolingual learner&amp;#039;s dictionary|monolingual learner&amp;#039;s dictionaries]]. As these dictionaries became &amp;quot;less word-centred and more phrase-centred&amp;quot;,&amp;lt;ref&amp;gt;Bejoint, H., The Lexicography of English, Oxford University Press 2010: 318&amp;lt;/ref&amp;gt; more attention was paid to collocation. This trend was supported, from the beginning of the 21st century, by the availability of large text [[Corpus linguistics|corpora]] and intelligent [[Text mining|corpus-querying software]], making it possible to provide a more systematic account of collocation in dictionaries. Using these tools, dictionaries such as the &amp;#039;&amp;#039;[[Macmillan English Dictionary for Advanced Learners|Macmillan English Dictionary]]&amp;#039;&amp;#039; and the &amp;#039;&amp;#039;[[Longman Dictionary of Contemporary English]]&amp;#039;&amp;#039; included boxes or panels with lists of frequent collocations.&amp;lt;ref&amp;gt;{{cite web|url=http://www.macmillandictionaries.com/about/med/key-features-of-the-macmillan-english-dictionary-second-edition/#7|title=MED Second Edition – Key features – Macmillan|work=macmillandictionaries.com|access-date=2011-08-24|archive-date=2020-09-28|archive-url=https://web.archive.org/web/20200928035907/http://www.macmillandictionaries.com/about/med/key-features-of-the-macmillan-english-dictionary-second-edition/#7|url-status=dead}}&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
There are also a number of [[Specialized dictionary|specialized dictionaries]] devoted to describing the frequent collocations in a language.&amp;lt;ref&amp;gt;Herbst, T. and Klotz, M. &amp;#039;Syntagmatic and Phraseological Dictionaries&amp;#039; in Cowie, A.P. (Ed.) The Oxford History of English Lexicography, 2009: part 2, 234–243&amp;lt;/ref&amp;gt; These include (for Spanish) &amp;#039;&amp;#039;Redes: Diccionario combinatorio del español contemporaneo&amp;#039;&amp;#039; (2004),  (for French) &amp;#039;&amp;#039;Le Robert: Dictionnaire des combinaisons de mots&amp;#039;&amp;#039; (2007), and (for English) the &amp;#039;&amp;#039;LTP Dictionary of Selected Collocations&amp;#039;&amp;#039; (1997) and the &amp;#039;&amp;#039;Macmillan Collocations Dictionary&amp;#039;&amp;#039; (2010).&amp;lt;ref&amp;gt;{{cite web|url=http://www.macmillandictionaries.com/features/how-dictionaries-are-written/macmillan-collocations-dictionary/|title=Macmillan Collocation Dictionary – How it was written - Macmillan|work=macmillandictionaries.com|access-date=2011-08-24|archive-date=2018-12-21|archive-url=https://web.archive.org/web/20181221182544/http://www.macmillandictionaries.com/features/how-dictionaries-are-written/macmillan-collocations-dictionary/|url-status=dead}}&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Statistically significant collocation ==&lt;br /&gt;
[[Student&amp;#039;s t-test|Student&amp;#039;s &amp;#039;&amp;#039;t&amp;#039;&amp;#039;-test]] can be used to determine whether the occurrence of a collocation in a corpus is statistically significant.&amp;lt;ref&amp;gt;{{Cite book|title=Foundations of Statistical Natural Language Processing|url=https://archive.org/details/foundationsstati00mann_118|url-access=limited|last1=Manning|first1=Chris|last2=Schütze|first2=Hinrich|publisher=MIT Press|year=1999|isbn=0262133601|location=Cambridge, MA|pages=[https://archive.org/details/foundationsstati00mann_118/page/n202 163]–166}}&amp;lt;/ref&amp;gt; For a [[bigram]] &amp;lt;math&amp;gt;w_1w_2&amp;lt;/math&amp;gt;, let &amp;lt;math&amp;gt;P(w_1) = \frac{\#w_1}{N}&amp;lt;/math&amp;gt; be the unconditional probability of occurrence of &amp;lt;math&amp;gt;w_1&amp;lt;/math&amp;gt; in a corpus with size &amp;lt;math&amp;gt;N&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;P(w_2) = \frac{\#w_2}{N}&amp;lt;/math&amp;gt; be the unconditional probability of occurrence of &amp;lt;math&amp;gt;w_2&amp;lt;/math&amp;gt; in the corpus. The t-score for the bigram &amp;lt;math&amp;gt;w_1w_2&amp;lt;/math&amp;gt; is calculated as:&lt;br /&gt;
&lt;br /&gt;
{{block indent|&amp;lt;math&amp;gt;t = \frac{\bar{x} - \mu}{\sqrt{\frac{s^2}{N}}}, &amp;lt;/math&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
where &amp;lt;math&amp;gt;\bar{x} = \frac{\# w_iw_j}{N}&amp;lt;/math&amp;gt; is the sample mean of the occurrence of &amp;lt;math&amp;gt;w_1w_2&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;\#w_1w_2&amp;lt;/math&amp;gt; is the number of occurrences of &amp;lt;math&amp;gt;w_1w_2&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;\mu = P(w_i)P(w_j)&amp;lt;/math&amp;gt; is the probability of &amp;lt;math&amp;gt;w_1w_2&amp;lt;/math&amp;gt; under the null-hypothesis that &amp;lt;math&amp;gt;w_1&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;w_2&amp;lt;/math&amp;gt; appear independently in the text, and &amp;lt;math&amp;gt;s^2 = \bar{x}(1-\bar{x}) \approx \bar{x}&amp;lt;/math&amp;gt; is the sample variance. With a large &amp;lt;math&amp;gt;N&amp;lt;/math&amp;gt;, the &amp;#039;&amp;#039;t&amp;#039;&amp;#039;-test is equivalent to a [[z-test|&amp;#039;&amp;#039;Z&amp;#039;&amp;#039;-test]].&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
{{Portal|Linguistics}}&lt;br /&gt;
{{div col|colwidth=22em}}&lt;br /&gt;
*[[English collocations]]&lt;br /&gt;
*[[Agreement (linguistics)]]&lt;br /&gt;
*[[Cliché]]&lt;br /&gt;
*[[Collocational restriction]]&lt;br /&gt;
*[[Collostructional analysis]]&lt;br /&gt;
*[[Compound noun, adjective and verb]]&lt;br /&gt;
*[[Government (linguistics)]]&lt;br /&gt;
*[[Idiom (language structure)]]&lt;br /&gt;
*[[Irreversible binomial]]&lt;br /&gt;
*[[Isocolon]]&lt;br /&gt;
*[[Lexical item]]&lt;br /&gt;
*[[N-gram]]&lt;br /&gt;
*[[Phrasal verb]]&lt;br /&gt;
*[[Phraseology]]&lt;br /&gt;
*[[Phraseme#Collocations|Phraseme]]&lt;br /&gt;
*[[Sketch Engine]]&lt;br /&gt;
*[[Statistically improbable phrase]]&lt;br /&gt;
*[[Word sketch]]&lt;br /&gt;
{{div col end}}&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
{{Reflist}}&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
{{Wiktionary|collocation}}&lt;br /&gt;
*[https://ozdic.com/ Ozdic Collocation Dictionary]&lt;br /&gt;
* [https://doi.org/10.1007%2F978-3-540-24630-5_30 A Small System Storing Spanish Collocations] (Igor A. Bolshakov &amp;amp; Sabino Miranda-Jiménez)&lt;br /&gt;
* [https://web.archive.org/web/20120317004247/http://www.cic.ipn.mx/posgrados/images/sources/cic/tesis/B001123.pdf Morphological characterization of collocations and semantic relationships in Spanish] (Sabino Miranda-Jiménez &amp;amp; Igor A. Bolshakov)&lt;br /&gt;
* [https://wordassociations.net/en/words-associated-with/surgery?button=Search Example of collocations for the word &amp;quot;Surgery&amp;quot;] at &amp;#039;&amp;#039;wordassociations.net&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
{{Authority control}}&lt;br /&gt;
&lt;br /&gt;
[[Category:Lexical units]]&lt;br /&gt;
[[Category:Language education]]&lt;br /&gt;
[[Category:Corpus linguistics]]&lt;br /&gt;
[[Category:Semantic relations]]&lt;/div&gt;</summary>
		<author><name>imported&gt;Just plain Bill</name></author>
	</entry>
</feed>