Grep

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Short description Template:Lowercase title {{#invoke:Infobox|infobox}}Template:Template other {{#invoke:Check for unknown parameters|check|unknown=Template:Main other|preview=Page using Template:Infobox software with unknown parameter "_VALUE_"|ignoreblank=y| AsOf | author | background | bodystyle | caption | collapsetext | collapsible | developer | discontinued | engine | engines | genre | included with | language | language count | language footnote | latest preview date | latest preview version | latest release date | latest release version | latest_preview_date | latest_preview_version | latest_release_date | latest_release_version | licence | license | logo | logo alt | logo caption | logo class | logo size | logo title | logo upright | logo_alt | logo_caption | logo_class | logo_size | logo_title | logo_upright | middleware | module | name | operating system | operating_system | other_names | platform | programming language | programming_language | qid | released | replaced_by | replaces | repo | screenshot | screenshot alt | screenshot class | screenshot size | screenshot title | screenshot upright | screenshot_alt | screenshot_class | screenshot_size | screenshot_upright | service_name | size | standard | title | ver layout | website }}Template:Main other grep is a command-line utility for searching plaintext for lines that match a regular expression. Its name comes from the ed command g/re/p (global, regular expression, print), which has the same effect.<ref>Template:Cite book</ref><ref name="etymology">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> grep was originally developed for the Unix operating system, and is commonly available on Unix-like and some other systems such as OS-9.<ref>Template:Cite book</ref>

History

Before it was named, grep was a private utility written by Ken Thompson to search files for certain patterns. Doug McIlroy, unaware of its existence, asked Thompson to write such a program. Responding that he would think about such a utility overnight, Thompson actually corrected bugs and made improvements for about an hour on his own program called s (short for "search"). The next day he presented the program to McIlroy, who said it was exactly what he wanted. Thompson's account may explain the belief that grep was written overnight.<ref>Template:Cite AV mediaTemplate:Cbignore (35 mins)</ref>

Thompson wrote the first version in PDP-11 assembly language to help Lee E. McMahon analyze the text of The Federalist Papers to determine authorship of the individual papers.<ref>Computerphile, Where GREP Came From, interview with Brian Kernighan</ref> The ed text editor (also authored by Thompson) had regular expression support but could not be used to search through such a large amount of text, as it loaded the entire file into memory to enable random access editing, so Thompson excerpted that regexp code into a standalone tool which would instead process arbitrarily long files sequentially without buffering too much into memory.<ref name=history102 /> He chose the name because in ed, the command g/re/p, where the re is the regular expression to match, would print all lines featuring a specified pattern match.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> grep was first included in Version 4 Unix. Stating that it is "generally cited as the prototypical software tool", McIlroy credited grep with "irrevocably ingraining" Thompson's tools philosophy in Unix.<ref name="reader">Template:Cite tech report</ref>

Implementations

A variety of grep implementations are available in many operating systems and software development environments.<ref>Template:Cite tech report</ref> Early variants included egrep and fgrep, introduced in Version 7 Unix.Template:R The egrep variant supports an extended regular expression syntax added by Alfred Aho after Ken Thompson's original regular expression implementation.<ref name=Huma1988>Template:Cite journal</ref> The fgrep variant searches for any of a list of fixed strings using the Aho–Corasick string matching algorithm.<ref name=Meurant1990>Template:Cite book</ref> Binaries of these variants exist in modern systems, usually linking to grep or calling grep as a shell script with the appropriate flag added, e.g. <syntaxhighlight lang="bash" class="" style="" inline="1">exec grep -E "$@"</syntaxhighlight>. Commands egrep and fgrep, while commonly deployed on POSIX systems, to the point the POSIX specification mentions their widespread existence, are actually not part of POSIX.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Other commands contain the word "grep" to indicate they are search tools, typically ones that rely on regular expression matches. The pgrep utility, for instance, displays the processes whose names match a given regular expression.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

In the Perl programming language, grep is a built-in function that finds elements in a list that satisfy a certain property.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> This higher-order function is typically named filter or where in other languages.

Template:AnchorThe pcregrep command is an implementation of grep that uses Perl regular expression syntax.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Similar functionality can be invoked in the GNU version of grep with the -P flag.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Ports of grep (within Cygwin and GnuWin32, for example) also run under Microsoft Windows. Some versions of Windows feature the similar qgrep or findstr command.<ref>Template:Cite book</ref>

A grep command is also part of ASCII's MSX-DOS2 Tools for MSX-DOS version 2.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

The grep, egrep, and fgrep commands have also been ported to the IBM i operating system.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

The software Adobe InDesign has functions GREP (since CS3 version (2007)<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>), in the find/change dialog box<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> "GREP" tab, and introduced with InDesign CS4<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> in paragraph styles<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> "GREP styles".

agrep

{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}} agrep (approximate grep) is an open-source approximate string matching program, developed by Udi Manber and Sun Wu between 1988 and 1991,<ref>Template:Cite conference</ref> for use with the Unix operating system. It was later ported to OS/2, DOS, and Windows.

agrep matches even when the text only approximately fits the search pattern.<ref name=eGrep.SunX>Template:Cite magazine</ref>

This following invocation finds netmasks in file myfile, but also any other word that can be derived from it, given no more than two substitutions.

agrep -2 netmasks myfile

This example generates a list of matches with the closest, that is those with the fewest, substitutions listed first. The command flag -B means "best":

agrep -B netmasks myfile

Usage as a verb

In December 2003, the Oxford English Dictionary Online added "grep" as both a noun and a verb.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

A common verb usage is the phrase "You can't grep dead trees"—meaning one can more easily search through digital media, using tools such as grep, than one could with a hard copy (i.e. one made from "dead trees", which in this context is a dysphemism for paper).<ref>Jargon File, article "Documentation"</ref>

See also

Template:Portal

References

Template:Reflist

Notes

Template:Refbegin

Template:Refend

Template:Sister project

Template:Unix commands Template:Plan 9 commands

Template:Authority control