<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.sarg.dev/index.php?action=history&amp;feed=atom&amp;title=Explicitly_parallel_instruction_computing</id>
	<title>Explicitly parallel instruction computing - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.sarg.dev/index.php?action=history&amp;feed=atom&amp;title=Explicitly_parallel_instruction_computing"/>
	<link rel="alternate" type="text/html" href="https://wiki.sarg.dev/index.php?title=Explicitly_parallel_instruction_computing&amp;action=history"/>
	<updated>2026-04-17T20:25:00Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.44.2</generator>
	<entry>
		<id>https://wiki.sarg.dev/index.php?title=Explicitly_parallel_instruction_computing&amp;diff=408997&amp;oldid=prev</id>
		<title>imported&gt;Cldemote: Add a archive for https://www.hpl.hp.com/techreports/1999/HPL-1999-111.pdf as it is offline.</title>
		<link rel="alternate" type="text/html" href="https://wiki.sarg.dev/index.php?title=Explicitly_parallel_instruction_computing&amp;diff=408997&amp;oldid=prev"/>
		<updated>2025-11-13T05:27:51Z</updated>

		<summary type="html">&lt;p&gt;Add a archive for https://www.hpl.hp.com/techreports/1999/HPL-1999-111.pdf as it is offline.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{short description|Instruction set architecture}}&lt;br /&gt;
{{Refimprove|date=October 2010}}&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Explicitly parallel instruction computing&amp;#039;&amp;#039;&amp;#039; (&amp;#039;&amp;#039;&amp;#039;EPIC&amp;#039;&amp;#039;&amp;#039;) is a term coined in 1997 by the [[Itanium|HP–Intel alliance]]&amp;lt;ref&amp;gt;{{cite web&lt;br /&gt;
|url = http://www.hpl.hp.com/techreports/1999/HPL-1999-111.pdf&lt;br /&gt;
|format=PDF|title = EPIC: An Architecture for Instruction-Level Parallel Processors&lt;br /&gt;
|access-date = 2008-05-08&lt;br /&gt;
|last = Schlansker and Rau&lt;br /&gt;
|work = HP Laboratories Palo Alto, HPL-1999-111&lt;br /&gt;
|date=February 2000&lt;br /&gt;
|archive-date = 2022-05-19&lt;br /&gt;
|archive-url=https://web.archive.org/web/20220519162947/https://www.hpl.hp.com/techreports/1999/HPL-1999-111.pdf&lt;br /&gt;
|url-status=dead&lt;br /&gt;
}}&amp;lt;/ref&amp;gt; to describe a [[computing paradigm]] that researchers had been investigating since the early 1980s.&amp;lt;ref&amp;gt;{{cite patent|country=US|number=4847755|pubdate=1989-07-11|title=Parallel processing method and apparatus for increasing processing throughout by parallel processing low level instructions having natural concurrencies|assign1=MCC Development Ltd.|inventor1-last=Morrison|inventor1-first=Gordon E.|inventor2-last=Brooks|inventor2-first=Christopher B.|inventor3-last=Gluck|inventor3-first=Frederick G.}}&amp;lt;/ref&amp;gt; This paradigm is also called &amp;#039;&amp;#039;Independence&amp;#039;&amp;#039; architectures. It was the basis for [[Intel]] and [[Hewlett-Packard|HP]] development of the Intel [[Itanium]] architecture,&amp;lt;ref name=&amp;quot;HP_Labs&amp;quot;&amp;gt;{{cite web&lt;br /&gt;
| url = http://www.hpl.hp.com/news/2001/apr-jun/itanium.html&lt;br /&gt;
| title = Inventing Itanium: How HP Labs Helped Create the Next-Generation Chip Architecture&lt;br /&gt;
| access-date = 2007-12-14&lt;br /&gt;
| date = June 2001&lt;br /&gt;
| work = [[Hewlett-Packard|HP]] Labs&lt;br /&gt;
| archive-date = 2012-03-04&lt;br /&gt;
| archive-url = https://web.archive.org/web/20120304045612/http://www.hpl.hp.com/news/2001/apr-jun/itanium.html&lt;br /&gt;
| url-status = dead&lt;br /&gt;
}}&amp;lt;/ref&amp;gt; and [[Hewlett-Packard|HP]] later asserted that &amp;quot;EPIC&amp;quot; was merely an old term for the Itanium architecture.&amp;lt;ref name=&amp;quot;anand&amp;quot;&amp;gt;{{cite web&lt;br /&gt;
| url = http://www.anandtech.com/cpuchipsets/showdoc.aspx?i=2598&lt;br /&gt;
| archive-url = https://web.archive.org/web/20060113234912/http://www.anandtech.com/cpuchipsets/showdoc.aspx?i=2598&lt;br /&gt;
| url-status = dead&lt;br /&gt;
| archive-date = January 13, 2006&lt;br /&gt;
| title = Itanium&amp;amp;ndash;Is there light at the end of the tunnel?&lt;br /&gt;
| access-date = 2008-05-08&lt;br /&gt;
| last = De Gelas&lt;br /&gt;
| first = Johan&lt;br /&gt;
| date = November 9, 2005&lt;br /&gt;
| work = [[AnandTech]]&lt;br /&gt;
}}&amp;lt;/ref&amp;gt; EPIC permits microprocessors to execute software instructions in parallel by using the [[compiler]], rather than complex on-[[die (integrated circuit)|die]] circuitry, to control parallel instruction execution. This was intended to allow simple performance scaling without resorting to higher [[clock rate|clock frequencies]].&lt;br /&gt;
&lt;br /&gt;
==Roots in VLIW==&lt;br /&gt;
By 1989, researchers at HP recognized that [[reduced instruction set computer]] (RISC) architectures were reaching a limit at one [[instruction per cycle]].{{Clarify|date=September 2009}} They began an investigation into a new architecture, later named &amp;#039;&amp;#039;&amp;#039;EPIC&amp;#039;&amp;#039;&amp;#039;.&amp;lt;ref name=&amp;quot;HP_Labs&amp;quot;/&amp;gt; The basis for the research was [[Very long instruction word|VLIW]], in which multiple operations are encoded in every instruction, and then processed by multiple execution units.&lt;br /&gt;
&lt;br /&gt;
One goal of EPIC was to move the complexity of instruction scheduling from the CPU hardware to the software [[compiler]], which can do the instruction scheduling statically (with help of trace feedback information). This eliminates the need for complex scheduling circuitry in the CPU, which frees up space and power for other functions, including additional execution resources. An equally important goal was to further exploit [[instruction-level parallelism]] (&amp;#039;&amp;#039;ILP&amp;#039;&amp;#039;) by using the compiler to find and exploit additional opportunities for [[parallel computing|parallel execution]].&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;VLIW&amp;#039;&amp;#039; (at least the original forms) has several short-comings that precluded it from becoming mainstream:&lt;br /&gt;
* VLIW [[instruction set]]s are not [[backward compatibility|backward compatible]] between implementations. When [[wide-issue|wider]] implementations (more [[execution unit]]s) are built, the instruction set for the wider machines is not backward compatible with older, narrower implementations.&lt;br /&gt;
* Load responses from a [[memory hierarchy]] which includes [[CPU cache]]s and [[DRAM]] do not have a deterministic delay, because the compiler can&amp;#039;t predict ahead of time how high in the hierarchy a piece of data is going to be available (more specifically when data will be missing from all cache and need to be fetched from main memory). This makes static scheduling of load instructions by the compiler very difficult.&lt;br /&gt;
EPIC architecture evolved from VLIW architecture, but retained many concepts of the [[superscalar]] architecture.&lt;br /&gt;
&lt;br /&gt;
==Moving beyond VLIW==&lt;br /&gt;
&amp;#039;&amp;#039;EPIC&amp;#039;&amp;#039; architectures add several features to get around the deficiencies of&lt;br /&gt;
VLIW:&lt;br /&gt;
* Each group of multiple software instructions is called a &amp;#039;&amp;#039;bundle&amp;#039;&amp;#039;. Each of the bundles has a [[stop bit]] indicating if this set of operations is depended upon by the subsequent bundle. With this capability, future implementations can be built to issue multiple bundles in parallel. The dependency information is calculated by the compiler, so the hardware does not have to perform operand dependency checking.&lt;br /&gt;
* A software prefetch instruction is used as a type of data prefetch. This prefetch increases the chances for a cache hit for loads, and can indicate the degree of temporal locality needed in various levels of the cache.&lt;br /&gt;
* A speculative load instruction is used to speculatively load data before it is known whether it will be used (bypassing control dependencies), or whether it will be modified before it is used (bypassing data dependencies).&lt;br /&gt;
* A check load instruction aids speculative loads by checking whether a speculative load was dependent on a later store, and thus must be reloaded.&lt;br /&gt;
&lt;br /&gt;
The &amp;#039;&amp;#039;EPIC&amp;#039;&amp;#039; architecture also includes a &amp;#039;&amp;#039;grab-bag&amp;#039;&amp;#039; of architectural concepts to increase [[Instruction-level_parallelism|&amp;#039;&amp;#039;ILP&amp;#039;&amp;#039;]]:&lt;br /&gt;
* [[Branch predication|Predicated execution]] is used to decrease the occurrence of branches and to increase the [[speculative execution]] of instructions. In this feature, branch conditions are converted to predicate registers which are used to kill results of executed instructions from the side of the branch which is not taken.&lt;br /&gt;
* Delayed exceptions, using a [[Not a thing (computing)|not a thing]] bit within the general purpose registers, allow speculative execution past possible exceptions.&lt;br /&gt;
* Very large architectural [[register file]]s avoid the need for [[register renaming]].&lt;br /&gt;
* Multi-way branch instructions improve branch prediction by combining many alternative branches into one bundle.&lt;br /&gt;
&lt;br /&gt;
The [[Itanium]] architecture also added [[Register window|rotating register file]]s, a tool useful for [[software pipelining]] since it avoids having to manually [[Loop unrolling|unroll]] and rename registers.&lt;br /&gt;
&lt;br /&gt;
==Other research and development==&lt;br /&gt;
There have been other investigations into EPIC architectures that are not directly tied to the development of the Itanium architecture:&lt;br /&gt;
*The [[Impact (research group)|IMPACT]] project at [[University of Illinois at Urbana–Champaign]], led by [[Wen-mei Hwu]], was the source of much influential research on this topic.&lt;br /&gt;
*The [[PlayDoh]] architecture from HP-labs was another major research project.&lt;br /&gt;
*[[Gelato Federation|Gelato]] was an open source development community in which academic and commercial researchers worked to develop more effective compilers for Linux applications running on Itanium servers.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Complex instruction set computer]] (CISC)&lt;br /&gt;
*[[Reduced instruction set computer]] (RISC)&lt;br /&gt;
*[[Minimal instruction set computer]] (MISC)&lt;br /&gt;
*[[Very long instruction word]] (VLIW)&lt;br /&gt;
*[[No instruction set computing]]&lt;br /&gt;
*[[One-instruction set computer]]&lt;br /&gt;
*[[Computer architecture]]&lt;br /&gt;
*[[Superscalar_processor|Superscalar]]&lt;br /&gt;
*[[Wide-issue]]&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
{{reflist}}&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
* [http://www.cs.clemson.edu/~mark/epic.html Historical background for EPIC]&lt;br /&gt;
* Mark Smotherman (2002) &amp;quot;[http://www.cs.clemson.edu/~mark/464/acmse_epic.pdf Understanding EPIC Architectures and Implementations]&amp;quot;&lt;br /&gt;
&lt;br /&gt;
{{CPU technologies}}&lt;br /&gt;
&lt;br /&gt;
{{DEFAULTSORT:Explicitly Parallel Instruction Computing}}&lt;br /&gt;
[[Category:Instruction processing]]&lt;br /&gt;
[[Category:Very long instruction word computing]]&lt;/div&gt;</summary>
		<author><name>imported&gt;Cldemote</name></author>
	</entry>
</feed>