In generative grammar, a theta role or ?-role is the formal device for representing syntactic argument structure--the number and type of noun phrases--required syntactically by a particular verb. For example, the verb put requires three arguments (i.e., it is trivalent).
The formal mechanism for implementing a verb's argument structure is codified as theta roles. The verb put is said to "assign" three theta roles. This is coded in a theta grid associated with the lexical entry for the verb. The correspondence between the theta grid and the actual sentence is accomplished by means of a bijective filter on the grammar known as the theta criterion. Early conceptions of theta roles include Fillmore (1968) (Fillmore called theta roles "cases") and Gruber (1965).
Theta roles are prominent in government and binding theory and the standard theory of transformational grammar.
The term "theta role" is often used interchangeably with the term thematic relations (particularly in mainstream generative grammar--for an exception see Carnie 2006 harvnb error: no target: CITEREFCarnie2006 (help)). The reason for this is simple: theta roles typically reference thematic relations. In particular, theta roles are often referred to by the most prominent thematic relation in them. For example, a common theta role is the primary or external argument. Typically, although not always, this theta role maps to a noun phrase which bears an agent thematic relation. As such, the theta role is called the "agent" theta role. This often leads to confusion between the two notions. The two concepts, however, can be distinguished in a number of ways.
One common way of thinking about theta roles is that they are bundles of thematic relations associated with a particular argument position (Carnie 2006) harv error: no target: CITEREFCarnie2006 (help).
Theta roles are stored in a verb's theta grid. Grids typically come in two forms. The simplest and easiest to type is written as an ordered list between angle brackets. The argument associated with the external argument position (which typically ends up being the subject in active sentences) is written first and underlined. The theta roles are named by the most prominent thematic relation that they contain. In this notation, the theta grid for a verb such as give is <agent, theme, goal>.
The other notation (see for example the textbook examples in Haegeman 1994 harvnb error: no target: CITEREFHaegeman1994 (help) and Carnie 2006 harvnb error: no target: CITEREFCarnie2006 (help)) separates the theta roles into boxes, in which each column represents a theta role. The top row represents the names of the thematic relations contained in the theta role. In some work (e.g., Carnie 2006 harvnb error: no target: CITEREFCarnie2006 (help)), this box also contains information about the category associated with the theta role. This mingles theta-theory with the notion of subcategorization. The bottom row gives a series of indexes which are associated with subscripted markers in the sentence itself which indicate that the NPs they are attached to have been assigned the theta role in question.
When applied to the sentence [S[NP Susan]i gave [NP the food]j [PPto Biff]k] the indices mark that Susan is assigned the external theta role of agent/source, the food is assigned the theme role, and to Biff is assigned the goal role.
The theta criterion (or ?-criterion) is the formal device in Government and Binding Theory for enforcing the one to one match between arguments and theta roles. This acts as a filter on the D-structure of the sentence. If an argument fails to have the correct match between the number of arguments (typically NPs, PPs, or embedded clauses) and the number of theta roles, the sentence will be ungrammatical or unparseable. Chomsky's formulation (Chomsky 1981, p. 36) harv error: no target: CITEREFChomsky1981 (help) is:
The theta criterion Each argument bears one and only one ?-role, and each ?-role is assigned to one and only one argument.
Although it is often not explicitly stated, adjuncts are excluded from the theta criterion.
Drawing on observations based in typological cross-linguistic comparisons of languages (Fillmore 1968), linguists in the relational grammar (RG) tradition (e.g. Perlmutter & Postal 1984 harvnb error: no target: CITEREFPerlmutterPostal1984 (help)) observed that particular thematic relations and theta roles map on to particular positions in the sentence. For example, in unmarked situations agents map to subject positions, themes onto object position, and goals onto indirect objects. In RG, this is encoded in the Universal Alignment Hypothesis (or UAH), where the thematic relations are mapped directly into argument position based on the following hierarchy: Agent < Theme < Experiencer < Others. Mark Baker adopted this idea into GB theory in the form of the Uniformity of Theta Assignment Hypothesis (or UTAH) (Baker 1988). UTAH explains how identical thematic relationships between items are shown by identical structural relationships. A different approach to the correspondence is given in (Hale & Keyser 1993) harv error: no target: CITEREFHaleKeyser1993 (help) and (Hale & Keyser 2001) harv error: no target: CITEREFHaleKeyser2001 (help), where there are no such things as underlying theta roles or even thematic relations. Instead, the interpretive component of the grammar identifies the semantic role of an argument based on its position in the tree.
Lexical-functional grammar (LFG) (Falk 2001) harv error: no target: CITEREFFalk2001 (help) and (Bresnan 2001) harv error: no target: CITEREFBresnan2001 (help) is perhaps the most similar to Chomskyan approaches in implementing theta-roles. However, LFG uses three distinct layers of structure for representing the relations or functions of arguments-structure, a-structure (argument structure) and f-structure (functional structure) which expresses grammatical relations. These three layers are linked together using a set of intricate linking principles. Thematic relations in the ?-structure are mapped onto a set of positions in the a-structure which are tied to features [+o] (roughly "object") and [±r] (roughly "restricted" meaning it is marked explicitly by a preposition or a case marking). Themes map to [-r], second themes map to [+o] and non-themes map to [-o]. These features then determine how the arguments are mapped to specific grammatical functions in the sentence. The first [-o] argument is mapped to the SUBJ (subject) relation. If there is no [-o] argument then the first [-r] argument is mapped to the SUBJ relation. If neither of these apply, then you add the plus value ([+r] or [+o]) to the feature structure and apply the following mappings: [-o,-r]: SUBJ, [+o, -r]: Object (OBJ), [-o,+r]: prepositional marked oblique (OBL?), [+o, +r]: prepositionally marked object (OBJ?). These mappings are further constrained by the following constraints:
Function argument biuniqueness: Each a-structure role corresponds to a unique f-structure function, and each f-structure function corresponds to a unique a-structure role
The Subject Condition: Every verb must have a SUBJ
F-structures are further constrained by the following two constraints which do much of the same labor as the ?-criterion:
Coherence requires that every participant in the f-structure of a sentence must be mentioned in a-structure (or in a constituting equation) of a predicate in its clause.
Completeness: An f-structure for a sentence must contain values for all the grammatical functions mentioned in a-structure.
Head-driven phrase structure grammar (HPSG) (for a textbook introduction, see Sag, Wasow & Bender 2005 harvnb error: no target: CITEREFSagWasowBender2005 (help)) does not use theta roles per se, but divides their property into two distinct feature structures. The number and category are indicated by a feature called ARG-STR. This feature is an ordered list of categories that must cooccur with a particular verb or predicate. For example, the ARG-STR list of the verb give is <NP, NP, PP>. The semantic part of theta roles (i.e. the thematic relations) are treated in a special set of semantic restriction (RESTR) features. These typically express the semantic properties more directly than thematic relations. For example, the semantic relations associated with the arguments of the verb give are not agent, theme and goal, but giver, given, givee.
Many approaches to grammar including construction grammar and the Simpler Syntax model (Culicover & Jackendoff 2005) harv error: no target: CITEREFCulicoverJackendoff2005 (help) (see also Jackendoff's earlier work on argument structure and semantics, including Jackendoff 1983 harvnb error: no target: CITEREFJackendoff1983 (help) and Jackendoff 1990 harvnb error: no target: CITEREFJackendoff1990 (help)) claim that theta roles (and thematic relations) are neither a good way to represent the syntactic argument structure of predicates nor of the semantic properties that they reveal. They argue for more complex and articulated semantic structures (often called Lexical-conceptual structures) which map onto the syntactic structure.
Similarly, most typological approaches to grammar, functionalist theories (such as functional grammar and Role and Reference Grammar (Van Valin & La Polla 1993) harv error: no target: CITEREFVan_ValinLa_Polla1993 (help), and dependency grammar do not use theta roles, but they may make reference to thematic relations and grammatical relations or their notational equivalents. These are usually related to one another directly using principles of mapping.