Operator (computer programming)
In computer programming, an operator is a programming language construct that provides functionality that may not be possible to define as a user-defined function (i.e. sizeof in C) or has syntax different than a function (i.e. infix addition as in a+b
). Like other programming language concepts, operator has a generally accepted, although debatable meaning among practitioners while at the same time each language gives it specific meaning in that context, and therefore the meaning varies by language.
Some operators are represented with symbols – characters typically not allowed for a function identifier – to allow for presentation that is more familiar looking than typical function syntax. For example, a function that tests for greater-than could be named gt
, but many languages provide an infix symbolic operator so that code looks more familiar. For example, this:
if gt(x, y) then return
Can be:
if x > y then return
Some languages allow a language-defined operator to be overridden with user-defined behavior and some allow for user-defined operator symbols.
Operators may also differ semantically from functions. For example, short-circuit Boolean operations evaluate later arguments only if earlier ones are not false.
Differences from functions
Syntax
Many operators differ syntactically from user-defined functions. In most languages, a function is prefix notation with fixed precedence level and associativity and often with compulsory parentheses (e.g. Func(a)
or (Func a)
in Lisp). In contrast, many operators are infix notation and involve different use of delimiters such as parentheses.
In general, an operator may be prefix, infix, postfix, matchfix, circumfix or bifix, and the syntax of an expression involving an operator depends on its arity (number of operands), precedence, and (if applicable), associativity. Most programming languages support binary operators and a few unary operators, with a few supporting more operands, such as the ?: operator in C, which is ternary. There are prefix unary operators, such as unary minus -x
, and postfix unary operators, such as post-increment x++
; and binary operations are infix, such as x + y
or x = y
. Infix operations of higher arity require additional symbols, such as the ternary operator ?: in C, written as a ? b : c
– indeed, since this is the only common example, it is often referred to as the ternary operator. Prefix and postfix operations can support any desired arity, however, such as 1 2 3 4 +
.
Semantics
The semantics of an operator may significantly differ from that of a normal function. For reference, addition is evaluated like a normal function. For example, x + y
can be equivalent to a function add(x, y)
in that the arguments are evaluated and then the functional behavior is applied. However, assignment is different. For example, given a = b
the target a
is not evaluated. Instead its value is replaced with the value of b
. The scope resolution and element access operators (as in Foo::Bar
and a.b
, respectively) operate on identifier names; not values.
In C, for instance, the array indexing operator can be used for both read access as well as assignment. In the following example, the increment operator reads the element value of an array and then assigns the element value.
++a[i];
The C++ <<
operator allows for fluent syntax by supporting a sequence of operators that affect a single argument. For example:
cout << "Hello" << " " << "world!" << endl;
ad hoc polymorphic
Some languages provide operators that are ad hoc polymorphic – inherently overloaded. For example, in Java the +
operator sums numbers or concatenates strings.
Customization
Some languages support user-defined overloading (such as C++). An operator, defined by the language, can be overloaded to behave differently based on the type of input.
Some languages (e.g. C, C++ and PHP) define a fixed set of operators, while others (e.g. Prolog, Seed7, F#, OCaml, Haskell) allow for user-defined operators. Some programming languages restrict operator symbols to special characters like
+ or:= while others allow names likediv
(e.g. Pascal).Most languages do not support user-defined operators since the feature significantly complicates parsing. Introducing a new operator changes the arity and precedence lexical specification of the language, which affects phrase-level lexical analysis. Custom operators, particularly via runtime definition, often make correct static analysis of a program impossible, since the syntax of the language may be Turing-complete, so even constructing the syntax tree may require solving the halting problem, which is impossible. This occurs for Perl, for example, and some dialects of Lisp.
If a language does allow for defining new operators, the mechanics of doing so may involve meta-programming – specifying the operator in a separate language.
Operand coercion
Some languages implicitly convert (aka coerce) operands to be compatible with each other. For example, Perl coercion rules cause 12 + "3.14"
to evaluate to 15.14
. The string literal "3.14"
is converted to the numeric value 3.14 before addition is applied. Further, 3.14
is treated as floating point so the result is floating point even though 12
is an integer literal. JavaScript follows different rules so that the same expression evaluates to "123.14"
since 12
is converted to a string which is then concatenated with the second operand.
In general, a programmer must be aware of the specific rules regarding operand coercion in order to avoid unexpected and incorrect behavior.
Examples
- Mathematical operators
- Arithmetic: such as addition,
a + b
- Relational: such as greater than,
a > b
- Logic: such as
a AND b
ora && b
- Assignment: such as
a = b
ora := b
- Three-way comparison (aka spaceship):
x <=> y
- Program structure operators
- Record or object field access: such as
a.b
- Scope resolution: such as
a::b
ora.b
- Conditional operators
- Ternary conditional:
condition ? a : b
- Elvis:
x ?: y
- Null coalesing:
x ?? y
- Notable C and C++ operators
- Address-of operator:
&x
- Dereference:
*p
- Comma:
e, f
- Compound operators
- Compound assignment (aka augmented assignment) in C/C++:
+=
,-=
,*=
,/=
,%=
,<<=
,>>=
,&=
,^=
,|=
- Fused: such as cis x = cos x + i sin x
Operator features in programming languages
The following table shows the operator features in several programming languages: