ECMA-262 3rd Edition December 1999
ECMA-262 3rd Edition December 1999
ECMA-262 3rd Edition December 1999
ECMAScript Language
Specification
Phone: +41 22 849.60.00 - Fax: +41 22 849.60.01 - URL: http://www.ecma.ch - Internet: [email protected]
Standard ECMA-262
3 r d Edition - December 1999
ECMAScript Language
Specification
Phone: +41 22 849.60.00 - Fax: +41 22 849.60.01 - URL: http://www.ecma.ch - Internet: [email protected]
MB Ecma-262.doc 08-04-02 16,53
Brief History
This ECMA Standard is based on several originating technologies, the most well known being JavaScript (Netscape)
and JScript (Microsoft). The language was invented by Brendan Eich at Netscape and first appeared in that
company’s Navigator 2.0 browser. It has appeared in all subsequent browsers from Netscape and in all browsers from
Microsoft starting with Internet Explorer 3.0.
The development of this Standard started in November 1996. The first edition of this ECMA Standard was adopted by
the ECMA General Assembly of June 1997.
That ECMA Standard was submitted to ISO/IEC JTC 1 for adoption under the fast-track procedure, and approved as
international standard ISO/IEC 16262, in April 1998. The ECMA General Assembly of June 1998 approved the
second edition of ECMA-262 to keep it fully aligned with ISO/IEC 16262. Changes between the first and the second
edition are editorial in nature.
The current document defines the third edition of the Standard and includes powerful regular expressions, better
string handling, new control statements, try/catch exception handling, tighter definition of errors, formatting for
numeric output and minor changes in anticipation of forthcoming internationalisation facilities and future language
growth.
Work on the language is not complete. The technical committee is working on significant enhancements, including
mechanisms for scripts to be created and used across the Internet, and tighter coordination with other standards bodies
such as groups within the World Wide Web Consortium and the Wireless Application Protocol Forum.
This Standard has been adopted as 3rd Edition of ECMA-262 by the ECMA General Assembly in December, 1999.
- i -
Table of contents
1 Scope 1
2 Conformance 1
3 References 1
4 Overview 1
4.1 Web Scripting 2
4.2 Language Overview 2
4.2.1 Objects 3
4.3 Definitions 4
4.3.1 Type 4
4.3.2 Primitive Value 4
4.3.3 Object 4
4.3.4 Constructor 4
4.3.5 Prototype 4
4.3.6 Native Object 4
4.3.7 Built-in Object 4
4.3.8 Host Object 4
4.3.9 Undefined Value 4
4.3.10 Undefined Type 4
4.3.11 Null Value 4
4.3.12 Null Type 4
4.3.13 Boolean Value 4
4.3.14 Boolean Type 4
4.3.15 Boolean Object 5
4.3.16 String Value 5
4.3.17 String Type 5
4.3.18 String Object 5
4.3.19 Number Value 5
4.3.20 Number Type 5
4.3.21 Number Object 5
4.3.22 Infinity 5
4.3.23 NaN 5
5 Notational Conventions 6
5.1 Syntactic and Lexical Grammars 6
5.1.1 Context-Free Grammars 6
5.1.2 The Lexical and RegExp Grammars 6
5.1.3 The Numeric String Grammar 6
5.1.4 The Syntactic Grammar 6
5.1.5 Grammar Notation 7
5.2 Algorithm Conventions 9
6. Source Text 10
7 Lexical Conventions 11
7.1 Unicode Format-Control Characters 11
7.2 White Space 11
7.3 Line Terminators 12
7.4 Comments 12
- ii -
7.5 Tokens 13
7.5.1 Reserved Words 13
7.5.2 Keywords 13
7.5.3 Future Reserved Words 14
7.6 Identifiers 14
7.7 Punctuators 15
7.8 Literals 16
7.8.1 Null Literals 16
7.8.2 Boolean Literals 16
7.8.3 Numeric Literals 16
7.8.4 String Literals 18
7.8.5 Regular Expression Literals 20
7.9 Automatic Semicolon Insertion 21
7.9.1 Rules of Automatic Semicolon Insertion 21
7.9.2 Examples of Automatic Semicolon Insertion 22
8 Types 24
8.1 The Undefined Type 24
8.2 The Null Type 24
8.3 The Boolean Type 24
8.4 The String Type 24
8.5 The Number Type 24
8.6 The Object Type 25
8.6.1 Property Attributes 25
8.6.2 Internal Properties and Methods 26
8.7 The Reference Type 29
8.7.1 GetValue (V) 29
8.7.2 PutValue (V, W) 29
8.8 The List Type 30
8.9 The Completion Type 30
9 Type Conversion 30
9.1 ToPrimitive 30
9.2 ToBoolean 30
9.3 ToNumber 31
9.3.1 ToNumber Applied to the String Type 31
9.4 ToInteger 34
9.5 ToInt32: (Signed 32 Bit Integer) 34
9.6 ToUint32: (Unsigned 32 Bit Integer) 34
9.7 ToUint16: (Unsigned 16 Bit Integer) 35
9.8 ToString 35
9.8.1 ToString Applied to the Number Type 35
9.9 ToObject 36
10 Execution Contexts 37
10.1 Definitions 37
10.1.1 Function Objects 37
10.1.2 Types of Executable Code 37
10.1.3 Variable Instantiation 37
10.1.4 Scope Chain and Identifier Resolution 38
10.1.5 Global Object 38
10.1.6 Activation Object 38
- iii -
10.1.7 This 39
10.1.8 Arguments Object 39
10.2 Entering An Execution Context 39
10.2.1 Global Code 39
10.2.2 Eval Code 39
10.2.3 Function Code 39
11 Expressions 40
11.1 Primary Expressions 40
11.1.1 The this Keyword 40
11.1.2 Identifier Reference 40
11.1.3 Literal Reference 40
11.1.4 Array Initialiser 40
11.1.5 Object Initialiser 41
11.1.6 The Grouping Operator 42
11.2 Left-Hand-Side Expressions 43
11.2.1 Property Accessors 43
11.2.2 The new Operator 44
11.2.3 Function Calls 44
11.2.4 Argument Lists 45
11.2.5 Function Expressions 45
11.3 Postfix Expressions 45
11.3.1 Postfix Increment Operator 45
11.3.2 Postfix Decrement Operator 45
11.4 Unary Operators 46
11.4.1 The delete Operator 46
11.4.2 The void Operator 46
11.4.3 The typeof Operator 46
11.4.4 Prefix Increment Operator 47
11.4.5 Prefix Decrement Operator 47
11.4.6 Unary + Operator 47
11.4.7 Unary - Operator 47
11.4.8 Bitwise NOT Operator ( ~ ) 48
11.4.9 Logical NOT Operator ( ! ) 48
11.5 Multiplicative Operators 48
11.5.1 Applying the * Operator 48
11.5.2 Applying the / Operator 49
11.5.3 Applying the % Operator 49
11.6 Additive Operators 50
11.6.1 The Addition operator ( + ) 50
11.6.2 The Subtraction Operator ( - ) 50
11.6.3 Applying the Additive Operators ( +,- ) to Numbers 51
11.7 Bitwise Shift Operators 51
11.7.1 The Left Shift Operator ( << ) 51
11.7.2 The Signed Right Shift Operator ( >> ) 51
11.7.3 The Unsigned Right Shift Operator ( >>> ) 52
11.4 Relational Operators 52
11.8.1 The Less-than Operator ( < ) 53
11.8.2 The Greater-than Operator ( > ) 53
11.8.3 The Less-than-or-equal Operator ( <= ) 53
11.8.4 The Greater-than-or-equal Operator ( >= ) 53
11.8.5 The Abstract Relational Comparison Algorithm 53
- iv -
12 Statements 61
12.1 Block 61
12.2 Variable statement 62
12.3 Empty Statement 63
12.4 Expression Statement 63
12.5 The if Statement 63
12.6 Iteration Statements 64
12.6.1 The do-while Statement 64
12.6.2 The while statement 64
12.6.3 The for Statement 65
12.6.4 The for-in Statement 65
12.7 The continue Statement 66
12.8 The break Statement 67
12.9 The return Statement 67
12.10 The with Statement 67
12.11 The switch Statement 68
12.12 Labelled Statements 69
12.13 The throw statement 69
12.14 The try statement 70
13 Function Definition 71
13.1 Definitions 72
13.1.1 Equated Grammar Productions 72
13.1.2 Joined Objects 72
13.2 Creating Function Objects 72
13.2.1 [[Call]] 73
13.2.2 [[Construct]] 74
14 Program 75
16 Errors 149
2 Conformance
A conforming implementation of ECMAScript must provide and support all the types, values, objects,
properties, functions, and program syntax and semantics described in this specification.
A conforming implementation of this International standard shall interpret characters in conformance with the
Unicode Standard, Version 2.1 or later, and ISO/IEC 10646-1 with either UCS-2 or UTF-16 as the adopted
encoding form, implementation level 3. If the adopted ISO/IEC 10646-1 subset is not otherwise specified, it is
presumed to be the BMP subset, collection 300. If the adopted encoding form is not otherwise specified, it
presumed to be the UTF-16 encoding form.
A conforming implementation of ECMAScript is permitted to provide additional types, values, objects,
properties, and functions beyond those described in this specification. In particular, a conforming
implementation of ECMAScript is permitted to provide properties not described in this specification, and
values for those properties, for objects that are described in this specification.
A conforming implementation of ECMAScript is permitted to support program and regular expression syntax
not described in this specification. In particular, a conforming implementation of ECMAScript is permitted to
support program syntax that makes use of the “future reserved words” listed in 7.5.3 of this specification.
3 References
ISO/IEC 9899:1996 Programming Languages – C, including amendment 1 and technical corrigenda 1 and 2.
ISO/IEC 10646-1:1993 Information Technology -- Universal Multiple-Octet Coded Character Set (UCS) plus
its amendments and corrigenda.
Unicode Inc. (1996), The Unicode Standard , Version 2.0. ISBN: 0-201-48345-9, Addison-Wesley
Publishing Co., Menlo Park, California.
Unicode Inc. (1998), Unicode Technical Report #8: The Unicode Standard , Version 2.1.
Unicode Inc. (1998), Unicode Technical Report #15: Unicode Normalization Forms.
ANSI/IEEE Std 754-1985: IEEE Standard for Binary Floating-Point Arithmetic. Institute of Electrical and
Electronic Engineers, New York (1985).
4 Overview
This section contains a non-normative overview of the ECMAScript language.
ECMAScript is an object-oriented programming language for performing computations and manipulating
computational objects within a host environment. ECMAScript as defined here is not intended to be
computationally self-sufficient; indeed, there are no provisions in this specification for input of external data
or output of computed results. Instead, it is expected that the computational environment of an ECMAScript
program will provide not only the objects and other facilities described in this specification but also certain
environment-specific host objects, whose description and behaviour are beyond the scope of this specification
except to indicate that they may provide certain properties that can be accessed and certain functions that can
be called from an ECMAScript program.
A scripting language is a programming language that is used to manipulate, customise, and automate the
facilities of an existing system. In such systems, useful functionality is already available through a user
interface, and the scripting language is a mechanism for exposing that functionality to program control. In this
way, the existing system is said to provide a host environment of objects and facilities, which completes the
capabilities of the scripting language. A scripting language is intended for use by both professional and non-
professional programmers. To accommodate non-professional programmers, some aspects of the language
may be somewhat less strict.
- 2 -
ECMAScript was originally designed to be a Web scripting language, providing a mechanism to enliven Web
pages in browsers and to perform server computation as part of a Web-based client-server architecture.
ECMAScript can provide core scripting capabilities for a variety of host environments, and therefore the core
scripting language is specified in this document apart from any particular host environment.
Some of the facilities of ECMAScript are similar to those used in other programming languages; in particular
Java¥ and Self, as described in:
• Gosling, James, Bill Joy and Guy Steele. The Java Language Specification. Addison Wesley Publishing
Co., 1996.
• Ungar, David, and Smith, Randall B. Self: The Power of Simplicity. OOPSLA '87 Conference
Proceedings, pp. 227–241, Orlando, FL, October 1987.
4.1 Web Scripting
A web browser provides an ECMAScript host environment for client-side computation including, for
instance, objects that represent windows, menus, pop-ups, dialog boxes, text areas, anchors, frames,
history, cookies, and input/output. Further, the host environment provides a means to attach scripting code
to events such as change of focus, page and image loading, unloading, error and abort, selection, form
submission, and mouse actions. Scripting code appears within the HTML and the displayed page is a
combination of user interface elements and fixed and computed text and images. The scripting code is
reactive to user interaction and there is no need for a main program.
A web server provides a different host environment for server-side computation including objects
representing requests, clients, and files; and mechanisms to lock and share data. By using browser-side and
server-side scripting together, it is possible to distribute computation between the client and server while
providing a customised user interface for a Web-based application.
Each Web browser and server that supports ECMAScript supplies its own host environment, completing
the ECMAScript execution environment.
4.2 Language Overview
The following is an informal overview of ECMAScript—not all parts of the language are described. This
overview is not part of the standard proper.
ECMAScript is object-based: basic language and host facilities are provided by objects, and an
ECMAScript program is a cluster of communicating objects. An ECMAScript object is an unordered
collection of properties each with zero or more attributes that determine how each property can be used—
for example, when the ReadOnly attribute for a property is set to true, any attempt by executed
ECMAScript code to change the value of the property has no effect. Properties are containers that hold
other objects, primitive values, or methods. A primitive value is a member of one of the following built-in
types: Undefined, Null, Boolean, Number, and String; an object is a member of the remaining built-in
type Object; and a method is a function associated with an object via a property.
ECMAScript defines a collection of built-in objects that round out the definition of ECMAScript entities.
These built-in objects include the Global object, the Object object, the Function object, the Array object,
the String object, the Boolean object, the Number object, the Math object, the Date object, the RegExp
object and the Error objects Error, EvalError, RangeError, ReferenceError, SyntaxError, TypeError
and URIError.
ECMAScript also defines a set of built-in operators that may not be, strictly speaking, functions or
methods. ECMAScript operators include various unary operations, multiplicative operators, additive
operators, bitwise shift operators, relational operators, equality operators, binary bitwise operators, binary
logical operators, assignment operators, and the comma operator.
ECMAScript syntax intentionally resembles Java syntax. ECMAScript syntax is relaxed to enable it to
serve as an easy-to-use scripting language. For example, a variable is not required to have its type declared
nor are types associated with properties, and defined functions are not required to have their declarations
appear textually before calls to them.
- 3 -
4.2.1 Objects
ECMAScript does not contain proper classes such as those in C++, Smalltalk, or Java, but rather,
supports constructors which create objects by executing code that allocates storage for the objects and
initialises all or part of them by assigning initial values to their properties. All constructors are objects,
but not all objects are constructors. Each constructor has a Prototype property that is used to implement
prototype-based inheritance and shared properties. Objects are created by using constructors in new
expressions; for example, new String("A String") creates a new String object. Invoking a
constructor without using new has consequences that depend on the constructor. For example,
String("A String") produces a primitive string, not an object.
ECMAScript supports prototype-based inheritance. Every constructor has an associated prototype, and
every object created by that constructor has an implicit reference to the prototype (called the object’s
prototype) associated with its constructor. Furthermore, a prototype may have a non-null implicit
reference to its prototype, and so on; this is called the prototype chain. When a reference is made to a
property in an object, that reference is to the property of that name in the first object in the prototype
chain that contains a property of that name. In other words, first the object mentioned directly is
examined for such a property; if that object contains the named property, that is the property to which
the reference refers; if that object does not contain the named property, the prototype for that object is
examined next; and so on.
In a class-based object-oriented language, in general, state is carried by instances, methods are carried
by classes, and inheritance is only of structure and behaviour. In ECMAScript, the state and methods are
carried by objects, and structure, behaviour, and state are all inherited.
All objects that do not directly contain a particular property that their prototype contains share that
property and its value. The following diagram illustrates this:
prototype Cfp
P1 CFP1 explicit prototype link
P2
CF is a constructor (and also an object). Five objects have been created by using new expressions: cf 1,
cf 2, cf 3, cf 4, and cf 5. Each of these objects contains properties named q1 and q2. The dashed lines
represent the implicit prototype relationship; so, for example, cf3’s prototype is CF p. The constructor,
CF, has two properties itself, named P1 and P2, which are not visible to CF p, cf 1, cf 2, cf 3, cf 4, or cf 5. The
property named CFP1 in CF p is shared by cf 1, cf 2, cf 3, cf 4, and cf 5 (but not by CF), as are any properties
found in CFp’s implicit prototype chain that are not named q1, q2, or CFP1. Notice that there is no
implicit prototype link between CF and CF p.
Unlike class-based object languages, properties can be added to objects dynamically by assigning values
to them. That is, constructors are not required to name or assign values to all or any of the constructed
object’s properties. In the above diagram, one could add a new shared property for cf 1, cf 2, cf 3, cf 4, and
cf 5 by assigning a new value to the property in CFp.
- 4 -
4.3 Definitions
The following are informal definitions of key terms associated with ECMAScript.
4.3.1 Type
A type is a set of data values.
4.3.2 Primitive Value
A primitive value is a member of one of the types Undefined, Null, Boolean, Number, or String. A
primitive value is a datum that is represented directly at the lowest level of the language implementation.
4.3.3 Object
An object is a member of the type Object. It is an unordered collection of properties each of which
contains a primitive value, object, or function. A function stored in a property of an object is called a
method.
4.3.4 Constructor
A constructor is a Function object that creates and initialises objects. Each constructor has an associated
prototype object that is used to implement inheritance and shared properties.
4.3.5 Prototype
A prototype is an object used to implement structure, state, and behaviour inheritance in ECMAScript.
When a constructor creates an object, that object implicitly references the constructor’s associated
prototype for the purpose of resolving property references. The constructor’s associated prototype can be
referenced by the program expression constructor.prototype , and properties added to an object’s
prototype are shared, through inheritance, by all objects sharing the prototype.
4.3.6 Native Object
A native object is any object supplied by an ECMAScript implementation independent of the host
environment. Standard native objects are defined in this specification. Some native objects are built-in;
others may be constructed during the course of execution of an ECMAScript program.
4.3.7 Built-in Object
A built-in object is any object supplied by an ECMAScript implementation, independent of the host
environment, which is present at the start of the execution of an ECMAScript program. Standard built-in
objects are defined in this specification, and an ECMAScript implementation may specify and define
others. Every built-in object is a native object.
4.3.8 Host Object
A host object is any object supplied by the host environment to complete the execution environment of
ECMAScript. Any object that is not native is a host object.
4.3.9 Undefined Value
The undefined value is a primitive value used when a variable has not been assigned a value.
4.3.10 Undefined Type
The type Undefined has exactly one value, called undefined.
4.3.11 Null Value
The null value is a primitive value that represents the null, empty, or non-existent reference.
4.3.12 Null Type
The type Null has exactly one value, called null.
4.3.13 Boolean Value
A boolean value is a member of the type Boolean and is one of two unique values, true and false.
4.3.14 Boolean Type
The type Boolean represents a logical entity and consists of exactly two unique values. One is called
true and the other is called false.
- 5 -
5 Notational Conventions
Productions of the syntactic grammar are distinguished by having just one colon “:” as punctuation.
The syntactic grammar as presented in sections 0, 0, 0 and 0 is actually not a complete account of which
token sequences are accepted as correct ECMAScript programs. Certain additional token sequences are
also accepted, namely, those that would be described by the grammar if only semicolons were added to
the sequence in certain places (such as before line terminator characters). Furthermore, certain token
sequences that are described by the grammar are not considered acceptable if a terminator character
appears in certain “awkward” places.
5.1.5 Grammar Notation
Terminal symbols of the lexical and string grammars, and some of the terminal symbols of the syntactic
grammar, are shown in fixed width font, both in the productions of the grammars and throughout
this specification whenever the text directly refers to such a terminal symbol. These are to appear in a
program exactly as written. All nonterminal characters specified in this way are to be understood as the
appropriate Unicode character from the ASCII range, as opposed to any similar-looking characters from
other Unicode ranges.
Nonterminal symbols are shown in italic type. The definition of a nonterminal is introduced by the name
of the nonterminal being defined followed by one or more colons. (The number of colons indicates to
which grammar the production belongs.) One or more alternative right-hand sides for the nonterminal
then follow on succeeding lines. For example, the syntactic definition:
WithStatement :
with ( Expression ) Statement
states that the nonterminal WithStatement represents the token with, followed by a left parenthesis
token, followed by an Expression, followed by a right parenthesis token, followed by a Statement. The
occurrences of Expression and Statement are themselves nonterminals. As another example, the syntactic
definition:
ArgumentList :
AssignmentExpression
ArgumentList , AssignmentExpression
states that an ArgumentList may represent either a single AssignmentExpression or an ArgumentList,
followed by a comma, followed by an AssignmentExpression. This definition of ArgumentList is
recursive, that is, it is defined in terms of itself. The result is that an ArgumentList may contain any
positive number of arguments, separated by commas, where each argument expression is an
AssignmentExpression. Such recursive definitions of nonterminals are common.
The subscripted suffix “opt”, which may appear after a terminal or nonterminal, indicates an optional
symbol. The alternative containing the optional symbol actually specifies two right-hand sides, one that
omits the optional element and one that includes it. This means that:
VariableDeclaration :
Identifier Initialiseropt
is a convenient abbreviation for:
VariableDeclaration :
Identifier
Identifier Initialiser
and that:
IterationStatement :
for ( ExpressionNoInopt ; Expressionopt ; Expressionopt ) Statement
is a convenient abbreviation for:
IterationStatement :
for ( ; Expressionopt ; Expressionopt ) Statement
for ( ExpressionNoIn ; Expressionopt ; Expressionopt ) Statement
- 8 -
DecimalDigits ::
DecimalDigit
DecimalDigits DecimalDigit
the definition
LookaheadExample ::
n [lookahead ∉ {1, 3, 5, 7, 9}] DecimalDigits
DecimalDigit [lookahead ∉ DecimalDigit ]
matches either the letter n followed by one or more decimal digits the first of which is even, or a
decimal digit not followed by another decimal digit.
If the phrase “[no LineTerminator here]” appears in the right-hand side of a production of the syntactic
grammar, it indicates that the production is a restricted production: it may not be used if a
LineTerminator occurs in the input stream at the indicated position. For example, the production:
ReturnStatement :
return [no LineTerminator here] Expressionopt ;
indicates that the production may not be used if a LineTerminator occurs in the program between the
return token and the Expression.
Unless the presence of a LineTerminator is forbidden by a restricted production, any number of
occurrences of LineTerminator may appear between any two consecutive tokens in the stream of input
elements without affecting the syntactic acceptability of the program.
- 9 -
When the words “one of” follow the colon(s) in a grammar definition, they signify that each of the
terminal symbols on the following line or lines is an alternative definition. For example, the lexical
grammar for ECMAScript contains the production:
NonZeroDigit :: one of
1 2 3 4 5 6 7 8 9
When an alternative in a production of the lexical grammar or the numeric string grammar appears to be
a multi-character token, it represents the sequence of characters that would make up such a token.
The right-hand side of a production may specify that certain expansions are not permitted by using the
phrase “but not” and then indicating the expansions to be excluded. For example, the production:
Identifier ::
IdentifierName but not ReservedWord
means that the nonterminal Identifier may be replaced by any sequence of characters that could replace
IdentifierName provided that the same sequence of characters could not replace ReservedWord.
Finally, a few nonterminal symbols are described by a descriptive phrase in roman type in cases where it
would be impractical to list all the alternatives:
SourceCharacter ::
any Unicode character
The notation “x modulo y” (y must be finite and nonzero) computes a value k of the same sign as y (or zero)
such that abs(k) < abs(y) and x−k = q × y for some integer q.
The mathematical function floor(x) yields the largest integer (closest to positive infinity) that is not larger
than x.
NOTE
floor(x) = x− (x modulo 1).
If an algorithm is defined to “throw an exception”, execution of the algorithm is terminated and no result is
returned. The calling algorithms are also terminated, until an algorithm step is reached that explicitly deals
with the exception, using terminology such as “If an exception was thrown…”. Once such an algorithm
step has been encountered the exception is no longer considered to have occurred.
6. Source Text
ECMAScript source text is represented as a sequence of characters in the Unicode character encoding, version
2.1 or later, using the UTF-16 transformation format. The text is expected to have been normalised to
Unicode Normalised Form C (canonical composition), as described in Unicode Technical Report #15.
Conforming ECMAScript implementations are not required to perform any normalisation of text, or behave as
though they were performing normalisation of text, themselves.
SourceCharacter ::
any Unicode character
ECMAScript source text can contain any of the Unicode characters. All Unicode white space characters are
treated as white space, and all Unicode line/paragraph separators are treated as line separators. Non-Latin
Unicode characters are allowed in identifiers, string literals, regular expression literals and comments.
Throughout the rest of this document, the phrase “code point” and the word “character” will be used to refer
to a 16-bit unsigned value used to represent a single 16-bit unit of UTF-16 text. The phrase “Unicode
character” will be used to refer to the abstract linguistic or typographical unit represented by a single Unicode
scalar value (which may be longer than 16 bits and thus may be represented by more than one code point).
This only refers to entities represented by single Unicode scalar values: the components of a combining
character sequence are still individual “Unicode characters,” even though a user might think of the whole
sequence as a single character.
In string literals, regular expression literals and identifiers, any character (code point) may also be expressed
as a Unicode escape sequence consisting of six characters, namely \u plus four hexadecimal digits. Within a
comment, such an escape sequence is effectively ignored as part of the comment. Within a string literal or
regular expression literal, the Unicode escape sequence contributes one character to the value of the literal.
Within an identifier, the escape sequence contributes one character to the identifier.
NOTE 1
Although this document sometimes refers to a “transformation” between a “character” within a “string” and
the 16-bit unsigned integer that is the UTF-16 encoding of that character, there is actually no transformation
because a “character” within a “string” is actually represented using that 16-bit unsigned value.
NOTE 2
ECMAScript differs from the Java programming language in the behaviour of Unicode escape sequences. In a
Java program, if the Unicode escape sequence \u000A, for example, occurs within a single-line comment, it
is interpreted as a line terminator (Unicode character 000A is line feed) and therefore the next character is
not part of the comment. Similarly, if the Unicode escape sequence \u000A occurs within a string literal in a
Java program, it is likewise interpreted as a line terminator, which is not allowed within a string literal—one
must write \n instead of \u000A to cause a line feed to be part of the string value of a string literal. In an
ECMAScript program, a Unicode escape sequence occurring within a comment is never interpreted and
therefore cannot contribute to termination of the comment. Similarly, a Unicode escape sequence occurring
within a string literal in an ECMAScript program always contributes a character to the string value of the
literal and is never interpreted as a line terminator or as a quote mark that might terminate the string literal.
- 11 -
7 Lexical Conventions
The source text of an ECMAScript program is first converted into a sequence of input elements, which are
either tokens, line terminators, comments, or white space. The source text is scanned from left to right,
repeatedly taking the longest possible sequence of characters as the next input element.
There are two goal symbols for the lexical grammar. The InputElementDiv symbol is used in those syntactic
grammar contexts where a division (/) or division-assignment (/=) operator is permitted. The
InputElementRegExp symbol is used in other syntactic grammar contexts.
Note that contexts exist in the syntactic grammar where both a division and a RegularExpressionLiteral are
permitted by the syntactic grammar; however, since the lexical grammar uses the InputElementDiv goal
symbol in such cases, the opening slash is not recognised as starting a regular expression literal in such a
context. As a workaround, one may enclose the regular expression literal in parentheses.
Syntax
InputElementDiv ::
WhiteSpace
LineTerminator
Comment
Token
DivPunctuator
InputElementRegExp ::
WhiteSpace
LineTerminator
Comment
Token
RegularExpressionLiteral
Syntax
WhiteSpace ::
<TAB>
<VT>
<FF>
<SP>
<NBSP>
<USP>
Syntax
LineTerminator ::
<LF>
<CR>
<LS>
<PS>
7.4 Comments
Description
Comments can be either single or multi-line. Multi-line comments cannot nest.
Because a single-line comment can contain any character except a LineTerminator character, and because
of the general rule that a token is always as long as possible, a single-line comment always consists of all
characters from the // marker to the end of the line. However, the LineTerminator at the end of the line is
not considered to be part of the single-line comment; it is recognised separately by the lexical grammar and
becomes part of the stream of input elements for the syntactic grammar. This point is very important,
because it implies that the presence or absence of single-line comments does not affect the process of
automatic semicolon insertion (7.9).
Comments behave like white space and are discarded except that, if a MultiLineComment contains a line
terminator character, then the entire comment is considered to be a LineTerminator for purposes of parsing
by the syntactic grammar.
Syntax
Comment ::
MultiLineComment
SingleLineComment
- 13 -
MultiLineComment ::
/* MultiLineCommentCharsopt */
MultiLineCommentChars ::
MultiLineNotAsteriskChar MultiLineCommentCharsopt
* PostAsteriskCommentCharsopt
PostAsteriskCommentChars ::
MultiLineNotForwardSlashOrAsteriskChar MultiLineCommentCharsopt
* PostAsteriskCommentCharsopt
MultiLineNotAsteriskChar ::
SourceCharacter but not asterisk *
MultiLineNotForwardSlashOrAsteriskChar ::
SourceCharacter but not forward-slash / or asterisk *
SingleLineComment ::
// SingleLineCommentCharsopt
SingleLineCommentChars ::
SingleLineCommentChar SingleLineCommentCharsopt
SingleLineCommentChar ::
SourceCharacter but not LineTerminator
7.5 Tokens
Syntax
Token ::
ReservedWord
Identifier
Punctuator
NumericLiteral
StringLiteral
Syntax
ReservedWord ::
Keyword
FutureReservedWord
NullLiteral
BooleanLiteral
7.5.2 Keywords
The following tokens are ECMAScript keywords and may not be used as identifiers in ECMAScript
programs.
- 14 -
Syntax
Keyword :: one of
break else new var
case finally return void
catch for switch while
continue function this with
default if throw
delete in try
do instanceof typeof
Syntax
FutureReservedWord :: one of
abstract enum int short
boolean export interface static
byte extends long super
char final native synchronized
class float package throws
const goto private transient
debugger implements protected volatile
double import public
7.6 Identifiers
Description
Identifiers are interpreted according to the grammar given in Section 5.16 of the upcoming version 3.0 of
the Unicode standard, with some small modifications. This grammar is based on both normative and
informative character categories specified by the Unicode standard. The characters in the specified
categories in version 2.1 of the Unicode standard must be treated as in those categories by all conforming
ECMAScript implementations; however, conforming ECMAScript implementations may allow additional
legal identifier characters based on the category assignment from later versions of Unicode.
This standard specifies one departure from the grammar given in the Unicode standard: The dollar sign ($)
and the underscore (_) are permitted anywhere in an identifier. The dollar sign is intended for use only in
mechanically generated code.
Unicode escape sequences are also permitted in identifiers, where they contribute a single character to the
identifier, as computed by the CV of the UnicodeEscapeSequence (see 7.8.4). The \ preceding the
UnicodeEscapeSequence does not contribute a character to the identifier. A UnicodeEscapeSequence
cannot be used to put a character into an identifier that would otherwise be illegal. In other words, if a \
UnicodeEscapeSequence sequence were replaced by its UnicodeEscapeSequence's CV, the result must still
be a valid Identifier that has the exact same sequence of characters as the original Identifier.
Two identifiers that are canonically equivalent according to the Unicode standard are not equal unless they
are represented by the exact same sequence of code points (in other words, conforming ECMAScript
implementations are only required to do bitwise comparison on identifiers). The intent is that the incoming
source text has been converted to normalised form C before it reaches the compiler.
Syntax
Identifier ::
IdentifierName but not ReservedWord
- 15 -
IdentifierName ::
IdentifierStart
IdentifierName IdentifierPart
IdentifierStart ::
UnicodeLetter
$
_
\ UnicodeEscapeSequence
IdentifierPart ::
IdentifierStart
UnicodeCombiningMark
UnicodeDigit
UnicodeConnectorPunctuation
\ UnicodeEscapeSequence
UnicodeLetter
any character in the Unicode categories “Uppercase letter (Lu)”, “Lowercase letter (Ll)”, “Titlecase letter (Lt)”,
“Modifier letter (Lm)”, “Other letter (Lo)”, or “Letter number (Nl)”.
UnicodeCombiningMark
any character in the Unicode categories “Non-spacing mark (Mn)” or “Combining spacing mark (Mc)”
UnicodeDigit
any character in the Unicode category “Decimal number (Nd)”
UnicodeConnectorPunctuation
any character in the Unicode category “Connector punctuation (Pc)”
UnicodeEscapeSequence
see 7.8.4.
HexDigit :: one of
0 1 2 3 4 5 6 7 8 9 a b c d e f A B C D E F
7.7 Punctuators
Syntax
Punctuator :: one of
{ } ( ) [ ]
. ; , < > <=
>= == != === !==
+ - * % ++ --
<< >> >>> & | ^
! ~ && || ? :
= += -= *= %= <<=
>>= >>>= &= |= ^=
DivPunctuator :: one of
/ /=
- 16 -
7.8 Literals
Syntax
Literal ::
NullLiteral
BooleanLiteral
NumericLiteral
StringLiteral
Semantics
The value of the null literal null is the sole value of the Null type, namely null.
7.8.2 Boolean Literals
Syntax
BooleanLiteral ::
true
false
Semantics
The value of the Boolean literal true is a value of the Boolean type, namely true.
The value of the Boolean literal false is a value of the Boolean type, namely false.
7.8.3 Numeric Literals
Syntax
NumericLiteral ::
DecimalLiteral
HexIntegerLiteral
DecimalLiteral ::
DecimalIntegerLiteral . DecimalDigitsopt ExponentPartopt
. DecimalDigits ExponentPartopt
DecimalIntegerLiteral ExponentPartopt
DecimalIntegerLiteral ::
0
NonZeroDigit DecimalDigitsopt
DecimalDigits ::
DecimalDigit
DecimalDigits DecimalDigit
DecimalDigit :: one of
0 1 2 3 4 5 6 7 8 9
NonZeroDigit :: one of
1 2 3 4 5 6 7 8 9
ExponentPart ::
ExponentIndicator SignedInteger
- 17 -
ExponentIndicator :: one of
e E
SignedInteger ::
DecimalDigits
+ DecimalDigits
- DecimalDigits
HexIntegerLiteral ::
0x HexDigit
0X HexDigit
HexIntegerLiteral HexDigit
Semantics
A numeric literal stands for a value of the Number type. This value is determined in two steps: first, a
mathematical value (MV) is derived from the literal; second, this mathematical value is rounded as
described below.
• The MV of NumericLiteral :: DecimalLiteral is the MV of DecimalLiteral.
• The MV of NumericLiteral :: HexIntegerLiteral is the MV of HexIntegerLiteral.
• The MV of DecimalLiteral :: DecimalIntegerLiteral . is the MV of DecimalIntegerLiteral.
• The MV of DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits is the MV of DecimalIntegerLiteral
plus (the MV of DecimalDigits times 10–n), where n is the number of characters in DecimalDigits.
• The MV of DecimalLiteral :: DecimalIntegerLiteral . ExponentPart is the MV of DecimalIntegerLiteral
times 10e, where e is the MV of ExponentPart.
• The MV of DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits ExponentPart is (the MV of
DecimalIntegerLiteral plus (the MV of DecimalDigits times 10–n)) times 10e, where n is the number of
characters in DecimalDigits and e is the MV of ExponentPart.
• The MV of DecimalLiteral ::. DecimalDigits is the MV of DecimalDigits times 10–n, where n is the
number of characters in DecimalDigits.
• The MV of DecimalLiteral ::. DecimalDigits ExponentPart is the MV of DecimalDigits times 10e–n, where
n is the number of characters in DecimalDigits and e is the MV of ExponentPart.
• The MV of DecimalLiteral :: DecimalIntegerLiteral is the MV of DecimalIntegerLiteral.
• The MV of DecimalLiteral :: DecimalIntegerLiteral ExponentPart is the MV of DecimalIntegerLiteral
times 10e, where e is the MV of ExponentPart.
• The MV of DecimalIntegerLiteral :: 0 is 0.
• The MV of DecimalIntegerLiteral :: NonZeroDigit DecimalDigits is (the MV of NonZeroDigit times 10n)
plus the MV of DecimalDigits, where n is the number of characters in DecimalDigits.
• The MV of DecimalDigits :: DecimalDigit is the MV of DecimalDigit.
• The MV of DecimalDigits :: DecimalDigits DecimalDigit is (the MV of DecimalDigits times 10) plus the
MV of DecimalDigit.
• The MV of ExponentPart :: ExponentIndicator SignedInteger is the MV of SignedInteger.
• The MV of SignedInteger :: DecimalDigits is the MV of DecimalDigits.
• The MV of SignedInteger :: + DecimalDigits is the MV of DecimalDigits.
• The MV of SignedInteger :: - DecimalDigits is the negative of the MV of DecimalDigits.
- 18 -
Syntax
StringLiteral ::
" DoubleStringCharactersopt "
' SingleStringCharactersopt '
DoubleStringCharacters ::
DoubleStringCharacter DoubleStringCharactersopt
SingleStringCharacters ::
SingleStringCharacter SingleStringCharactersopt
DoubleStringCharacter ::
SourceCharacter but not double-quote " or backslash \ or LineTerminator
\ EscapeSequence
- 19 -
SingleStringCharacter ::
SourceCharacter but not single-quote ' or backslash \ or LineTerminator
\ EscapeSequence
EscapeSequence ::
CharacterEscapeSequence
0 [lookahead ∉ DecimalDigit]
HexEscapeSequence
UnicodeEscapeSequence
CharacterEscapeSequence ::
SingleEscapeCharacter
NonEscapeCharacter
SingleEscapeCharacter :: one of
' " \ b f n r t v
NonEscapeCharacter ::
SourceCharacter but not EscapeCharacter or LineTerminator
EscapeCharacter ::
SingleEscapeCharacter
DecimalDigit
x
u
HexEscapeSequence ::
x HexDigit HexDigit
UnicodeEscapeSequence ::
u HexDigit HexDigit HexDigit HexDigit
The definitions of the nonterminal HexDigit is given in section 7.8.3. SourceCharacter is described in
sections 2 and 6.
A string literal stands for a value of the String type. The string value (SV) of the literal is described in
terms of character values (CV) contributed by the various parts of the string literal. As part of this
process, some characters within the string literal are interpreted as having a mathematical value (MV), as
described below or in section 7.8.3.
• The SV of StringLiteral :: "" is the empty character sequence.
• The SV of StringLiteral :: '' is the empty character sequence.
• The SV of StringLiteral :: " DoubleStringCharacters " is the SV of DoubleStringCharacters.
• The SV of StringLiteral :: ' SingleStringCharacters ' is the SV of SingleStringCharacters.
• The SV of DoubleStringCharacters :: DoubleStringCharacter is a sequence of one character, the CV of
DoubleStringCharacter.
• The SV of DoubleStringCharacters :: DoubleStringCharacter DoubleStringCharacters is a sequence of the
CV of DoubleStringCharacter followed by all the characters in the SV of DoubleStringCharacters in order.
• The SV of SingleStringCharacters :: SingleStringCharacter is a sequence of one character, the CV of
SingleStringCharacter.
• The SV of SingleStringCharacters :: SingleStringCharacter SingleStringCharacters is a sequence of the CV
of SingleStringCharacter followed by all the characters in the SV of SingleStringCharacters in order.
• The CV of DoubleStringCharacter :: SourceCharacter but not double-quote " or backslash \ or
LineTerminator is the SourceCharacter character itself.
• The CV of DoubleStringCharacter :: \ EscapeSequence is the CV of the EscapeSequence.
• The CV of SingleStringCharacter :: SourceCharacter but not single-quote ' or backslash \ or
LineTerminator is the SourceCharacter character itself.
- 20 -
Syntax
RegularExpressionLiteral ::
/ RegularExpressionBody / RegularExpressionFlags
RegularExpressionBody ::
RegularExpressionFirstChar RegularExpressionChars
RegularExpressionChars ::
[empty]
RegularExpressionChars RegularExpressionChar
RegularExpressionFirstChar ::
NonTerminator but not * or \ or /
BackslashSequence
RegularExpressionChar ::
NonTerminator but not \ or /
BackslashSequence
BackslashSequence ::
\ NonTerminator
NonTerminator ::
SourceCharacter but not LineTerminator
RegularExpressionFlags ::
[empty]
RegularExpressionFlags IdentifierPart
NOTE
Regular expression literals may not be empty; instead of representing an empty regular expression
literal, the characters // start a single-line comment. To specify an empty regular expression, use
/(?:)/.
Semantics
A regular expression literal stands for a value of the Object type. This value is determined in two steps:
first, the characters comprising the regular expression's RegularExpressionBody and
RegularExpressionFlags production expansions are collected uninterpreted into two strings Pattern and
Flags, respectively. Then the new RegExp constructor is called with two arguments Pattern and Flags
and the result becomes the value of the RegularExpressionLiteral. If the call to new RegExp generates
an error, an implementation may, at its discretion, either report the error immediately while scanning the
program, or it may defer the error until the regular expression literal is evaluated in the course of
program execution.
7.9 Automatic Semicolon Insertion
Certain ECMAScript statements (empty statement, variable statement, expression statement, do-while
statement, continue statement, break statement, return statement, and throw statement) must be
terminated with semicolons. Such semicolons may always appear explicitly in the source text. For
convenience, however, such semicolons may be omitted from the source text in certain situations. These
situations are described by saying that semicolons are automatically inserted into the source code token
stream in those situations.
7.9.1 Rules of Automatic Semicolon Insertion
• When, as the program is parsed from left to right, a token (called the offending token) is encountered
that is not allowed by any production of the grammar, then a semicolon is automatically inserted
before the offending token if one or more of the following conditions is true:
- 22 -
1. The offending token is separated from the previous token by at least one LineTerminator.
2. The offending token is }.
• When, as the program is parsed from left to right, the end of the input stream of tokens is
encountered and the parser is unable to parse the input token stream as a single complete
ECMAScript Program, then a semicolon is automatically inserted at the end of the input stream.
• When, as the program is parsed from left to right, a token is encountered that is allowed by some
production of the grammar, but the production is a restricted production and the token would be the
first token for a terminal or nonterminal immediately following the annotation “[no LineTerminator
here] ” within the restricted production (and therefore such a token is called a restricted token), and
the restricted token is separated from the previous token by at least one LineTerminator, then a
semicolon is automatically inserted before the restricted token.
However, there is an additional overriding condition on the preceding rules: a semicolon is never
inserted automatically if the semicolon would then be parsed as an empty statement or if that semicolon
would become one of the two semicolons in the header of a for statement (section 12.6.3).
NOTE
These are the only restricted productions in the grammar:
PostfixExpression :
LeftHandSideExpression [no LineTerminator here] ++
LeftHandSideExpression [no LineTerminator here] --
ContinueStatement :
continue [no LineTerminator here] Identifieropt ;
BreakStatement :
break [no LineTerminator here] Identifieropt ;
ReturnStatement :
return [no LineTerminator here] Expressionopt ;
ThrowStatement :
throw [no LineTerminator here] Expression ;
is not a valid sentence in the ECMAScript grammar, even with the automatic semicolon insertion rules.
In contrast, the source
- 23 -
{ 1
2 } 3
is also not a valid ECMAScript sentence, but is transformed by automatic semicolon insertion into the
following:
{ 1
;2 ;} 3;
is not a valid ECMAScript sentence and is not altered by automatic semicolon insertion because the
semicolon is needed for the header of a for statement. Automatic semicolon insertion never inserts one
of the two semicolons in the header of a for statement.
The source
return
a + b
NOTE
The expression a + b is not treated as a value to be returned by the return statement, because a
'LineTerminator' separates it from the token return.
The source
a = b
++c
NOTE
The token ++ is not treated as a postfix operator applying to the variable b, because a 'LineTerminator'
occurs between b and ++.
The source
if (a > b)
else c = d
is not a valid ECMAScript sentence and is not altered by automatic semicolon insertion before the else
token, even though no production of the grammar applies at that point, because an automatically inserted
semicolon would then be parsed as an empty statement.
The source
a = b + c
(d + e).print()
is not transformed by automatic semicolon insertion, because the parenthesised expression that begins
the second line can be interpreted as an argument list for a function call:
- 24 -
a = b + c(d + e).print()
In the circumstance that an assignment statement must begin with a left parenthesis, it is a good idea for
the programmer to provide an explicit semicolon at the end of the preceding statement rather than to rely
on automatic semicolon insertion.
8 Types
A value is an entity that takes on one of nine types. There are nine types (Undefined, Null, Boolean, String,
Number, Object, Reference, List, and Completion). Values of type Reference, List, and Completion are used
only as intermediate results of expression evaluation and cannot be stored as properties of objects.
8.1 The Undefined Type
The Undefined type has exactly one value, called undefined. Any variable that has not been assigned a
value has the value undefined.
8.2 The Null Type
The Null type has exactly one value, called null.
8.3 The Boolean Type
The Boolean type represents a logical entity having two values, called true and false.
8.4 The String Type
The String type is the set of all finite ordered sequences of zero or more 16-bit unsigned integer values
(“elements”). The String type is generally used to represent textual data in a running ECMAScript program,
in which case each element in the string is treated as a code point value (see section 6). Each element is
regarded as occupying a position within the sequence. These positions are indexed with nonnegative
integers. The first element (if any) is at position 0, the next element (if any) at position 1, and so on. The
length of a string is the number of elements (i.e., 16-bit values) within it. The empty string has length zero
and therefore contains no elements.
When a string contains actual textual data, each element is considered to be a single UTF-16 unit. Whether
or not this is the actual storage format of a String, the characters within a String are numbered as though
they were represented using UTF-16. All operations on Strings (except as otherwise stated) treat them as
sequences of undifferentiated 16-bit unsigned integers; they do not ensure the resulting string is in
normalised form, nor do they ensure language-sensitive results.
NOTE
The rationale behind these decisions was to keep the implementation of Strings as simple and high-
performing as possible. The intent is that textual data coming into the execution environment from outside
(e.g., user input, text read from a file or received over the network, etc.) be converted to Unicode
Normalised Form C before the running program sees it. Usually this would occur at the same time
incoming text is converted from its original character encoding to Unicode (and would impose no
additional overhead). Since it is recommended that ECMAScript source code be in Normalised Form C,
string literals are guaranteed to be normalised (if source text is guaranteed to be normalised), as long as
they do not contain any Unicode escape sequences.
8.5 The Number Type
The Number type has exactly 18437736874454810627 (that is, 2 64−2 53+3) values, representing the double-
precision 64-bit format IEEE 754 values as specified in the IEEE Standard for Binary Floating-Point
Arithmetic, except that the 9007199254740990 (that is, 2 53−2) distinct “Not-a-Number” values of the IEEE
Standard are represented in ECMAScript as a single special NaN value. (Note that the NaN value is
produced by the program expression NaN, assuming that the globally defined variable NaN has not been
altered by program execution.) In some implementations, external code might be able to detect a difference
between various Non-a-Number values, but such behaviour is implementation-dependent; to ECMAScript
code, all NaN values are indistinguishable from each other.
There are two other special values, called positive Infinity and negative Infinity. For brevity, these values
are also referred to for expository purposes by the symbols +∞ and −∞, respectively. (Note that these two
- 25 -
infinite number values are produced by the program expressions +Infinity (or simply Infinity) and
-Infinity, assuming that the globally defined variable Infinity has not been altered by program
execution.)
The other 18437736874454810624 (that is, 2 64−2 53) values are called the finite numbers. Half of these are
positive numbers and half are negative numbers; for every finite positive number there is a corresponding
negative number having the same magnitude.
Note that there is both a positive zero and a negative zero. For brevity, these values are also referred to for
expository purposes by the symbols +0 and −0, respectively. (Note that these two zero number values are
produced by the program expressions +0 (or simply 0) and -0.)
The 18437736874454810622 (that is, 2 64−2 53−2) finite nonzero values are of two kinds:
18428729675200069632 (that is, 2 64−2 54) of them are normalised, having the form
s × m × 2e
where s is +1 or −1, m is a positive integer less than 2 53 but not less than 2 52, and e is an integer ranging
from −1074 to 971, inclusive.
The remaining 9007199254740990 (that is, 2 53−2) values are denormalised, having the form
s × m × 2e
where s is +1 or −1, m is a positive integer less than 2 52, and e is −1074.
Note that all the positive and negative integers whose magnitude is no greater than 2 53 are representable in
the Number type (indeed, the integer 0 has two representations, +0 and -0).
A finite number has an odd significand if it is nonzero and the integer m used to express it (in one of the
two forms shown above) is odd. Otherwise, it has an even significand.
In this specification, the phrase “the number value for x” where x represents an exact nonzero real
mathematical quantity (which might even be an irrational number such as π) means a number value chosen
in the following manner. Consider the set of all finite values of the Number type, with −0 removed and
with two additional values added to it that are not representable in the Number type, namely 2 1024 (which is
+1 × 2 53 × 2 971) and −2 1024 (which is −1 × 2 53 × 2 971). Choose the member of this set that is closest in value
to x. If two values of the set are equally close, then the one with an even significand is chosen; for this
purpose, the two extra values 2 1024 and −2 1024 are considered to have even significands. Finally, if 2 1024 was
chosen, replace it with +∞; if −2 1024 was chosen, replace it with −∞; if +0 was chosen, replace it with −0 if
and only if x is less than zero; any other chosen value is used unchanged. The result is the number value for
x. (This procedure corresponds exactly to the behaviour of the IEEE 754 “round to nearest” mode.)
Some ECMAScript operators deal only with integers in the range −2 31 through 2 31−1, inclusive, or in the
range 0 through 2 32−1, inclusive. These operators accept any value of the Number type but first convert
each such value to one of 2 32 integer values. See the descriptions of the ToInt32 and ToUint32 operators in
sections 0 and 0, respectively.
8.6 The Object Type
An Object is an unordered collection of properties. Each property consists of a name, a value and a set of
attributes.
8.6.1 Property Attributes
A property can have zero or more attributes from the following set:
- 26 -
Attribute Description
ReadOnly The property is a read-only property. Attempts by ECMAScript code to write to
the property will be ignored. (Note, however, that in some cases the value of a
property with the ReadOnly attribute may change over time because of actions
taken by the host environment; therefore “ReadOnly” does not mean “constant
and unchanging”!)
DontEnum The property is not to be enumerated by a for-in enumeration (section 12.6.4).
DontDelete Attempts to delete the property will be ignored. See the description of the
delete operator in section 11.4.1.
Internal Internal properties have no name and are not directly accessible via the property
accessor operators. How these properties are accessed is implementation specific.
How and when some of these properties are used is specified by the language
specification.
Every object (including host objects) must implement the [[Prototype]] and [[Class]] properties and the
[[Get]], [[Put]], [[CanPut]], [[HasProperty]], [[Delete]], and [[DefaultValue]] methods. (Note, however,
that the [[DefaultValue]] method may, for some objects, simply throw a TypeError exception.)
The value of the [[Prototype]] property must be either an object or null, and every [[Prototype]] chain
must have finite length (that is, starting from any object, recursively accessing the [[Prototype]] property
must eventually lead to a null value). Whether or not a native object can have a host object as its
[[Prototype]] depends on the implementation.
The value of the [[Class]] property is defined by this specification for every kind of built-in object. The
value of the [[Class]] property of a host object may be any value, even a value used by a built-in object
for its [[Class]] property. The value of a [[Class]] property is used internally to distinguish different
kinds of built-in objects. Note that this specification does not provide any means for a program to access
that value except through Object.prototype.toString (see 15.2.4.2).
For native objects the [[Get]], [[Put]], [[CanPut]], [[HasProperty]], [[Delete]] and [[DefaultValue]]
methods behave as described in described in 8.6.2.1, 8.6.2.2, 8.6.2.3, 8.6.2.4, 8.6.2.5 and 8.6.2.6,
respectively, except that Array objects have a slightly different implementation of the [[Put]] method
(see 15.4.5.1). Host objects may implement these methods in any manner unless specified otherwise; for
example, one possibility is that [[Get]] and [[Put]] for a particular host object indeed fetch and store
property values but [[HasProperty]] always generates false.
In the following algorithm descriptions, assume O is a native ECMAScript object and P is a string.
8.6.2.1 [[Get]] (P)
When the [[Get]] method of O is called with property name P, the following steps are taken:
1. If O doesn’t have a property with name P, go to step 4.
2. Get the value of the property.
3. Return Result(2).
- 28 -
When the [[DefaultValue]] method of O is called with hint Number, the following steps are taken:
1. Call the [[Get]] method of object O with argument "valueOf".
- 29 -
When the [[DefaultValue]] method of O is called with no hint, then it behaves as if the hint were
Number, unless O is a Date object (see 15.9), in which case it behaves as if the hint were String.
The above specification of [[DefaultValue]] for native objects can return only primitive values. If a
host object implements its own [[DefaultValue]] method, it must ensure that its [[DefaultValue]]
method can return only primitive values.
8.7 The Reference Type
The internal Reference type is not a language data type. It is defined by this specification purely for
expository purposes. An implementation of ECMAScript must behave as if it produced and operated upon
references in the manner described here. However, a value of type Reference is used only as an
intermediate result of expression evaluation and cannot be stored as the value of a variable or property.
The Reference type is used to explain the behaviour of such operators as delete, typeof, and the
assignment operators. For example, the left-hand operand of an assignment is expected to produce a
reference. The behaviour of assignment could, instead, be explained entirely in terms of a case analysis on
the syntactic form of the left-hand operand of an assignment operator, but for one difficulty: function calls
are permitted to return references. This possibility is admitted purely for the sake of host objects. No built-
in ECMAScript function defined by this specification returns a reference and there is no provision for a
user-defined function to return a reference. (Another reason not to use a syntactic case analysis is that it
would be lengthy and awkward, affecting many parts of the specification.)
Another use of the Reference type is to explain the determination of the this value for a function call.
A Reference is a reference to a property of an object. A Reference consists of two components, the base
object and the property name.
The following abstract operations are used in this specification to access the components of references:
• GetBase(V). Returns the base object component of the reference V.
• GetPropertyName(V). Returns the property name component of the reference V.
The following abstract operations are used in this specification to operate on references:
8.7.1 GetValue (V)
1. If Type(V) is not Reference, return V.
2. Call GetBase(V).
3. If Result(2) is null, throw a ReferenceError exception.
4. Call the [[Get]] method of Result(2), passing GetPropertyName(V) for the property name.
5. Return Result(4).
9 Type Conversion
The ECMAScript runtime system performs automatic type conversion as needed. To clarify the semantics of
certain constructs it is useful to define a set of conversion operators. These operators are not a part of the
language; they are defined here to aid the specification of the semantics of the language. The conversion
operators are polymorphic; that is, they can accept a value of any standard type, but not of type Reference,
List, or Completion (the internal types).
9.1 ToPrimitive
The operator ToPrimitive takes a Value argument and an optional argument PreferredType. The operator
ToPrimitive converts its value argument to a non-Object type. If an object is capable of converting to more
than one primitive type, it may use the optional hint PreferredType to favour that type. Conversion occurs
according to the following table:
9.2 ToBoolean
The operator ToBoolean converts its argument to a value of type Boolean according to the following table:
- 31 -
String The result is false if the argument is the empty string (its length is zero); otherwise
the result is true.
Object true
9.3 ToNumber
The operator ToNumber converts its argument to a value of type Number according to the following table:
StrWhiteSpace :::
StrWhiteSpaceChar StrWhiteSpaceopt
StrWhiteSpaceChar :::
<TAB>
<SP>
<NBSP>
<FF>
<VT>
<CR>
<LF>
<LS>
<PS>
<USP>
StrNumericLiteral :::
StrDecimalLiteral
HexIntegerLiteral
- 32 -
StrDecimalLiteral :::
StrUnsignedDecimalLiteral
+ StrUnsignedDecimalLiteral
- StrUnsignedDecimalLiteral
StrUnsignedDecimalLiteral :::
Infinity
DecimalDigits . DecimalDigitsopt ExponentPartopt
. DecimalDigits ExponentPartopt
DecimalDigits ExponentPartopt
DecimalDigits :::
DecimalDigit
DecimalDigits DecimalDigit
ExponentPart :::
ExponentIndicator SignedInteger
SignedInteger :::
DecimalDigits
+ DecimalDigits
- DecimalDigits
HexIntegerLiteral :::
0x HexDigit
0X HexDigit
HexIntegerLiteral HexDigit
Some differences should be noted between the syntax of a StringNumericLiteral and a NumericLiteral
(see 7.8.3):
• A StringNumericLiteral may be preceded and/or followed by white space and/or line terminators.
• A StringNumericLiteral that is decimal may have any number of leading 0 digits.
• A StringNumericLiteral that is decimal may be preceded by + or - to indicate its sign.
• A StringNumericLiteral that is empty or contains only white space is converted to +0.
The conversion of a string to a number value is similar overall to the determination of the number value
for a numeric literal (see 7.8.3), but some of the details are different, so the process for converting a
string numeric literal to a value of Number type is given here in full. This value is determined in two
steps: first, a mathematical value (MV) is derived from the string numeric literal; second, this
mathematical value is rounded as described below.
• The MV of StringNumericLiteral ::: [empty] is 0.
• The MV of StringNumericLiteral ::: StrWhiteSpace is 0.
• The MV of StringNumericLiteral ::: StrWhiteSpaceopt StrNumericLiteral StrWhiteSpaceopt is the MV
of StrNumericLiteral, no matter whether white space is present or not.
• The MV of StrNumericLiteral ::: StrDecimalLiteral is the MV of StrDecimalLiteral.
• The MV of StrNumericLiteral ::: HexIntegerLiteral is the MV of HexIntegerLiteral.
- 33 -
• The MV of HexIntegerLiteral ::: HexIntegerLiteral HexDigit is (the MV of HexIntegerLiteral times 16) plus
the MV of HexDigit.
Once the exact MV for a string numeric literal has been determined, it is then rounded to a value of the
Number type. If the MV is 0, then the rounded value is +0 unless the first non white space character in
the string numeric literal is ‘-’, in which case the rounded value is −0. Otherwise, the rounded value
must be the number value for the MV (in the sense defined in 8.5), unless the literal includes a
StrUnsignedDecimalLiteral and the literal has more than 20 significant digits, in which case the number
value may be either the number value for the MV of a literal produced by replacing each significant digit
after the 20th with a 0 digit or the number value for the MV of a literal produced by replacing each
significant digit after the 20th with a 0 digit and then incrementing the literal at the 20th digit position.
A digit is significant if it is not part of an ExponentPart and
• it is not 0; or
• there is a nonzero digit to its left and there is a nonzero digit, not in the ExponentPart, to its right.
9.4 ToInteger
The operator ToInteger converts its argument to an integral numeric value. This operator functions as
follows:
1. Call ToNumber on the input argument.
2. If Result(1) is NaN, return +0.
3. If Result(1) is +0, −0, +∞, or −∞, return Result(1).
4. Compute sign(Result(1)) * floor(abs(Result(1))).
5. Return Result(4).
9.5 ToInt32: (Signed 32 Bit Integer)
The operator ToInt32 converts its argument to one of 232 integer values in the range −2 31 through 2 31−1,
inclusive. This operator functions as follows:
1. Call ToNumber on the input argument.
2. If Result(1) is NaN, +0, −0, +∞, or −∞, return +0.
3. Compute sign(Result(1)) * floor(abs(Result(1))).
4. Compute Result(3) modulo 2 32; that is, a finite integer value k of Number type with positive sign and
less than 2 32 in magnitude such the mathematical difference of Result(3) and k is mathematically an
integer multiple of 2 32.
5. If Result(4) is greater than or equal to 2 31, return Result(4)− 2 32, otherwise return Result(4).
NOTE
Given the above definition of ToInt32:
The ToInt32 operation is idempotent: if applied to a result that it produced, the second application leaves that value
unchanged.
ToInt32(ToUint32(x)) is equal to ToInt32(x) for all values of x. (It is to preserve this latter property that +∞ and −∞
are mapped to +0.)
ToInt32 maps − 0 to +0.
NOTE
Given the above definition of ToUInt32:
Step 5 is the only difference between ToUint32 and ToInt32.
The ToUint32 operation is idempotent: if applied to a result that it produced, the second application leaves that value
unchanged.
ToUint32(ToInt32(x)) is equal to ToUint32(x) for all values of x. (It is to preserve this latter property that +∞ and −∞
are mapped to +0.)
ToUint32 maps − 0 to +0.
NOTE
Given the above definition of ToUint16:
The substitution of 2 16 for 2 32 in step 4 is the only difference between ToUint32 and ToUint16.
ToUint16 maps − 0 to +0.
9.8 ToString
The operator ToString converts its argument to a value of type String according to the following table:
that s is not divisible by 10, and that the least significant digit of s is not necessarily uniquely
determined by these criteria.
6. If k ≤ n ≤ 21, return the string consisting of the k digits of the decimal representation of s (in order,
with no leading zeroes), followed by n − k occurrences of the character ‘0’.
7. If 0 < n ≤ 21, return the string consisting of the most significant n digits of the decimal representation
of s, followed by a decimal point ‘.’, followed by the remaining k− n digits of the decimal
representation of s.
8. If −6 < n ≤ 0, return the string consisting of the character ‘0’, followed by a decimal point ‘.’,
followed by −n occurrences of the character ‘0’, followed by the k digits of the decimal
representation of s.
9. Otherwise, if k = 1, return the string consisting of the single digit of s, followed by lowercase
character ‘e’, followed by a plus sign ‘+’ or minus sign ‘−’ according to whether n−1 is positive or
negative, followed by the decimal representation of the integer abs(n−1) (with no leading zeros).
10. Return the string consisting of the most significant digit of the decimal representation of s, followed
by a decimal point ‘.’, folloarwed by the remaining k−1 digits of the decimal representation of s,
followed by the lowercase character ‘e’, followed by a plus sign ‘+’ or minus sign ‘−’ according to
whether n−1 is positive or negative, followed by the decimal representation of the integer abs(n−1)
(with no leading zeros).
NOTE
The following observations may be useful as guidelines for implementations, but are not part of the normative
requirements of this Standard:
If x is any number value other than − 0, then ToNumber(ToString(x)) is exactly the same number value as x.
The least significant digit of s is not always uniquely determined by the requirements listed in step 5.
For implementations that provide more accurate conversions than required by the rules above, it is recommended
that the following alternative version of step 5 be used as a guideline:
Otherwise, let n, k, and s be integers such that k ≥ 1, 10 k − 1 ≤ s < 10 k , the number value for s × 10 n − k is m, and k
is as small as possible. If there are multiple possibilities for s, choose the value of s for which s × 10 n − k is closest
in value to m. If there are two such possible values of s, choose the one that is even. Note that k is the number of
digits in the decimal representation of s and that s is not divisible by 10.
Implementors of ECMAScript may find useful the paper and code written by David M. Gay for binary-to-decimal
conversion of floating-point numbers:
Gay, David M. Correctly Rounded Binary-Decimal and Decimal-Binary Conversions. Numerical Analysis
Manuscript 90-10. AT&T Bell Laboratories (Murray Hill, New Jersey). November 30, 1990. Available as
http://cm.bell-labs.com/cm/cs/doc/90/4-10.ps.gz. Associated code available as
http://cm.bell-labs.com/netlib/fp/dtoa.c.gz and as http://cm.bell-
labs.com/netlib/fp/g_fmt.c.gz and may also be found at the various netlib mirror sites.
9.9 ToObject
The operator ToObject converts its argument to a value of type Object according to the following table:
Input Type Result
Undefined Throw a TypeError exception.
Null Throw a TypeError exception.
Boolean Create a new Boolean object whose [[value]] property is set to the value of the
boolean. See 15.6 for a description of Boolean objects.
Number Create a new Number object whose [[value]] property is set to the value of the
number. See 15.7 for a description of Number objects.
String Create a new String object whose [[value]] property is set to the value of the
string. See 15.5 for a description of String objects.
Object The result is the input argument (no conversion).
- 37 -
10 Execution Contexts
When control is transferred to ECMAScript executable code, control is entering an execution context. Active
execution contexts logically form a stack. The top execution context on this logical stack is the running
execution context.
10.1 Definitions
10.1.1 Function Objects
There are two types of Function objects:
• Program functions are defined in source text by a FunctionDeclaration or created dynamically either
by using a FunctionExpression or by using the built-in Function object as a constructor.
• Internal functions are built-in objects of the language, such as parseInt and Math.exp. An
implementation may also provide implementation-dependent internal functions that are not described
in this specification. These functions do not contain executable code defined by the ECMAScript
grammar, so they are excluded from this discussion of execution contexts.
• Eval code is the source text supplied to the built-in eval function. More precisely, if the parameter
to the built-in eval function is a string, it is treated as an ECMAScript Program. The eval code for a
particular invocation of eval is the global code portion of the string parameter.
• Function code is source text that is parsed as part of a FunctionBody. The function code of a
particular FunctionBody does not include any source text that is parsed as part of a nested
FunctionBody. Function code also denotes the source text supplied when using the built-in
Function object as a constructor. More precisely, the last parameter provided to the Function
constructor is converted to a string and treated as the FunctionBody. If more than one parameter is
provided to the Function constructor, all parameters except the last one are converted to strings
and concatenated together, separated by commas. The resulting string is interpreted as the
FormalParameterList for the FunctionBody defined by the last parameter. The function code for a
particular instantiation of a Function does not include any source text that is parsed as part of a
nested FunctionBody.
• For each FunctionDeclaration in the code, in source text order, create a property of the variable
object whose name is the Identifier in the FunctionDeclaration, whose value is the result returned by
- 38 -
creating a Function object as described in 13, and whose attributes are determined by the type of
code. If the variable object already has a property with this name, replace its value and attributes.
Semantically, this step must follow the creation of FormalParameterList properties.
The result of evaluating an identifier is always a value of type Reference with its member name
component equal to the identifier string.
10.1.5 Global Object
There is a unique global object (15.1), which is created before control enters any execution context.
Initially the global object has the following properties:
• Built-in objects such as Math, String, Date, parseInt, etc. These have attributes { DontEnum }.
• Additional host defined properties. This may include a property whose value is the global object
itself; for example, in the HTML document object model the window property of the global object is
the global object itself.
As control enters execution contexts, and as ECMAScript code is executed, additional properties may be
added to the global object and the initial properties may be changed.
10.1.6 Activation Object
When control enters an execution context for function code, an object called the activation object is
created and associated with the execution context. The activation object is initialised with a property
with name arguments and attributes { DontDelete }. The initial value of this property is the arguments
object described below.
The activation object is then used as the variable object for the purposes of variable instantiation.
The activation object is purely a specification mechanism. It is impossible for an ECMAScript program
to access the activation object. It can access members of the activation object, but not the activation
object itself. When the call operation is applied to a Reference value whose base object is an activation
object, null is used as the this value of the call.
- 39 -
10.1.7 This
There is a this value associated with every active execution context. The this value depends on the caller
and the type of code being executed and is determined when control enters the execution context. The
this value associated with an execution context is immutable.
10.1.8 Arguments Object
When control enters an execution context for function code, an arguments object is created and
initialised as follows:
• The value of the internal [[Prototype]] property of the arguments object is the original Object
prototype object, the one that is the initial value of Object.prototype (see 15.2.3.1).
• A property is created with name callee and property attributes { DontEnum }. The initial value of
this property is the Function object being executed. This allows anonymous functions to be recursive.
• A property is created with name length and property attributes { DontEnum }. The initial value of
this property is the number of actual parameter values supplied by the caller.
• For each non-negative integer, arg, less than the value of the length property, a property is created
with name ToString(arg) and property attributes { DontEnum }. The initial value of this property is
the value of the corresponding actual parameter supplied by the caller. The first actual parameter
value corresponds to arg = 0, the second to arg = 1, and so on. In the case when arg is less than the
number of formal parameters for the Function object, this property shares its value with the
corresponding property of the activation object. This means that changing this property changes the
corresponding property of the activation object and vice versa.
• Variable instantiation is performed using the activation object as the variable object and using
property attributes { DontDelete }.
• The caller provides the this value. If the this value provided by the caller is not an object (including
the case where it is null), then the this value is the global object.
11 Expressions
11.1 Primary Expressions
Syntax
PrimaryExpression :
this
Identifier
Literal
ArrayLiteral
ObjectLiteral
( Expression )
Syntax
ArrayLiteral :
[ Elisionopt ]
[ ElementList ]
[ ElementList , Elisionopt ]
ElementList :
Elisionopt AssignmentExpression
ElementList , Elisionopt AssignmentExpression
Elision :
,
Elision ,
Semantics
The production ArrayLiteral : [ Elision opt ] is evaluated as follows:
1. Create a new array as if by the expression new Array().
- 41 -
Syntax
ObjectLiteral :
{ }
{ PropertyNameAndValueList }
PropertyNameAndValueList :
PropertyName : AssignmentExpression
PropertyNameAndValueList , PropertyName : AssignmentExpression
- 42 -
PropertyName :
Identifier
StringLiteral
NumericLiteral
Semantics
The production ObjectLiteral : { } is evaluated as follows:
1. Create a new object as if by the expression new Object().
2. Return Result(1).
The production
PropertyNameAndValueList : PropertyName : AssignmentExpression
is evaluated as follows:
1. Create a new object as if by the expression new Object().
2. Evaluate PropertyName.
3. Evaluate AssignmentExpression.
4. Call GetValue(Result(3)).
5. Call the [[Put]] method of Result(1) with arguments Result(2) and Result(4).
6. Return Result(1).
The production
PropertyNameAndValueList : PropertyNameAndValueList , PropertyName : AssignmentExpression
is evaluated as follows:
1. Evaluate PropertyNameAndValueList.
2. Evaluate PropertyName.
3. Evaluate AssignmentExpression.
4. Call GetValue(Result(3)).
5. Call the [[Put]] method of Result(1) with arguments Result(2) and Result(4).
6. Return Result(1).
NOTE
This algorithm does not apply GetValue to Result(1). The principal motivation for this is so that
operators such as delete and typeof may be applied to parenthesised expressions.
11.2 Left-Hand-Side Expressions
Syntax
MemberExpression :
PrimaryExpression
FunctionExpression
MemberExpression [ Expression ]
MemberExpression . Identifier
new MemberExpression Arguments
NewExpression :
MemberExpression
new NewExpression
CallExpression :
MemberExpression Arguments
CallExpression Arguments
CallExpression [ Expression ]
CallExpression . Identifier
Arguments :
()
( ArgumentList )
ArgumentList :
AssignmentExpression
ArgumentList , AssignmentExpression
LeftHandSideExpression :
NewExpression
CallExpression
and similarly
CallExpression . Identifier
CallExpression [ <identifier-string> ]
where <identifier-string> is a string literal containing the same sequence of characters as the Identifier.
The production MemberExpression : MemberExpression [ Expression ] is evaluated as follows:
1. Evaluate MemberExpression.
2. Call GetValue(Result(1)).
3. Evaluate Expression.
4. Call GetValue(Result(3)).
5. Call ToObject(Result(2)).
6. Call ToString(Result(4)).
7. Return a value of type Reference whose base object is Result(5) and whose property name is
Result(6).
The production CallExpression : CallExpression Arguments is evaluated in exactly the same manner,
except that the contained CallExpression is evaluated in step 1.
NOTE
Result(8) will never be of type Reference if Result(3) is a native ECMAScript object. Whether calling a
host object can return a value of type Reference is implementation-dependent.
- 45 -
Type Result
Undefined "undefined"
Null "object"
Boolean "boolean"
Number "number"
String "string"
Object (native and "object"
doesn’t implement
[[Call]])
Object (native and "function"
implements [[Call]])
Object (host) Implementation-dependent
Semantics
The production MultiplicativeExpression : MultiplicativeExpression @ UnaryExpression, where @ stands
for one of the operators in the above definitions, is evaluated as follows:
1. Evaluate MultiplicativeExpression.
2. Call GetValue(Result(1)).
3. Evaluate UnaryExpression.
4. Call GetValue(Result(3)).
5. Call ToNumber(Result(2)).
6. Call ToNumber(Result(4)).
7. Apply the specified operation (*, /, or %) to Result(5) and Result(6). See the notes below (11.5.1,
11.5.2, 11.5.3).
8. Return Result(7).
• In the remaining cases, where neither an infinity or NaN is involved, the product is computed and
rounded to the nearest representable value using IEEE 754 round-to-nearest mode. If the magnitude
is too large to represent, the result is then an infinity of appropriate sign. If the magnitude is too
small to represent, the result is then a zero of appropriate sign. The ECMAScript language requires
support of gradual underflow as defined by IEEE 754.
• In the remaining cases, where neither an infinity, nor a zero, nor NaN is involved, the floating-point
remainder r from a dividend n and a divisor d is defined by the mathematical relation r = n − (d * q)
where q is an integer that is negative only if n/d is negative and positive only if n/d is positive, and
whose magnitude is as large as possible without exceeding the magnitude of the true mathematical
quotient of n and d.
NOTE
No hint is provided in the calls to ToPrimitive in steps 5 and 6. All native ECMAScript objects except
Date objects handle the absence of a hint as if the hint Number were given; Date objects handle the
absence of a hint as if the hint String were given. Host objects may handle the absence of a hint in some
other manner.
11.6.2 The Subtraction Operator ( - )
The production AdditiveExpression : AdditiveExpression - MultiplicativeExpression is evaluated as
follows:
1. Evaluate AdditiveExpression.
2. Call GetValue(Result(1)).
3. Evaluate MultiplicativeExpression.
4. Call GetValue(Result(3)).
5. Call ToNumber(Result(2)).
6. Call ToNumber(Result(4)).
7. Apply the subtraction operation to Result(5) and Result(6). See the note below (11.6.3).
8. Return Result(7).
- 51 -
The - operator performs subtraction when applied to two operands of numeric type, producing the
difference of its operands; the left operand is the minuend and the right operand is the subtrahend. Given
numeric operands a and b, it is always the case that a–b produces the same result as a+(–b).
11.7 Bitwise Shift Operators
Syntax
ShiftExpression :
AdditiveExpression
ShiftExpression << AdditiveExpression
ShiftExpression >> AdditiveExpression
ShiftExpression >>> AdditiveExpression
3. Evaluate AdditiveExpression.
4. Call GetValue(Result(3)).
5. Call ToInt32(Result(2)).
6. Call ToUint32(Result(4)).
7. Mask out all but the least significant 5 bits of Result(6), that is, compute Result(6) & 0x1F.
8. Perform sign-extending right shift of Result(5) by Result(7) bits. The most significant bit is
propagated. The result is a signed 32 bit integer.
9. Return Result(8).
RelationalExpressionNoIn :
ShiftExpression
RelationalExpressionNoIn < ShiftExpression
RelationalExpressionNoIn > ShiftExpression
RelationalExpressionNoIn <= ShiftExpression
RelationalExpressionNoIn >= ShiftExpression
RelationalExpressionNoIn instanceof ShiftExpression
NOTE
The 'NoIn' variants are needed to avoid confusing the in operator in a relational expression with the in
operator in a for statement.
Semantics
The result of evaluating a relational operator is always of type Boolean, reflecting whether the relationship
named by the operator holds between its two operands.
The RelationalExpressionNoIn productions are evaluated in the same manner as the RelationalExpression
productions except that the contained RelationalExpressionNoIn is evaluated instead of the contained
RelationalExpression.
- 53 -
EqualityExpressionNoIn :
RelationalExpressionNoIn
EqualityExpressionNoIn == RelationalExpressionNoIn
EqualityExpressionNoIn != RelationalExpressionNoIn
EqualityExpressionNoIn === RelationalExpressionNoIn
EqualityExpressionNoIn !== RelationalExpressionNoIn
Semantics
The result of evaluating an equality operator is always of type Boolean, reflecting whether the relationship
named by the operator holds between its two operands.
The EqualityExpressionNoIn productions are evaluated in the same manner as the EqualityExpression
productions except that the contained EqualityExpressionNoIn and RelationalExpressionNoIn are evaluated
instead of the contained EqualityExpression and RelationalExpression, respectively.
11.9.1 The Equals Operator ( == )
The production EqualityExpression : EqualityExpression == RelationalExpression is evaluated as
follows:
1. Evaluate EqualityExpression.
2. Call GetValue(Result(1)).
3. Evaluate RelationalExpression.
4. Call GetValue(Result(3)).
5. Perform the comparison Result(4) == Result(2). (see 11.9.3).
6. Return Result(5).
BitwiseANDExpressionNoIn :
EqualityExpressionNoIn
BitwiseANDExpressionNoIn & EqualityExpressionNoIn
BitwiseXORExpression :
BitwiseANDExpression
BitwiseXORExpression ^ BitwiseANDExpression
BitwiseXORExpressionNoIn :
BitwiseANDExpressionNoIn
BitwiseXORExpressionNoIn ^ BitwiseANDExpressionNoIn
BitwiseORExpression :
BitwiseXORExpression
BitwiseORExpression | BitwiseXORExpression
BitwiseORExpressionNoIn :
BitwiseXORExpressionNoIn
BitwiseORExpressionNoIn | BitwiseXORExpressionNoIn
Semantics
The production A : A @ B, where @ is one of the bitwise operators in the productions above, is evaluated
as follows:
1. Evaluate A.
2. Call GetValue(Result(1)).
3. Evaluate B.
4. Call GetValue(Result(3)).
5. Call ToInt32(Result(2)).
6. Call ToInt32(Result(4)).
7. Apply the bitwise operator @ to Result(5) and Result(6). The result is a signed 32 bit integer.
8. Return Result(7).
- 58 -
LogicalANDExpressionNoIn :
BitwiseORExpressionNoIn
LogicalANDExpressionNoIn && BitwiseORExpressionNoIn
LogicalORExpression :
LogicalANDExpression
LogicalORExpression || LogicalANDExpression
LogicalORExpressionNoIn :
LogicalANDExpressionNoIn
LogicalORExpressionNoIn || LogicalANDExpressionNoIn
Semantics
The production LogicalANDExpression : LogicalANDExpression && BitwiseORExpression is evaluated as
follows:
1. Evaluate LogicalANDExpression.
2. Call GetValue(Result(1)).
3. Call ToBoolean(Result(2)).
4. If Result(3) is false, return Result(2).
5. Evaluate BitwiseORExpression.
6. Call GetValue(Result(5)).
7. Return Result(6).
ConditionalExpressionNoIn :
LogicalORExpressionNoIn
LogicalORExpressionNoIn ? AssignmentExpression : AssignmentExpressionNoIn
Semantics
The production ConditionalExpression : LogicalORExpression ? AssignmentExpression :
AssignmentExpression is evaluated as follows:
1. Evaluate LogicalORExpression.
2. Call GetValue(Result(1)).
3. Call ToBoolean(Result(2)).
4. If Result(3) is false, go to step 8.
5. Evaluate the first AssignmentExpression.
6. Call GetValue(Result(5)).
7. Return Result(6).
8. Evaluate the second AssignmentExpression.
9. Call GetValue(Result(8)).
10. Return Result(9).
AssignmentExpressionNoIn :
ConditionalExpressionNoIn
LeftHandSideExpression AssignmentOperator AssignmentExpressionNoIn
AssignmentOperator : one of
= *= /= %= += -= <<= >>= >>>= &= ^= |=
Semantics
The AssignmentExpressionNoIn productions are evaluated in the same manner as the AssignmentExpression
productions except that the contained ConditionalExpressionNoIn and AssignmentExpressionNoIn are
evaluated instead of the contained ConditionalExpression and AssignmentExpression, respectively.
11.13.1 Simple Assignment ( = )
The production AssignmentExpression : LeftHandSideExpression = AssignmentExpression is evaluated
as follows:
1. Evaluate LeftHandSideExpression.
2. Evaluate AssignmentExpression.
3. Call GetValue(Result(2)).
4. Call PutValue(Result(1), Result(3)).
- 60 -
5. Return Result(3).
ExpressionNoIn :
AssignmentExpressionNoIn
ExpressionNoIn , AssignmentExpressionNoIn
Semantics
The production Expression : Expression , AssignmentExpression is evaluated as follows:
1. Evaluate Expression.
2. Call GetValue(Result(1)).
3. Evaluate AssignmentExpression.
4. Call GetValue(Result(3)).
5. Return Result(4).
The ExpressionNoIn production is evaluated in the same manner as the Expression production except that
the contained ExpressionNoIn and AssignmentExpressionNoIn are evaluated instead of the contained
Expression and AssignmentExpression, respectively.
- 61 -
12 Statements
Syntax
Statement :
Block
VariableStatement
EmptyStatement
ExpressionStatement
IfStatement
IterationStatement
ContinueStatement
BreakStatement
ReturnStatement
WithStatement
LabelledStatement
SwitchStatement
ThrowStatement
TryStatement
Semantics
A Statement can be part of a LabelledStatement, which itself can be part of a LabelledStatement, and so on.
The labels introduced this way are collectively referred to as the “current label set” when describing the
semantics of individual statements. A LabelledStatement has no semantic meaning other than the introduction
of a label to a label set. The label set of an IterationStatement or a SwitchStatement initially contains the
single element empty. The label set of any other statement is initially empty.
12.1 Block
Syntax
Block :
{ StatementListopt }
StatementList :
Statement
StatementList Statement
Semantics
The production Block : { } is evaluated as follows:
1. Return (normal, empty, empty).
4. If an exception was thrown, return (throw, V, empty) where V is the exception. (Execution now
proceeds as if no exception were thrown.)
5. If Result(3).value is empty, let V = Result(1).value, otherwise let V = Result(3).value.
6. Return (Result(3).type, V, Result(3).target).
VariableDeclarationList :
VariableDeclaration
VariableDeclarationList , VariableDeclaration
VariableDeclarationListNoIn :
VariableDeclarationNoIn
VariableDeclarationListNoIn , VariableDeclarationNoIn
VariableDeclaration :
Identifier Initialiseropt
VariableDeclarationNoIn :
Identifier InitialiserNoInopt
Initialiser :
= AssignmentExpression
InitialiserNoIn :
= AssignmentExpressionNoIn
Description
If the variable statement occurs inside a FunctionDeclaration, the variables are defined with function-local
scope in that function, as described in s10.1.3. Otherwise, they are defined with global scope (that is, they
are created as members of the global object, as described in 10.1.3) using property attributes { DontDelete
}. Variables are created when the execution scope is entered. A Block does not define a new execution
scope. Only Program and FunctionDeclaration produce a new scope. Variables are initialised to undefined
when created. A variable with an Initialiser is assigned the value of its AssignmentExpression when the
VariableStatement is executed, not when the variable is created.
Semantics
The production VariableStatement : var VariableDeclarationList ; is evaluated as follows:
1. Evaluate VariableDeclarationList.
2. Return (normal, empty, empty).
Semantics
The production EmptyStatement : ; is evaluated as follows:
1. Return (normal, empty, empty).
Note that an ExpressionStatement cannot start with an opening curly brace because that might make it
ambiguous with a Block. Also, an ExpressionStatement cannot start with the function keyword because
that might make it ambiguous with a FunctionDeclaration.
Semantics
The production ExpressionStatement : [lookahead ∉ {{, function}] Expression; is evaluated as follows:
1. Evaluate Expression.
2. Call GetValue(Result(1)).
3. Return (normal, Result(2), empty).
Each else for which the choice of associated if is ambiguous shall be associated with the nearest
possible if that would otherwise have no corresponding else.
Semantics
The production IfStatement : if ( Expression ) Statement else Statement is evaluated as follows:
- 64 -
1. Evaluate Expression.
2. Call GetValue(Result(1)).
3. Call ToBoolean(Result(2)).
4. If Result(3) is false, go to step 7.
5. Evaluate the first Statement.
6. Return Result(5).
7. Evaluate the second Statement.
8. Return Result(7).
Syntax
IterationStatement :
do Statement while ( Expression );
while ( Expression ) Statement
for (ExpressionNoInopt; Expressionopt ; Expressionopt ) Statement
for ( var VariableDeclarationListNoIn; Expressionopt ; Expressionopt ) Statement
for ( LeftHandSideExpression in Expression ) Statement
for ( var VariableDeclarationNoIn in Expression ) Statement
9. If Result(6).type is break and Result(6).target is in the current label set, return (normal, V, empty).
10. If Result(6) is an abrupt completion, return Result(6).
11. Go to step 2.
5. Get the name of the next property of Result(3) that doesn’t have the DontEnum attribute. If there is
no such property, go to step 14.
6. Evaluate the LeftHandSideExpression ( it may be evaluated repeatedly).
7. Call PutValue(Result(6), Result(5)).
8. Evaluate Statement.
9. If Result(8).value is not empty, let V = Result(8).value.
10. If Result(8).type is break and Result(8).target is in the current label set, go to step 14.
11. If Result(8).type is continue and Result(8).target is in the current label set, go to step 5.
12. If Result(8) is an abrupt completion, return Result(8).
13. Go to step 5.
14. Return (normal, V, empty).
1. Evaluate VariableDeclarationNoIn.
2. Evaluate Expression.
3. Call GetValue(Result(2)).
4. Call ToObject(Result(3)).
5. Let V = empty.
6. Get the name of the next property of Result(4) that doesn’t have the DontEnum attribute. If there is
no such property, go to step 15.
7. Evaluate Result(1) as if it were an Identifier; see 0 (yes, it may be evaluated repeatedly).
8. Call PutValue(Result(7), Result(6)).
9. Evaluate Statement.
10. If Result(9).value is not empty, let V = Result(9).value.
11. If Result(9).type is break and Result(9).target is in the current label set, go to step 15.
12. If Result(9).type is continue and Result(9).target is in the current label set, go to step 6.
13. If Result(8) is an abrupt completion, return Result(8).
14. Go to step 6.
15. Return (normal, V, empty).
The mechanics of enumerating the properties (step 5 in the first algorithm, step 6 in the second) is
implementation dependent. The order of enumeration is defined by the object. Properties of the object
being enumerated may be deleted during enumeration. If a property that has not yet been visited during
enumeration is deleted, then it will not be visited. If new properties are added to the object being
enumerated during enumeration, the newly added properties are not guaranteed to be visited in the active
enumeration.
Enumerating the properties of an object includes enumerating properties of its prototype, and the
prototype of the prototype, and so on, recursively; but a property of a prototype is not enumerated if it is
“shadowed” because some previous object in the prototype chain has a property with the same name.
12.7 The continue Statement
Syntax
ContinueStatement :
continue [no LineTerminator here] Identifieropt ;
Semantics
A program is considered syntactically incorrect if either of the following are true:
• The program contains a continue statement without the optional Identifier, which is not nested,
directly or indirectly (but not crossing function boundaries), within an IterationStatement.
• The program contains a continue statement with the optional Identifier, where Identifier does not
appear in the label set of an enclosing (but not crossing function boundaries) IterationStatement.
Semantics
A program is considered syntactically incorrect if either of the following are true:
• The program contains a break statement without the optional Identifier, which is not nested, directly
or indirectly (but not crossing function boundaries), within an IterationStatement or a SwitchStatement.
• The program contains a break statement with the optional Identifier, where Identifier does not appear
in the label set of an enclosing (but not crossing function boundaries) Statement.
Semantics
An ECMAScript program is considered syntactically incorrect if it contains a return statement that is not
within a FunctionBody. A return statement causes a function to cease execution and return a value to the
caller. If Expression is omitted, the return value is undefined. Otherwise, the return value is the value of
Expression.
The production ReturnStatement : return [no LineTerminator here] Expression opt ; is evaluated as:
1. If the Expression is not present, return (return, undefined, empty).
2. Evaluate Expression.
3. Call GetValue(Result(2)).
4. Return (return, Result(3), empty).
Description
The with statement adds a computed object to the front of the scope chain of the current execution
context, then executes a statement with this augmented scope chain, then restores the scope chain.
- 68 -
Semantics
The production WithStatement : with ( Expression ) Statement is evaluated as follows:
1. Evaluate Expression.
2. Call GetValue(Result(1)).
3. Call ToObject(Result(2)).
4. Add Result(3) to the front of the scope chain.
5. Evaluate Statement using the augmented scope chain from step 4.
6. Let C be Result(5). If an exception was thrown in step 5, let C be (throw, V, empty), where V is the
exception. (Execution now proceeds as if no exception were thrown.)
7. Remove Result(3) from the front of the scope chain.
8. Return C.
NOTE
No matter how control leaves the embedded 'Statement', whether normally or by some form of abrupt
completion or exception, the scope chain is always restored to its former state.
12.11 The switch Statement
Syntax
SwitchStatement :
switch ( Expression ) CaseBlock
CaseBlock :
{ CaseClausesopt }
{ CaseClausesopt DefaultClause CaseClausesopt }
CaseClauses :
CaseClause
CaseClauses CaseClause
CaseClause :
case Expression : StatementListopt
DefaultClause :
default : StatementListopt
Semantics
The production SwitchStatement : switch ( Expression ) CaseBlock is evaluated as follows:
1. Evaluate Expression.
2. Call GetValue(Result(1)).
3. Evaluate CaseBlock, passing it Result(2) as a parameter.
4. If Result(3).type is break and Result(3).target is in the current label set, return (normal,
Result(3).value, empty).
5. Return Result(3).
NOTE
Evaluating CaseClause does not execute the associated StatementList. It simply evaluates the Expression
and returns the value, which the CaseBlock algorithm uses to determine which StatementList to start
executing.
12.12 Labelled Statements
Syntax
LabelledStatement :
Identifier : Statement
Semantics
A Statement may be prefixed by a label. Labelled statements are only used in conjunction with labelled
break and continue statements. ECMAScript has no goto statement.
An ECMAScript program is considered syntactically incorrect if it contains a LabelledStatement that is
enclosed by a LabelledStatement with the same Identifier as label. This does not apply to labels appearing
within the body of a FunctionDeclaration that is nested, directly or indirectly, within a labelled statement.
The production Identifier : Statement is evaluated by adding Identifier to the label set of Statement and
then evaluating Statement. If the LabelledStatement itself has a non-empty label set, these labels are also
added to the label set of Statement before evaluating it. If the result of evaluating Statement is (break, V,
L) where L is equal to Identifier, the production results in (normal, V, empty).
Prior to the evaluation of a LabelledStatement, the contained Statement is regarded as possessing an empty
label set, except if it is an IterationStatement or a SwitchStatement, in which case it is regarded as
possessing a label set consisting of the single element, empty.
12.13 The throw statement
Syntax
ThrowStatement :
throw [no LineTerminator here] Expression ;
Semantics
The production ThrowStatement : throw [no LineTerminator here] Expression ; is evaluated as:
1. Evaluate Expression.
- 70 -
2. Call GetValue(Result(1)).
3. Return (throw, Result(2), empty).
Catch :
catch (Identifier ) Block
Finally :
finally Block
Description
The try statement encloses a block of code in which an exceptional condition can occur, such as a
runtime error or a throw statement. The catch clause provides the exception-handling code. When a
catch clause catches an exception, its Identifier is bound to that exception.
Semantics
The production TryStatement : try Block Catch is evaluated as follows:
1. Evaluate Block.
2. If Result(1).type is not throw, return Result(1).
3. Evaluate Catch with parameter Result(1).
4. Return Result(3).
1. Evaluate Block.
2. Return Result(1).
13 Function Definition
Syntax
FunctionDeclaration :
function Identifier ( FormalParameterListopt ) { FunctionBody }
FunctionExpression :
function Identifieropt ( FormalParameterListopt ) { FunctionBody }
FormalParameterList :
Identifier
FormalParameterList , Identifier
FunctionBody :
SourceElements
Semantics
The production FunctionDeclaration : function Identifier ( FormalParameterListopt ) { FunctionBody }
is processed for function declarations as follows:
1. Create a new Function object as specified in 13.2 with parameters specified by FormalParameterList, and
body specified by FunctionBody. Pass in the scope chain of the running execution context as the Scope.
2. Create a property of the current variable object (as specified in 10.1.3) with name Identifier and value
Result(1).
The production FunctionExpression : function ( FormalParameterListopt ) { FunctionBody } is evaluated
as follows:
1. Create a new Function object as specified in 13.2 with parameters specified by FormalParameterListopt
and body specified by FunctionBody. Pass in the scope chain of the running execution context as the
Scope.
2. Return Result(2).
The production FunctionExpression : function Identifier ( FormalParameterListopt ) { FunctionBody
} is evaluated as follows:
1. Create a new object as if by the expression new Object().
2. Add Result(1) to the front of the scope chain.
3. Create a new Function object as specified in 13.2 with parameters specified by FormalParameterListopt
and body specified by FunctionBody. Pass in the scope chain of the running execution context as the
Scope.
4. Create a property in the object Result(1). The property's name is Identifier, value is Result(3), and
attributes are { DontDelete, ReadOnly }.
5. Remove Result(1) from the front of the scope chain.
6. Return Result(3).
NOTE
The Identifier in a FunctionExpression can be referenced from inside the FunctionExpression's FunctionBody
to allow the function to call itself recursively. However, unlike in a FunctionDeclaration, the Identifier in a
FunctionExpression cannot be referenced from and does not affect the scope enclosing the
FunctionExpression.
The production FunctionBody : SourceElements is evaluated as follows:
1. Process SourceElements for function declarations.
- 72 -
2. Evaluate SourceElements.
3. Return Result(2).
13.1 Definitions
A couple of definitions are needed to describe the process of creating function objects:
13.1.1 Equated Grammar Productions
Two uses of the FunctionBody grammar production are defined to be equated when one of the following
is true:
• Both uses obtained their FunctionBody from the same location in the source text of the same
ECMAScript program. This source text consists of global code and any contained function codes
according to the definitions in 10.1.2.
• Both uses obtained their FunctionBody from the same location in the source text of the same call to
eval (15.1.2.1). This source text consists of eval code and any contained function codes according
to the definitions in 10.1.2.
NOTE
Two uses of FunctionBody obtained from a call to the Function constructor 15.3.1 and 15.3.2) are
never equated. Also, two uses of FunctionBody obtained from two different calls to eval are never
equated, even if those two calls to eval were passed the same argument.
13.1.2 Joined Objects
When two or more Function objects are joined, they have the following special behaviours:
• Any time a non-internal property of an object O is created or set, the corresponding property is
immediately also created or set with the same value and attributes in all objects joined with O.
• Any time a non-internal property of an object O is deleted, the corresponding property is
immediately also deleted in all objects joined with O.
• If objects O and P are joined, they compare as == and === to each other.
• Joining is transitive and symmetric, so that if objects O and P are joined and objects P and Q are
joined, then objects O and Q are also automatically joined.
NOTE
Two or more objects joined to each other are effectively indistinguishable except that they may have
different internal properties. The only such internal property that may differ in this specification is
[[Scope]].
Joined objects are used as a tool for precise specification technique in this standard. They are not meant
to be used as a guideline to how Function objects are implemented in practice. Rather, in practice an
implementation may detect when the differences in the [[Scope]] properties of two or more joined
Function objects are not externally observable and in those cases reuse the same Function object rather
than making a set of joined Function objects. This is a legal optimisation because this standard only
specifies observable behaviour of ECMAScript programs.
13.2 Creating Function Objects
Given an optional parameter list specified by FormalParameterList, a body specified by FunctionBody, and
a scope chain specified by Scope, a Function object is constructed as follows:
1. If there already exists an object E that was created by an earlier call to this section's algorithm, and if
that call to this section's algorithm was given a FunctionBody that is equated to the FunctionBody
given now, then go to step 13. (If there is more than one object E satisfying these criteria, choose one
at the implementation's discretion.)
2. Create a new native ECMAScript object and let F be that object.
3. Set the [[Class]] property of F to "Function".
4. Set the [[Prototype]] property of F to the original Function prototype object as specified in 15.3.3.1.
5. Set the [[Call]] property of F as described in 13.2.1.
6. Set the [[Construct]] property of F as described in 13.2.2.
7. Set the [[Scope]] property of F to a new scope chain (10.1.4) that contains the same objects as Scope.
- 73 -
8. Set the length property of F to the number of formal properties specified in FormalParameterList. If
no parameters are specified, set the length property of F to 0. This property is given attributes as
specified in 15.3.5.1.
9. Create a new object as would be constructed by the expression new Object().
10. Set the constructor property of Result(9) to F. This property is given attributes { DontEnum }.
11. Set the prototype property of F to Result(9). This property is given attributes as specified in
15.3.5.2.
12. Return F.
13. At the implementation's discretion, go to either step 2 or step 14.
14. Create a new native ECMAScript object joined to E and let F be that object. Copy all non-internal
properties and their attributes from E to F so that all non-internal properties are identical in E and F.
15. Set the [[Class]] property of F to "Function".
16. Set the [[Prototype]] property of F to the original Function prototype object as specified in 15.3.3.1.
17. Set the [[Call]] property of F as described in 13.2.1.
18. Set the [[Construct]] property of F as described in 13.2.2.
19. Set the [[Scope]] property of F to a new scope chain (10.1.4) that contains the same objects as Scope.
20. Return F.
NOTE
A prototype property is automatically created for every function, to allow for the possibility that the
function will be used as a constructor.
Step 1 allows an implementation to optimise the common case of a function A that has a nested function B
where B is not dependent on A. In this case the implementation is allowed to reuse the same object for B
instead of creating a new one every time A is called. Step 13 makes this optimisation optional; an
implementation that chooses not to implement it will go to step 2.
For example, in the code
function A() {
function B(x) {return x*x;}
return B;
}
function C() {
return eval("(function (x) {return x*x;})");
}
var b1 = A();
var b2 = A();
function b3(x) {return x*x;}
function b4(x) {return x*x;}
var b5 = C();
var b6 = C();
an implementation is allowed, but not required, to join b1 and b2. In fact, it may make b1 and b2 the
same object because there is no way to detect the difference between their [[Scope]] properties. On the
other hand, an implementation must not join b3 and b4 because their source codes are not equated
(13.1.1). Also, an implementation must not join b5 and b6 because they were produced by two different
calls to eval and therefore their source codes are not equated.
In practice it's likely to be productive to join two Function objects only in the cases where an
implementation can prove that the differences between their [[Scope]] properties are not observable, so
one object can be reused. By following this policy, an implementation will only encounter the vacuous case
of an object being joined with itself.
13.2.1 [[Call]]
When the [[Call]] property for a Function object F is called, the following steps are taken:
1. Establish a new execution context using F's FormalParameterList, the passed arguments list, and the
this value as described in 10.2.3.
2. Evaluate F's FunctionBody.
3. Exit the execution context established in step 1, restoring the previous execution context.
- 74 -
14 Program
Syntax
Program :
SourceElements
SourceElements :
SourceElement
SourceElements SourceElement
SourceElement :
Statement
FunctionDeclaration
Semantics
The production Program : SourceElements is evaluated as follows:
1. Process SourceElements for function declarations.
2. Evaluate SourceElements.
3. Return Result(2).
The production SourceElements : SourceElement is processed for function declarations as follows:
1. Process SourceElement for function declarations.
The production SourceElements : SourceElement is evaluated as follows:
1. Evaluate SourceElement.
2. Return Result(1).
The production SourceElement : Statement is processed for function declarations by taking no action.
The production SourceElement : Statement is evaluated as follows:
1. Evaluate Statement.
2. Return Result(1).
16. Compute the mathematical integer value that is represented by Z in radix-R notation, using the
letters A-Z and a-z for digits with values 10 through 35. (However, if R is 10 and Z contains more
than 20 significant digits, every significant digit after the 20th may be replaced by a 0 digit, at
the option of the implementation; and if R is not 2, 4, 8, 10, 16, or 32, then Result(16) may be an
implementation-dependent approximation to the mathematical integer value that is represented by
Z in radix-R notation.)
17. Compute the number value for Result(16).
18. Return sign × Result(17).
NOTE
parseInt may interpret only a leading portion of the string as an integer value; it ignores any
characters that cannot be interpreted as part of the notation of an integer, and no indication is given
that any such characters were ignored.
When radix is 0 or undefined and the string's number begins with a 0 digit not followed by an x or X,
then the implementation may, at its discretion, interpret the number either as being octal or as being
decimal. Implementations are encouraged to interpret numbers in this case as being decimal.
15.1.2.3 parseFloat (string)
The parseFloat function produces a number value dictated by interpretation of the contents of the
string argument as a decimal literal.
When the parseFloat function is called, the following steps are taken:
1. Call ToString(string).
2. Compute a substring of Result(1) consisting of the leftmost character that is not a
StrWhiteSpaceChar and all characters to the right of that character.(In other words, remove
leading white space.)
3. If neither Result(2) nor any prefix of Result(2) satisfies the syntax of a StrDecimalLiteral (see 0),
return NaN.
4. Compute the longest prefix of Result(2), which might be Result(2) itself, which satisfies the
syntax of a StrDecimalLiteral.
5. Return the number value for the MV of Result(4).
NOTE
parseFloat may interpret only a leading portion of the string as a number value; it ignores any
characters that cannot be interpreted as part of the notation of an decimal literal, and no indication
is given that any such characters were ignored.
15.1.2.4 isNaN (number)
Applies ToNumber to its argument, then returns true if the result is NaN, and otherwise returns false.
15.1.2.5 isFinite (number)
Applies ToNumber to its argument, then returns false if the result is NaN, +∞, or −∞, and otherwise
returns true.
15.1.3 URI Handling Function Properties
Uniform Resource Identifiers, or URIs, are strings that identify resources (e.g. web pages or files) and
transport protocols by which to access them (e.g. HTTP or FTP) on the Internet. The ECMAScript
language itself does not provide any support for using URIs except for functions that encode and decode
URIs as described in 15.1.3.1, 15.1.3.2, 15.1.3.3 and 15.1.3.4.
NOTE
Many implementations of ECMAScript provide additional functions and methods that manipulate web
pages; these functions are beyond the scope of this standard.
A URI is composed of a sequence of components separated by component separators. The general form
is:
Scheme : First / Second ; Third ? Fourth
- 79 -
where the italicised names represent components and the “:”, “/”, “;” and “?” are reserved characters
used as separators. The encodeURI and decodeURI functions are intended to work with complete
URIs; they assume that any reserved characters in the URI are intended to have special meaning and so
are not encoded. The encodeURIComponent and decodeURIComponent functions are intended
to work with the individual component parts of a URI; they assume that any reserved characters
represent text and so must be encoded so that they are not interpreted as reserved characters when the
component is part of a complete URI.
The following lexical grammar specifies the form of encoded URIs.
uri :::
uriCharactersopt
uriCharacters :::
uriCharacter uriCharactersopt
uriCharacter :::
uriReserved
uriUnescaped
uriEscaped
uriUnescaped :::
uriAlpha
DecimalDigit
uriMark
uriEscaped :::
% HexDigit HexDigit
When a character to be included in a URI is not listed above or is not intended to have the special
meaning sometimes given to the reserved characters, that character must be encoded. The character is
first transformed into a sequence of octets using the UTF-8 transformation, with surrogate pairs first
transformed from their UCS-2 to UCS-4 encodings. (Note that for code points in the range [0,127] this
results in a single octet with the same value.) The resulting sequence of octets is then transformed into a
string with each octet represented by an escape sequence of the form “%xx”.
The encoding and escaping process is described by the hidden function Encode taking two string
arguments string and unescapedSet. This function is defined for expository purpose only.
1. Compute the number of characters in string.
2. Let R be the empty string.
3. Let k be 0.
4. If k equals Result(1), return R.
5. Let C be the character at position k within string.
6. If C is not in unescapedSet, go to step 9.
7. Let S be a string containing only the character C.
8. Go to step 24.
9. If the code point value of C is not less than 0xDC00 and not greater than 0xDFFF, throw a
URIError exception.
- 80 -
10. If the code point value of C is less than 0xD800 or greater than 0xDBFF, let V be the code point
value of C and go to step 16.
11. Increase k by 1.
12. If k equals Result(1), throw a URIError exception.
13. Get the code point value of the character at position k within string.
14. If Result(13) is less than 0xDC00 or greater than 0xDFFF, throw a URIError exception.
15. Let V be (((the code point value of C) – 0xD800) * 0x400 + (Result(13) – 0xDC00) + 0x10000).
16. Let Octets be the array of octets resulting by applying the UTF-8 transformation to V, and let L be
the array size.
17. Let j be 0.
18. Get the value at position j within Octets.
19. Let S be a string containing three characters “%XY” where XY are two uppercase hexadecimal digits
encoding the value of Result(18).
20. Let R be a new string value computed by concatenating the previous value of R and S.
21. Increase j by 1.
22. If j is equal to L, go to step 25.
23. Go to step 18.
24. Let R be a new string value computed by concatenating the previous value of R and S.
25. Increase k by 1.
26. Go to step 4.
The unescaping and decoding process is described by the hidden function Decode taking two string
arguments string and reservedSet. This function is defined for expository purpose only.
1. Compute the number of characters in string.
2. Let R be the empty string.
3. Let k be 0.
4. If k equals Result(1), return R.
5. Let C be the character at position k within string.
6. If C is not ‘%’, go to step 40.
7. Let start be k.
8. If k + 2 is greater than or equal to Result(1), throw a URIError exception.
9. If the characters at position (k+1) and (k + 2) within string do not represent hexadecimal digits,
throw a URIError exception.
10. Let B be the 8-bit value represented by the two hexadecimal digits at position (k + 1) and (k + 2).
11. Increment k by 2.
12. If the most significant bit in B is 0, let C be the character with code point value B and go to step 37.
13. Let n be the smallest non-negative number such that (B << n) & 0x80 is equal to 0.
14. If n equals 1 or n is greater than 4, throw a URIError exception.
15. Let Octets be an array of 8-bit integers of size n.
16. Put B into Octets at position 0.
17. If k + (3 * (n – 1)) is greater than or equal to Result(1), throw a URIError exception.
18. Let j be 1.
19. If j equals n, go to step 29.
20. Increment k by 1.
21. If the character at position k is not ‘%’, throw a URIError exception.
22. If the characters at position (k +1) and (k + 2) within string do not represent hexadecimal digits,
throw a URIError exception.
23. Let B be the 8-bit value represented by the two hexadecimal digits at position (k + 1) and (k + 2).
24. If the two most significant bits in B are not 10, throw a URIError exception.
25. Increment k by 2.
26. Put B into Octets at position j.
27. Increment j by 1.
28. Go to step 19.
29. Let V be the value obtained by applying the UTF-8 transformation to Octets, that is, from an array of
octets into a 32-bit value.
30. If V is less than 0x10000, go to step 36.
31. If V is greater than 0x10FFFF, throw a URIError exception.
- 81 -
Code Point Value Representation 1st Octet 2nd Octet 3rd Octet 4th Octet
0x0000 - 0x007F 00000000 0zzzzzzz 0zzzzzzz
0x0080 - 0x07FF 00000yyy yyzzzzzz 110yyyyy 10zzzzzz
0x0800 - 0xD7FF xxxxyyyy yyzzzzzz 1110xxxx 10yyyyyy 10zzzzzz
0xD800 - 0xDBFF 110110vv vvwwwwxx
followed by followed by 11110uuu 10uuwwww 10xxyyyy 10zzzzzz
0xDC00 – 0xDFFF 110111yy yyzzzzzz
0xD800 - 0xDBFF
not followed by causes URIError
0xDC00 – 0xDFFF
0xDC00 – 0xDFFF causes URIError
0xE000 - 0xFFFF xxxxyyyy yyzzzzzz 1110xxxx 10yyyyyy 10zzzzzz
Where
uuuuu = vvvv + 1
to account for the addition of 0x10000 as in 3.7, Surrogates of the Unicode Standard version 2.0.
The range of code point values 0xD800-0xDFFF is used to encode surrogate pairs; the above
transformation combines a UCS-2 surrogate pair into a UCS-4 representation and encodes the resulting
21-bit value in UTF-8. Decoding reconstructs the surrogate pair.
15.1.3.1 decodeURI (encodedURI)
The decodeURI function computes a new version of a URI in which each escape sequence and UTF-
8 encoding of the sort that might be introduced by the encodeURI function is replaced with the
- 82 -
character that it represents. Escape sequences that could not have been introduced by encodeURI are
not replaced.
When the decodeURI function is called with one argument encodedURI, the following steps are
taken:
1. Call ToString(encodedURI).
2. Let reservedURISet be a string containing one instance of each character valid in uriReserved
plus “#”.
3. Call Decode(Result(1), reservedURISet)
4. Return Result(3).
NOTE
The character “#” is not decoded from escape sequences even though it is not a reserved URI
character.
15.1.3.2 decodeURIComponent (encodedURIComponent)
The decodeURIComponent function computes a new version of a URI in which each escape
sequence and UTF-8 encoding of the sort that might be introduced by the encodeURIComponent
function is replaced with the character that it represents.
When the decodeURIComponent function is called with one argument encodedURIComponent, the
following steps are taken:
1. Call ToString(encodedURIComponent).
2. Let reservedURIComponentSet be the empty string.
3. Call Decode(Result(1), reservedURIComponentSet)
4. Return Result(3).
NOTE
The character “#” is not encoded to an escape sequence even though it is not a reserved or
unescaped URI character.
15.1.3.4 encodeURIComponent (uriComponent)
The encodeURIComponent function computes a new version of a URI in which each instance of
certain characters is replaced by one, two or three escape sequences representing the UTF-8 encoding
of the character.
When the encodeURIComponent function is called with one argument uriComponent, the
following steps are taken:
1. Call ToString(uriComponent).
2. Let unescapedURIComponentSet be a string containing one instance of each character valid in
uriUnescaped.
3. Call Encode(Result(1), unescapedURIComponentSet)
4. Return Result(3).
- 83 -
15.2.4.3 Object.prototype.toLocaleString ( )
This function returns the result of calling toString().
NOTE 1
This function is provided to give all Objects a generic toLocaleString interface, even though not
all may use it. Currently, Array, Number, and Date provide their own locale-sensitive
toLocaleString methods.
NOTE 2
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
- 85 -
15.2.4.4 Object.prototype.valueOf ( )
The valueOf method returns its this value. If the object is the result of calling the Object
constructor with a host object (15.2.2.1), it is implementation-defined whether valueOf returns its
this value or another value such as the host object originally passed to the constructor.
15.2.4.5 Object.prototype.hasOwnProperty (V)
When the hasOwnProperty method is called with argument V, the following steps are taken:
1. Let O be this object.
2. Call ToString(V).
3. If O doesn’t have a property with the name given by Result(2), return false.
4. Return true.
NOTE
Unlike [[HasProperty]] (8.6.2.4), this method does not consider objects in the prototype chain.
15.2.4.6 Object.prototype.isPrototypeOf (V)
When the isPrototypeOf method is called with argument V, the following steps are taken:
1. Let O be this object.
2. If V is not an object, return false.
3. Let V be the value of the [[Prototype]] property of V.
4. if V is null, return false
5. If O and V refer to the same object or if they refer to objects joined to each other (13.1.2), return
true.
6. Go to step 3.
NOTE
This method does not consider objects in the prototype chain.
15.2.5 Properties of Object Instances
Object instances have no special properties beyond those inherited from the Object prototype object.
15.3 Function Objects
15.3.1 The Function Constructor Called as a Function
When Function is called as a function rather than as a constructor, it creates and initialises a new
Function object. Thus the function call Function(…) is equivalent to the object creation expression
new Function(…) with the same arguments.
15.3.1.1 Function (p1, p2, … , pn, body)
When the Function function is called with some arguments p1, p2, … , pn, body (where n might be
0, that is, there are no “p” arguments, and where body might also not be provided), the following steps
are taken:
1. Create and return a new Function object as if the function constructor had been called with the
same arguments (15.3.2.1).
- 86 -
A prototype property is automatically created for every function, to provide for the possibility that
the function will be used as a constructor.
NOTE
It is permissible but not necessary to have one argument for each formal parameter to be specified.
For example, all three of the following expressions produce the same result:
new Function("a", "b", "c", "return a+b+c")
The value of the internal [[Prototype]] property of the Function prototype object is the Object prototype
object (15.3.2.1).
It is a function with an “empty body”; if it is invoked, it merely returns undefined.
The Function prototype object does not have a valueOf property of its own; however, it inherits the
valueOf property from the Object prototype Object.
15.3.4.1 Function.prototype.constructor
The initial value of Function.prototype.constructor is the built-in Function
constructor.
15.3.4.2 Function.prototype.toString ( )
An implementation-dependent representation of the function is returned. This representation has the
syntax of a FunctionDeclaration. Note in particular that the use and placement of white space, line
terminators, and semicolons within the representation string is implementation-dependent.
The toString function is not generic; it throws a TypeError exception if its this value is not a
Function object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
15.3.4.3 Function.prototype.apply (thisArg, argArray)
The apply method takes two arguments, thisArg and argArray, and performs a function call using
the [[Call]] property of the object. If the object does not have a [[Call]] property, a TypeError
exception is thrown.
If thisArg is null or undefined, the called function is passed the global object as the this value.
Otherwise, the called function is passed ToObject(thisArg) as the this value.
If argArray is null or undefined, the called function is passed no arguments. Otherwise, if argArray
is neither an array nor an arguments object (see 10.1.8), a TypeError exception is thrown. If
argArray is either an array or an arguments object, the function is passed the
(ToUint32(argArray.length)) arguments argArray[0], argArray[1], …,
argArray[ToUint32(argArray.length)–1].
The length property of the apply method is 2.
15.3.4.4 Function.prototype.call (thisArg [ , arg1 [ , arg2, … ] ] )
The call method takes one or more arguments, thisArg and (optionally) arg1, arg2 etc, and performs
a function call using the [[Call]] property of the object. If the object does not have a [[Call]] property,
a TypeError exception is thrown. The called function is passed arg1, arg2, etc. as the arguments.
If thisArg is null or undefined, the called function is passed the global object as the this value.
Otherwise, the called function is passed ToObject(thisArg) as the this value.
The length property of the call method is 1.
15.3.5 Properties of Function Instances
In addition to the required internal properties, every function instance has a [[Call]] property, a
[[Construct]] property and a [[Scope]] property (see 8.6.2 and 13.2). The value of the [[Class]] property
is "Function".
15.3.5.1 length
The value of the length property is usually an integer that indicates the “typical” number of
arguments expected by the function. However, the language permits the function to be invoked with
some other number of arguments. The behaviour of a function when invoked on a number of
arguments other than the number specified by its length property depends on the function. This
property has the attributes { DontDelete, ReadOnly, DontEnum }.
15.3.5.2 prototype
The value of the prototype property is used to initialise the internal [[Prototype]] property of a
newly created object before the Function object is invoked as a constructor for that newly created
object. This property has the attribute { DontDelete }.
- 88 -
4. Call ToString(separator).
5. If Result(2) is zero, return the empty string.
6. Call the [[Get]] method of this object with argument "0".
7. If Result(6) is undefined or null, use the empty string; otherwise, call
ToObject(Result(6)).toLocaleString().
8. Let R be Result(7).
9. Let k be 1.
10. If k equals Result(2), return R.
11. Let S be a string value produced by concatenating R and Result(4).
12. Call the [[Get]] method of this object with argument ToString(k).
13. If Result(12) is undefined or null, use the empty string; otherwise, call
ToObject(Result(12)).toLocaleString().
14. Let R be a string value produced by concatenating S and Result(13).
15. Increase k by 1.
16. Go to step 10.
The toLocaleString function is not generic; it throws a TypeError exception if its this value is
not an Array object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
NOTE
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.4.4.4 Array.prototype.concat ( [ item1 [ , item2 [ , … ] ] ] )
When the concat method is called with zero or more arguments item1, item2, etc., it returns an array
containing the array elements of the object followed by the array elements of each argument in order.
The following steps are taken:
1. Let A be a new array created as if by the expression new Array().
2. Let n be 0.
3. Let E be this object.
4. If E is not an Array object, go to step 16.
5. Let k be 0.
6. Call the [[Get]] method of E with argument "length".
7. If k equals Result(6) go to step 19.
8. Call ToString(k).
9. If E has a property named by Result(8), go to step 10, but if E has no property named by
Result(8), go to step 13.
10. Call ToString(n).
11. Call the [[Get]] method of E with argument Result(8).
12. Call the [[Put]] method of A with arguments Result(10) and Result(11).
13. Increase n by 1.
14. Increase k by 1.
15. Go to step 7.
16. Call ToString(n).
17. Call the [[Put]] method of A with arguments Result(16) and E.
18. Increase n by 1.
19. Get the next argument in the argument list; if there are no more arguments, go to step 22.
20. Let E be Result(19).
21. Go to step 4.
22. Call the [[Put]] method of A with arguments "length" and n.
23. Return A.
NOTE
The concat function is intentionally generic; it does not require that its this value be an Array
object. Therefore it can be transferred to other kinds of objects for use as a method. Whether the
concat function can be applied successfully to a host object is implementation-dependent.
15.4.4.5 Array.prototype.join (separator)
The elements of the array are converted to strings, and these strings are then concatenated, separated
by occurrences of the separator. If no separator is provided, a single comma is used as the separator.
The join method takes one argument, separator, and performs the following steps:
1. Call the [[Get]] method of this object with argument "length".
2. Call ToUint32(Result(1)).
3. If separator is undefined, let separator be the single-character string ",".
4. Call ToString(separator).
5. If Result(2) is zero, return the empty string.
6. Call the [[Get]] method of this object with argument "0".
7. If Result(6) is undefined or null, use the empty string; otherwise, call ToString(Result(6)).
8. Let R be Result(7).
9. Let k be 1.
10. If k equals Result(2), return R.
11. Let S be a string value produced by concatenating R and Result(4).
12. Call the [[Get]] method of this object with argument ToString(k).
13. If Result(12) is undefined or null, use the empty string; otherwise, call ToString(Result(12)).
14. Let R be a string value produced by concatenating S and Result(13).
15. Increase k by 1.
16. Go to step 10.
NOTE
The pop function is intentionally generic; it does not require that its this value be an Array object.
Therefore it can be transferred to other kinds of objects for use as a method. Whether the pop
function can be applied successfully to a host object is implementation-dependent.
15.4.4.7 Array.prototype.push ( [ item1 [ , item2 [ , … ] ] ] )
The arguments are appended to the end of the array, in the order in which they appear. The new length
of the array is returned as the result of the call.
When the push method is called with zero or more arguments item1, item2, etc., the following steps
are taken:
- 92 -
NOTE
The reverse function is intentionally generic; it does not require that its this value be an Array
object. Therefore, it can be transferred to other kinds of objects for use as a method. Whether the
reverse function can be applied successfully to a host object is implementation-dependent.
15.4.4.9 Array.prototype.shift ( )
The first element of the array is removed from the array and returned.
- 93 -
NOTE
The shift function is intentionally generic; it does not require that its this value be an Array object.
Therefore it can be transferred to other kinds of objects for use as a method. Whether the shift
function can be applied successfully to a host object is implementation-dependent.
15.4.4.10 Array.prototype.slice (start, end)
The slice method takes two arguments, start and end, and returns an array containing the elements
of the array from element start up to, but not including, element end (or through the end of the array
if end is undefined). If start is negative, it is treated as (length+start) where length is the length of
the array. If end is negative, it is treated as (length+end) where length is the length of the array. The
following steps are taken:
1. Let A be a new array created as if by the expression new Array().
2. Call the [[Get]] method of this object with argument "length".
3. Call ToUint32(Result(2)).
4. Call ToInteger(start).
5. If Result(4) is negative, use max((Result(3)+Result(4)),0); else use min(Result(4),Result(3)).
6. Let k be Result(5).
7. If end is undefined, use Result(3); else use ToInteger(end).
8. If Result(7) is negative, use max((Result(3)+Result(7)),0); else use min(Result(7),Result(3)).
9. Let n be 0.
10. If k is greater than or equal to Result(8), go to step 19.
11. Call ToString(k).
12. If this object has a property named by Result(11), go to step 13; but if this object has no property
named by Result(11), then go to step 16.
13. Call ToString(n).
14. Call the [[Get]] method of this object with argument Result(11).
15. Call the [[Put]] method of A with arguments Result(13) and Result(14).
16. Increase k by 1.
17. Increase n by 1.
18. Go to step 10.
19. Call the [[Put]] method of A with arguments "length" and n.
20. Return A.
NOTE
The slice function is intentionally generic; it does not require that its this value be an Array object.
Therefore it can be transferred to other kinds of objects for use as a method. Whether the slice
function can be applied successfully to a host object is implementation-dependent.
15.4.4.11 Array.prototype.sort (comparefn)
The elements of this array are sorted. The sort is not necessarily stable (that is, elements that compare
equal do not necessarily remain in their original order). If comparefn is not undefined, it should be a
function that accepts two arguments x and y and returns a negative value if x < y, zero if x = y, or a
positive value if x > y.
If comparefn is not undefined and is not a consistent comparison function for the elements of this
array (see below), the behaviour of sort is implementation-defined. Let len be
ToUint32(this.length). If there exist integers i and j and an object P such that all of the
conditions below are satisfied then the behaviour of sort is implementation-defined:
• 0 ≤ i < len
• 0 ≤ j < len
• this does not have a property with name ToString(i)
• P is obtained by following one or more [[Prototype]] properties starting at this
• P has a property with name ToString(j)
Here the notation old[j] is used to refer to the hypothetical result of calling the [[Get]] method of
this object with argument j before this function is executed, and the notation new[j] to refer to the
hypothetical result of calling the [[Get]] method of this object with argument j after this function has
been executed.
A function comparefn is a consistent comparison function for a set of values S if all of the
requirements below are met for all values a, b, and c (possibly the same value) in the set S: The
notation a <CF b means comparefn(a,b) < 0; a =CF b means comparefn(a,b) = 0 (of either sign); and
a >CF b means comparefn(a,b) > 0.
• Calling comparefn(a,b) always returns the same value v when given a specific pair of values a
and b as its two arguments. Furthermore, v has type Number, and v is not NaN. Note that this
implies that exactly one of a <CF b, a =CF b, and a >CF b will be true for a given pair of a and b.
• a =CF a (reflexivity)
• If a =CF b, then b =CF a (symmetry)
• If a =CF b and b =CF c, then a =CF c (transitivity of = CF)
• If a <CF b and b <CF c, then a <CF c (transitivity of < CF)
- 95 -
NOTE
The above conditions are necessary and sufficient to ensure that comparefn divides the set S into
equivalence classes and that these equivalence classes are totally ordered.
When the SortCompare operator is called with two arguments j and k, the following steps are taken:
1. Call ToString(j).
2. Call ToString(k).
3. If this object does not have a property named by Result(1), and this object does not have a
property named by Result(2), return +0.
4. If this object does not have a property named by Result(1), return 1.
5. If this object does not have a property named by Result(2), return –1.
6. Call the [[Get]] method of this object with argument Result(1).
7. Call the [[Get]] method of this object with argument Result(2).
8. Let x be Result(6).
9. Let y be Result(7).
10. If x and y are both undefined, return +0.
11. If x is undefined, return 1.
12. If y is undefined, return −1.
13. If the argument comparefn is undefined, go to step 16.
14. Call comparefn with arguments x and y.
15. Return Result(14).
16. Call ToString(x).
17. Call ToString(y).
18. If Result(16) < Result(17), return −1.
19. If Result(16) > Result(17), return 1.
20. Return +0.
NOTE 1
Because non-existent property values always compare greater than undefined property values, and
undefined always compares greater than any other value, undefined property values always sort to
the end of the result, followed by non-existent property values.
NOTE 2
The sort function is intentionally generic; it does not require that its this value be an Array object.
Therefore, it can be transferred to other kinds of objects for use as a method. Whether the sort
function can be applied successfully to a host object is implementation-dependent.
15.4.4.12 Array.prototype.splice (start, deleteCount [ , item1 [ , item2 [ , … ] ] ] )
When the splice method is called with two or more arguments start, deleteCount and (optionally)
item1, item2, etc., the deleteCount elements of the array starting at array index start are replaced by
the arguments item1, item2, etc. The following steps are taken:
1. Let A be a new array created as if by the expression new Array().
2. Call the [[Get]] method of this object with argument "length".
3. Call ToUint32(Result(2)).
4. Call ToInteger(start).
5. If Result(4) is negative, use max((Result(3)+Result(4)),0); else use min(Result(4),Result(3)).
6. Compute min(max(ToInteger(deleteCount),0),Result(3)–Result(5)).
7. Let k be 0.
8. If k equals Result(6), go to step 16.
9. Call ToString(Result(5)+k).
10. If this object has a property named by Result(9), go to step 11; but if this object has no property
named by Result(9), then go to step 14.
11. Call ToString(k).
12. Call the [[Get]] method of this object with argument Result(9).
13. Call the [[Put]] method of A with arguments Result(11) and Result(12).
- 96 -
14. Increment k by 1.
15. Go to step 8.
16. Call the [[Put]] method of A with arguments "length" and Result(6).
17. Compute the number of additional arguments item1, item2, etc.
18. If Result(17) is equal to Result(6), go to step 48.
19. If Result(17) is greater than Result(6), go to step 37.
20. Let k be Result(5).
21. If k is equal to (Result(3)–Result(6)), go to step 31.
22. Call ToString(k+Result(6)).
23. Call ToString(k+Result(17)).
24. If this object has a property named by Result(22), go to step 25; but if this object has no property
named by Result(22), then go to step 28.
25. Call the [[Get]] method of this object with argument Result(22).
26. Call the [[Put]] method of this object with arguments Result(23) and Result(25).
27. Go to step 29.
28. Call the [[Delete]] method of this object with argument Result(23).
29. Increase k by 1.
30. Go to step 21.
31. Let k be Result(3).
32. If k is equal to (Result(3)–Result(6)+Result(17)), go to step 48.
33. Call ToString(k–1).
34. Call the [[Delete]] method of this object with argument Result(33).
35. Decrease k by 1.
36. Go to step 32.
37. Let k be (Result(3)–Result(6)).
38. If k is equal to Result(5), go to step 48.
39. Call ToString(k+Result(6)–1).
40. Call ToString(k+Result(17)–1)
41. If this object has a property named by Result(39), go to step 42; but if this object has no property
named by Result(39), then go to step 45.
42. Call the [[Get]] method of this object with argument Result(39).
43. Call the [[Put]] method of this object with arguments Result(40) and Result(42).
44. Go to step 46.
45. Call the [[Delete]] method of this object with argument Result(40).
46. Decrease k by 1.
47. Go to step 38.
48. Let k be Result(5).
49. Get the next argument in the part of the argument list that starts with item1; if there are no more
arguments, go to step 53.
50. Call the [[Put]] method of this object with arguments ToString(k) and Result(49).
51. Increase k by 1.
52. Go to step 49.
53. Call the [[Put]] method of this object with arguments "length" and (Result(3)–
Result(6)+Result(17)).
54. Return A.
The length property of the splice method is 2.
NOTE
The splice function is intentionally generic; it does not require that its this value be an Array
object. Therefore it can be transferred to other kinds of objects for use as a method. Whether the
splice function can be applied successfully to a host object is implementation-dependent.
15.4.4.13 Array.prototype.unshift ( [ item1 [ , item2 [ , … ] ] ] )
The arguments are prepended to the start of the array, such that their order within the array is the
same as the order in which they appear in the argument list.
When the unshift method is called with zero or more arguments item1, item2, etc., the following
steps are taken:
- 97 -
15.4.5.2 length
The length property of this Array object is always numerically greater than the name of every
property whose name is an array index.
The length property has the attributes { DontEnum, DontDelete }.
15.5 String Objects
15.5.1 The String Constructor Called as a Function
When String is called as a function rather than as a constructor, it performs a type conversion.
15.5.1.1 String ( [ value ] )
Returns a string value (not a String object) computed by ToString(value). If value is not supplied, the
empty string "" is returned.
15.5.2 The String Constructor
When String is called as part of a new expression, it is a constructor: it initialises the newly created
object.
15.5.2.1 new String ( [ value ] )
The [[Prototype]] property of the newly constructed object is set to the original String prototype
object, the one that is the initial value of String.prototype (15.5.3.1).
The [[Class]] property of the newly constructed object is set to "String".
The [[Value]] property of the newly constructed object is set to ToString(value), or to the empty
string if value is not supplied.
15.5.3 Properties of the String Constructor
The value of the internal [[Prototype]] property of the String constructor is the Function prototype object
(15.3.4).
Besides the internal properties and the length property (whose value is 1), the String constructor has
the following properties:
15.5.3.1 String.prototype
The initial value of String.prototype is the String prototype object (15.5.4).
This property has the attributes { DontEnum, DontDelete, ReadOnly }.
15.5.3.2 String.fromCharCode ( [ char0 [ , char1 [ , … ] ] ] )
Returns a string value containing as many characters as the number of arguments. Each argument
specifies one character of the resulting string, with the first argument specifying the first character,
and so on, from left to right. An argument is converted to a character by applying the operation
ToUint16 (9.7) and regarding the resulting 16-bit integer as the code point value of a character. If no
arguments are supplied, the result is the empty string.
The length property of the fromCharCode function is 1.
15.5.4 Properties of the String Prototype Object
The String prototype object is itself a String object (its [[Class]] is "String") whose value is an empty
string.
The value of the internal [[Prototype]] property of the String prototype object is the Object prototype
object (15.2.3.1).
15.5.4.1 String.prototype.constructor
The initial value of String.prototype.constructor is the built-in String constructor.
- 99 -
15.5.4.2 String.prototype.toString ( )
Returns this string value. (Note that, for a String object, the toString method happens to return the
same thing as the valueOf method.)
The toString function is not generic; it throws a TypeError exception if its this value is not a
String object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
15.5.4.3 String.prototype.valueOf ( )
Returns this string value.
The valueOf function is not generic; it throws a TypeError exception if its this value is not a
String object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
15.5.4.4 String.prototype.charAt (pos)
Returns a string containing the character at position pos in the string resulting from converting this
object to a string. If there is no character at that position, the result is the empty string. The result is a
string value, not a String object.
If pos is a value of Number type that is an integer, then the result of x.charAt(pos) is equal to the
result of x.substring(pos, pos+1).
When the charAt method is called with one argument pos, the following steps are taken:
1. Call ToString, giving it the this value as its argument.
2. Call ToInteger(pos).
3. Compute the number of characters in Result(1).
4. If Result(2) is less than 0 or is not less than Result(3), return the empty string.
5. Return a string of length 1, containing one character from Result(1), namely the character at
position Result(2), where the first (leftmost) character in Result(1) is considered to be at position
0, the next one at position 1, and so on.
NOTE
The charAt function is intentionally generic; it does not require that its this value be a String
object. Therefore, it can be transferred to other kinds of objects for use as a method.
15.5.4.5 String.prototype.charCodeAt (pos)
Returns a number (a nonnegative integer less than 2 16) representing the code point value of the
character at position pos in the string resulting from converting this object to a string. If there is no
character at that position, the result is NaN.
When the charCodeAt method is called with one argument pos, the following steps are taken:
1. Call ToString, giving it the this value as its argument.
2. Call ToInteger(pos).
3. Compute the number of characters in Result(1).
4. If Result(2) is less than 0 or is not less than Result(3), return NaN.
5. Return a value of Number type, whose value is the code point value of the character at position
Result(2) in the string Result(1), where the first (leftmost) character in Result(1) is considered to
be at position 0, the next one at position 1, and so on.
NOTE
The charCodeAt function is intentionally generic; it does not require that its this value be a String
object. Therefore it can be transferred to other kinds of objects for use as a method.
15.5.4.6 String.prototype.concat ( [ string1 [ , string2 [ , … ] ] ] )
When the concat method is called with zero or more arguments string1, string2, etc., it returns a
string consisting of the characters of this object (converted to a string) followed by the characters of
each of string1, string2, etc. (where each argument is converted to a string). The result is a string
value, not a String object. The following steps are taken:
1. Call ToString, giving it the this value as its argument.
2. Let R be Result(1).
- 100 -
3. Get the next argument in the argument list; if there are no more arguments, go to step 7.
4. Call ToString(Result(3)).
5. Let R be the string value consisting of the characters in the previous value of R followed by the
characters Result(4).
6. Go to step 3.
7. Return R.
9. Return Result(8).
length property set to n and properties 0 through n–1 corresponding to the first elements of the
results of all matching invocations of RegExp.prototype.exec.
NOTE
The match function is intentionally generic; it does not require that its this value be a String object.
Therefore, it can be transferred to other kinds of objects for use as a method.
15.5.4.11 String.prototype.replace (searchValue, replaceValue)
Let string denote the result of converting the this value to a string.
If searchValue is a regular expression (an object whose [[Class]] property is "RegExp"), do the
following: If searchValue.global is false, then search string for the first match of the regular
expression searchValue. If searchValue.global is true, then search string for all matches of the
regular expression searchValue. Do the search in the same manner as in
String.prototype.match, including the update of searchValue.lastIndex. Let m be the
number of left capturing parentheses in searchValue (NCapturingParens as specified in 15.10.2.1).
If searchValue is not a regular expression, let searchString be ToString(searchValue) and search
string for the first occurrence of searchString. Let m be 0.
If replaceValue is a function, then for each matched substring, call the function with the following m
+ 3 arguments. Argument 1 is the substring that matched. If searchValue is a regular expression, the
next m arguments are all of the captures in the MatchResult (see 15.10.2.1). Argument m + 2 is the
offset within string where the match occurred, and argument m + 3 is string. The result is a string
value derived from the original input by replacing each matched substring with the corresponding
return value of the function call, converted to a string if need be.
Otherwise, let newstring denote the result of converting replaceValue to a string. The result is a string
value derived from the original input string by replacing each matched substring with a string derived
from newstring by replacing characters in newstring by replacement text as specified in the following
table. These $ replacements are done left-to-right, and, once such a replacement is performed, the new
replacement text is not subject to further replacements. For example,
"$1,$2".replace(/(\$(\d))/g, "$$1-$1$2") returns "$1-$11,$1-$22". A $ in
newstring that does not match any of the forms below is left as is.
NOTE
The replace function is intentionally generic; it does not require that its this value be a String
object. Therefore, it can be transferred to other kinds of objects for use as a method.
15.5.4.12 String.prototype.search (regexp)
If regexp is not an object whose [[Class]] property is "RegExp", it is replaced with the result of the
expression new RegExp(regexp). Let string denote the result of converting the this value to a
string.
- 103 -
The value string is searched from its beginning for an occurrence of the regular expression pattern
regexp. The result is a number indicating the offset within the string where the pattern matched, or –1
if there was no match.
NOTE 1
This method ignores the lastIndex and global properties of regexp. The lastIndex property
of regexp is left unchanged.
NOTE 2
The search function is intentionally generic; it does not require that its this value be a String
object. Therefore, it can be transferred to other kinds of objects for use as a method.
15.5.4.13 String.prototype.slice (start, end)
The slice method takes two arguments, start and end, and returns a substring of the result of
converting this object to a string, starting from character position start and running to, but not
including, character position end (or through the end of the string if end is undefined). If start is
negative, it is treated as (sourceLength+start) where sourceLength is the length of the string. If end is
negative, it is treated as (sourceLength+end) where sourceLength is the length of the string. The result
is a string value, not a String object. The following steps are taken:
1. Call ToString, giving it the this value as its argument.
2. Compute the number of characters in Result(1).
3. Call ToInteger(start).
4. If end is undefined, use Result(2); else use ToInteger(end).
5. If Result(3) is negative, use max(Result(2)+Result(3),0); else use min(Result(3),Result(2)).
6. If Result(4) is negative, use max(Result(2)+Result(4),0); else use min(Result(4),Result(2)).
7. Compute max(Result(6)–Result(5),0).
8. Return a string containing Result(7) consecutive characters from Result(1) beginning with the
character at position Result(5).
the array ["A", undefined, "B", "bold", "/", "B", "and", undefined,
"CODE", "coded", "/", "CODE", ""].)
If separator is undefined, then the result array contains just one string, which is the this value
(converted to a string). If limit is not undefined, then the output array is truncated so that it contains
no more than limit elements.
When the split method is called, the following steps are taken:
1. Let S = ToString(this).
2. Let A be a new array created as if by the expression new Array().
3. If limit is undefined, let lim = 2 32–1; else let lim = ToUint32(limit).
4. Let s be the number of characters in S.
5. Let p = 0.
6. If separator is a RegExp object (its [[Class]] is "RegExp"), let R = separator; otherwise let R =
ToString(separator).
7. If lim = 0, return A.
8. If separator is undefined, go to step 33.
9. If s = 0, go to step 31.
10. Let q = p.
11. If q = s, go to step 28.
12. Call SplitMatch(R, S, q) and let z be its MatchResult result.
13. If z is failure, go to step 26.
14. z must be a State. Let e be z's endIndex and let cap be z's captures array.
15. If e = p, go to step 26.
16. Let T be a string value equal to the substring of S consisting of the characters at positions p
(inclusive) through q (exclusive).
17. Call the [[Put]] method of A with arguments A.length and T.
18. If A.length = lim, return A.
19. Let p = e.
20. Let i = 0.
21. If i is equal to the number of elements in cap, go to step 10.
22. Let i = i+1.
23. Call the [[Put]] method of A with arguments A.length and cap[i].
24. If A.length = lim, return A.
25. Go to step 21.
26. Let q = q+1.
27. Go to step 11.
28. Let T be a string value equal to the substring of S consisting of the characters at positions p
(inclusive) through s (exclusive).
29. Call the [[Put]] method of A with arguments A.length and T.
30. Return A.
31. Call SplitMatch(R, S, 0) and let z be its MatchResult result.
32. If z is not failure, return A.
33. Call the [[Put]] method of A with arguments "0" and S.
34. Return A.
The internal helper function SplitMatch takes three parameters, a string S, an integer q, and a string or
RegExp R, and performs the following in order to return a MatchResult (see 15.10.2.1):
1. If R is a RegExp object (its [[Class]] is "RegExp"), go to step 8.
2. R must be a string. Let r be the number of characters in R.
3. Let s be the number of characters in S.
4. If q+r > s then return the MatchResult failure.
5. If there exists an integer i between 0 (inclusive) and r (exclusive) such that the character at
position q+i of S is different from the character at position i of R, then return failure.
6. Let cap be an empty array of captures (see 15.10.2.1).
7. Return the State (q+r, cap). (see 15.10.2.1)
8. Call the [[Match]] method of R giving it the arguments S and q, and return the MatchResult result.
- 105 -
NOTE 1
The toLocaleLowerCase function is intentionally generic; it does not require that its this value
be a String object. Therefore, it can be transferred to other kinds of objects for use as a method.
NOTE 2
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.5.4.18 String.prototype.toUpperCase ( )
This function behaves in exactly the same way as String.prototype.toLowerCase, except
that characters are mapped to their uppercase equivalents as specified in the Unicode Character
Database.
NOTE 1
Because both toUpperCase and toLowerCase have context-sensitive behaviour, the functions
are not symmetrical. In other words, s.toUpperCase().toLowerCase() is not necessarily
equal to s.toLowerCase().
NOTE 2
The toUpperCase function is intentionally generic; it does not require that its this value be a
String object. Therefore, it can be transferred to other kinds of objects for use as a method.
15.5.4.19 String.prototype.toLocaleUpperCase ( )
This function works exactly the same as toUpperCase except that its result is intended to yield the
correct result for the host environment’s current locale, rather than a locale-independent result. There
will only be a difference in the few cases (such as Turkish) where the rules for that language conflict
with the regular Unicode case mappings.
NOTE 1
The toLocaleUpperCase function is intentionally generic; it does not require that its this value
be a String object. Therefore, it can be transferred to other kinds of objects for use as a method.
NOTE 2
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.5.5 Properties of String Instances
String instances inherit properties from the String prototype object and also have a [[Value]] property
and a length property.
The [[Value]] property is the string value represented by this String object.
15.5.5.1 length
The number of characters in the String value represented by this String object.
Once a String object is created, this property is unchanging. It has the attributes { DontEnum,
DontDelete, ReadOnly }.
15.6 Boolean Objects
15.6.1 The Boolean Constructor Called as a Function
When Boolean is called as a function rather than as a constructor, it performs a type conversion.
15.6.1.1 Boolean (value)
Returns a boolean value (not a Boolean object) computed by ToBoolean(value).
15.6.2 The Boolean Constructor
When Boolean is called as part of a new expression it is a constructor: it initialises the newly created
object.
15.6.2.1 new Boolean (value)
The [[Prototype]] property of the newly constructed object is set to the original Boolean prototype
object, the one that is the initial value of Boolean.prototype (15.6.3.1).
- 107 -
The [[Class]] property of the newly constructed Boolean object is set to "Boolean".
The [[Value]] property of the newly constructed Boolean object is set to ToBoolean(value).
15.6.3 Properties of the Boolean Constructor
The value of the internal [[Prototype]] property of the Boolean constructor is the Function prototype
object (15.3.4).
Besides the internal properties and the length property (whose value is 1), the Boolean constructor has
the following property:
15.6.3.1 Boolean.prototype
The initial value of Boolean.prototype is the Boolean prototype object (15.6.4).
This property has the attributes { DontEnum, DontDelete, ReadOnly }.
15.6.4 Properties of the Boolean Prototype Object
The Boolean prototype object is itself a Boolean object (its [[Class]] is "Boolean") whose value is
false.
The value of the internal [[Prototype]] property of the Boolean prototype object is the Object prototype
object (15.2.3.1).
In following descriptions of functions that are properties of the Boolean prototype object, the phrase
“this Boolean object” refers to the object that is the this value for the invocation of the function; a
TypeError exception is thrown if the this value is not an object for which the value of the internal
[[Class]] property is "Boolean". Also, the phrase “this boolean value” refers to the boolean value
represented by this Boolean object, that is, the value of the internal [[Value]] property of this Boolean
object.
15.6.4.1 Boolean.prototype.constructor
The initial value of Boolean.prototype.constructor is the built-in Boolean constructor.
15.6.4.2 Boolean.prototype.toString ( )
If this boolean value is true, then the string "true" is returned. Otherwise, this boolean value must
be false, and the string "false" is returned.
The toString function is not generic; it throws a TypeError exception if its this value is not a
Boolean object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
15.6.4.3 Boolean.prototype.valueOf ( )
Returns this boolean value.
The valueOf function is not generic; it throws a TypeError exception if its this value is not a
Boolean object. Therefore, it cannot be transferred to other kinds of objects for use as a method.
15.6.5 Properties of Boolean Instances
Boolean instances have no special properties beyond those inherited from the Boolean prototype object.
15.7 Number Objects
15.7.1 The Number Constructor Called as a Function
When Number is called as a function rather than as a constructor, it performs a type conversion.
15.7.1.1 Number ( [ value ] )
Returns a number value (not a Number object) computed by ToNumber(value) if value was supplied,
else returns +0.
15.7.2 The Number Constructor
When Number is called as part of a new expression it is a constructor: it initialises the newly created
object.
- 108 -
An implementation is permitted to extend the behaviour of toFixed for values of fractionDigits less
than 0 or greater than 20. In this case toFixed would not necessarily throw RangeError for such
values.
NOTE
The output of toFixed may be more precise than toString for some values because toString only
prints enough significant digits to distinguish the number from adjacent number values. For example,
(1000000000000000128).toString() returns "1000000000000000100", while
(1000000000000000128).toFixed(0) returns "1000000000000000128".
15.7.4.6 Number.prototype.toExponential (fractionDigits)
Return a string containing the number represented in exponential notation with one digit before the
significand's decimal point and fractionDigits digits after the significand's decimal point. If
fractionDigits is undefined, include as many significand digits as necessary to uniquely specify the
number (just like in ToString except that in this case the number is always output in exponential
notation). Specifically, perform the following steps:
1. Let x be this number value.
2. Let f be ToInteger(fractionDigits).
3. If x is NaN, return the string "NaN".
4. Let s be the empty string.
5. If x ≥ 0, go to step 8.
6. Let s be "-".
7. Let x = –x.
8. If x = +∞, let m = "Infinity" and go to step 30.
9. If fractionDigits is undefined, go to step 14.
10. If f < 0 or f > 20, throw a RangeError exception.
11. If x = 0, go to step 16.
12. Let e and n be integers such that 10 f ≤ n < 10 f+1 and for which the exact mathematical value of n ×
10 e–f – x is as close to zero as possible. If there are two such sets of e and n, pick the e and n for
which n × 10 e–f is larger.
13. Go to step 20.
14. If x ≠ 0, go to step 19.
15. Let f = 0.
16. Let m be the string consisting of f+1 occurrences of the character ‘0’.
17. Let e = 0.
18. Go to step 21.
19. Let e, n, and f be integers such that f ≥ 0, 10 f ≤ n < 10 f+1, the number value for n × 10 e–f is x, and f
is as small as possible. Note that the decimal representation of n has f+1 digits, n is not divisible
by 10, and the least significant digit of n is not necessarily uniquely determined by these criteria.
20. Let m be the string consisting of the digits of the decimal representation of n (in order, with no
leading zeroes).
21. If f = 0, go to step 24.
22. Let a be the first character of m, and let b be the remaining f characters of m.
23. Let m be the concatenation of the three strings a, ".", and b.
24. If e = 0, let c = "+" and d = "0" and go to step 29.
25. If e > 0, let c = "+" and go to step 28.
26. Let c = "-".
27. Let e = –e.
28. Let d be the string consisting of the digits of the decimal representation of e (in order, with no
leading zeroes).
29. Let m be the concatenation of the four strings m, "e", c, and d.
30. Return the concatenation of the strings s and m.
The length property of the toExponential method is 1.
If the toExponential method is called with more than one argument, then the behaviour is
undefined (see clause 15).
- 111 -
NOTE
The value of Math.floor(x) is the same as the value of -Math.ceil(-x).
15.8.2.10 log (x)
Returns an implementation-dependent approximation to the natural logarithm of x.
• If x is NaN, the result is NaN.
• If x is less than 0, the result is NaN.
• If x is +0 or −0, the result is −∞.
• If x is 1, the result is +0.
• If x is +∞, the result is +∞.
15.8.2.14 random ( )
Returns a number value with positive sign, greater than or equal to 0 but less than 1, chosen randomly
or pseudo randomly with approximately uniform distribution over that range, using an
implementation-dependent algorithm or strategy. This function takes no arguments.
15.8.2.15 round (x)
Returns the number value that is closest to x and is equal to a mathematical integer. If two integer
number values are equally close to x, then the result is the number value that is closer to +∞. If x is
already an integer, the result is x.
• If x is NaN, the result is NaN.
• If x is +0, the result is +0.
• If x is −0, the result is −0.
• If x is +∞, the result is +∞.
- 117 -
NOTE 1
Math.round(3.5) returns 4, but Math.round(–3.5) returns –3.
NOTE 2
The value of Math.round(x) is the same as the value of Math.floor(x+0.5), except when x is
− 0 or is less than 0 but greater than or equal to -0.5; for these cases Math.round(x) returns − 0,
but Math.floor(x+0.5) returns +0.
15.8.2.16 sin (x)
Returns an implementation-dependent approximation to the sine of x. The argument is expressed in
radians.
• If x is NaN, the result is NaN.
• If x is +0, the result is +0.
• If x is −0, the result is −0.
• If x is +∞ or −∞, the result is NaN.
1970 UTC. This gives a range of 8,640,000,000,000,000 milliseconds to either side of 01 January,
1970 UTC.
The exact moment of midnight at the beginning of 01 January, 1970 UTC is represented by the value
+0.
15.9.1.2 Day Number and Time within Day
A given time value t belongs to day number
Day(t) = floor(t / msPerDay)
All non-leap years have 365 days with the usual number of days per month and leap years have an
extra day in February. The day number of the first day of year y is given by:
YearFromTime(t) = the largest integer y (closest to positive infinity) such that TimeFromYear(y) ≤ t
The leap-year function is 1 for a time within a leap year and otherwise is zero:
DayWithinYear(t) = Day(t)−DayFromYear(YearFromTime(t))
A month value of 0 specifies January; 1 specifies February; 2 specifies March; 3 specifies April;
4 specifies May; 5 specifies June; 6 specifies July; 7 specifies August; 8 specifies September; 9
specifies October; 10 specifies November; and 11 specifies December. Note that MonthFromTime(0)
= 0, corresponding to Thursday, 01 January, 1970.
15.9.1.5 Date Number
A date number is identified by an integer in the range 1 through 31, inclusive. The mapping
DateFromTime(t) from a time value t to a month number is defined by:
DateFromTime(t) = DayWithinYear(t)+1 if MonthFromTime(t)=0
= DayWithinYear(t)−30 if MonthFromTime(t)=1
= DayWithinYear(t)−58−InLeapYear(t) if MonthFromTime(t)=2
= DayWithinYear(t)−89−InLeapYear(t) if MonthFromTime(t)=3
= DayWithinYear(t)−119−InLeapYear(t) if MonthFromTime(t)=4
= DayWithinYear(t)−150−InLeapYear(t) if MonthFromTime(t)=5
= DayWithinYear(t)−180−InLeapYear(t) if MonthFromTime(t)=6
= DayWithinYear(t)−211−InLeapYear(t) if MonthFromTime(t)=7
= DayWithinYear(t)−242−InLeapYear(t) if MonthFromTime(t)=8
= DayWithinYear(t)−272−InLeapYear(t) if MonthFromTime(t)=9
= DayWithinYear(t)−303−InLeapYear(t) if MonthFromTime(t)=10
= DayWithinYear(t)−333−InLeapYear(t) if MonthFromTime(t)=11
15.9.1.6 Week Day
The weekday for a particular time value t is defined as
t – TimeFromYear(YearFromTime(t))
(2) whether t is in a leap year
InLeapYear(t)
(3) the week day of the beginning of the year
WeekDay(TimeFromYear(YearFromTime(t))
- 120 -
1. If year is not finite or month is not finite or date is not finite, return NaN.
2. Call ToInteger(year).
3. Call ToInteger(month).
4. Call ToInteger(date).
5. Compute Result(2) + floor(Result(3)/12).
6. Compute Result(3) modulo 12.
7. Find a value t such that YearFromTime(t) == Result(5) and MonthFromTime(t) == Result(6) and
DateFromTime(t) == 1; but if this is not possible (because some argument is out of range), return
NaN.
8. Compute Day(Result(7)) + Result(4) − 1.
9. Return Result(8).
NOTE
The point of step 3 is that an implementation is permitted a choice of internal representations of time
values, for example as a 64-bit signed integer or as a 64-bit floating-point value. Depending on the
implementation, this internal representation may or may not distinguish − 0 and +0.
15.9.2 The Date Constructor Called as a Function
When Date is called as a function rather than as a constructor, it returns a string representing the
current time (UTC).
NOTE
The function call Date(…) is not equivalent to the object creation expression new Date(…) with the
same arguments.
15.9.2.1 Date ( [ year [, month [, date [, hours [, minutes [, seconds [, ms ] ] ] ] ] ] ] )
All of the arguments are optional; any arguments supplied are accepted but are completely ignored. A
string is created and returned as if by the expression (new Date()).toString().
15.9.3 The Date Constructor
When Date is called as part of a new expression, it is a constructor: it initialises the newly created
object.
15.9.3.1 new Date (year, month [, date [, hours [, minutes [, seconds [, ms ] ] ] ] ] )
When Date is called with two to seven arguments, it computes the date from year, month, and
(optionally) date, hours, minutes, seconds and ms.
The [[Prototype]] property of the newly constructed object is set to the original Date prototype object,
the one that is the initial value of Date.prototype (15.9.4.1).
The [[Class]] property of the newly constructed object is set to "Date".
- 122 -
Date.parse(x.toString())
Date.parse(x.toUTCString())
15.9.5.3 Date.prototype.toDateString ( )
This function returns a string value. The contents of the string are implementation-dependent, but are
intended to represent the “date” portion of the Date in the current time zone in a convenient, human-
readable form.
15.9.5.4 Date.prototype.toTimeString ( )
This function returns a string value. The contents of the string are implementation-dependent, but are
intended to represent the “time” portion of the Date in the current time zone in a convenient, human-
readable form.
15.9.5.5 Date.prototype.toLocaleString ( )
This function returns a string value. The contents of the string are implementation-dependent, but are
intended to represent the Date in the current time zone in a convenient, human-readable form that
corresponds to the conventions of the host environment’s current locale.
NOTE
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.9.5.6 Date.prototype.toLocaleDateString ( )
This function returns a string value. The contents of the string are implementation-dependent, but are
intended to represent the “date” portion of the Date in the current time zone in a convenient, human-
readable form that corresponds to the conventions of the host environment’s current locale.
NOTE
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.9.5.7 Date.prototype.toLocaleTimeString ( )
This function returns a string value. The contents of the string are implementation-dependent, but are
intended to represent the “time” portion of the Date in the current time zone in a convenient, human-
readable form that corresponds to the conventions of the host environment’s current locale.
NOTE
The first parameter to this function is likely to be used in a future version of this standard; it is
recommended that implementations do not use this parameter position for anything else.
15.9.5.8 Date.prototype.valueOf ( )
The valueOf function returns a number, which is this time value.
15.9.5.9 Date.prototype.getTime ( )
1. If the this value is not an object whose [[Class]] property is "Date", throw a TypeError
exception.
2. Return this time value.
15.9.5.10 Date.prototype.getFullYear ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return YearFromTime(LocalTime(t)).
15.9.5.11 Date.prototype.getUTCFullYear ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return YearFromTime(t).
15.9.5.12 Date.prototype.getMonth ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return MonthFromTime(LocalTime(t)).
- 125 -
15.9.5.13 Date.prototype.getUTCMonth ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return MonthFromTime(t).
15.9.5.14 Date.prototype.getDate ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return DateFromTime(LocalTime(t)).
15.9.5.15 Date.prototype.getUTCDate ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return DateFromTime(t).
15.9.5.16 Date.prototype.getDay ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return WeekDay(LocalTime(t)).
15.9.5.17 Date.prototype.getUTCDay ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return WeekDay(t).
15.9.5.18 Date.prototype.getHours ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return HourFromTime(LocalTime(t)).
15.9.5.19 Date.prototype.getUTCHours ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return HourFromTime(t).
15.9.5.20 Date.prototype.getMinutes ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return MinFromTime(LocalTime(t)).
15.9.5.21 Date.prototype.getUTCMinutes ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return MinFromTime(t).
15.9.5.22 Date.prototype.getSeconds ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return SecFromTime(LocalTime(t)).
15.9.5.23 Date.prototype.getUTCSeconds ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return SecFromTime(t).
- 126 -
15.9.5.24 Date.prototype.getMilliseconds ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return msFromTime(LocalTime(t)).
15.9.5.25 Date.prototype.getUTCMilliseconds ( )
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return msFromTime(t).
15.9.5.26 Date.prototype.getTimezoneOffset ( )
Returns the difference between local time and UTC time in minutes.
1. Let t be this time value.
2. If t is NaN, return NaN.
3. Return (t − LocalTime(t)) / msPerMinute.
If sec is not specified, this behaves as if sec were specified with the value getUTCSeconds( ).
If ms is not specified, this behaves as if ms were specified with the value getUTCMilliseconds( ).
1. Let t be this time value.
2. Call ToNumber(hour).
3. If min is not specified, compute MinFromTime(t); otherwise, call ToNumber(min).
4. If sec is not specified, compute SecFromTime(t); otherwise, call ToNumber(sec).
5. If ms is not specified, compute msFromTime(t); otherwise, call ToNumber(ms).
6. Compute MakeTime(Result(2), Result(3), Result(4), Result(5)).
7. Compute MakeDate(Day(t), Result(6)).
8. Set the [[Value]] property of the this value to TimeClip(Result(7)).
9. Return the value of the [[Value]] property of the this value.
If date is not specified, this behaves as if date were specified with the value getDate( ).
1. Let t be the result of LocalTime(this time value); but if this time value is NaN, let t be +0.
2. Call ToNumber(year).
3. If month is not specified, compute MonthFromTime(t); otherwise, call ToNumber(month).
4. If date is not specified, compute DateFromTime(t); otherwise, call ToNumber(date).
5. Compute MakeDay(Result(2), Result(3), Result(4)).
6. Compute UTC(MakeDate(Result(5), TimeWithinDay(t))).
7. Set the [[Value]] property of the this value to TimeClip(Result(6)).
8. Return the value of the [[Value]] property of the this value.
Syntax
Pattern ::
Disjunction
Disjunction ::
Alternative
Alternative | Disjunction
Alternative ::
[empty]
Alternative Term
- 130 -
Term ::
Assertion
Atom
Atom Quantifier
Assertion ::
^
$
\ b
\ B
Quantifier ::
QuantifierPrefix
QuantifierPrefix ?
QuantifierPrefix ::
*
+
?
{ DecimalDigits }
{ DecimalDigits , }
{ DecimalDigits , DecimalDigits }
Atom ::
PatternCharacter
.
\ AtomEscape
CharacterClass
( Disjunction )
( ? : Disjunction )
( ? = Disjunction )
( ? ! Disjunction )
AtomEscape ::
DecimalEscape
CharacterEscape
CharacterClassEscape
CharacterEscape ::
ControlEscape
c ControlLetter
HexEscapeSequence
UnicodeEscapeSequence
IdentityEscape
ControlEscape :: one of
f n r t v
ControlLetter :: one of
a b c d e f g h i j k l m n o p q r s t u v w x y z
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
IdentityEscape ::
SourceCharacter but not IdentifierPart
- 131 -
DecimalEscape ::
DecimalIntegerLiteral [lookahead ∉ DecimalDigit]
CharacterClassEscape :: one of
d D s S w W
CharacterClass ::
[ [lookahead ∉ {^}] ClassRanges ]
[ ^ ClassRanges ]
ClassRanges ::
[empty]
NonemptyClassRanges
NonemptyClassRanges ::
ClassAtom
ClassAtom NonemptyClassRangesNoDash
ClassAtom - ClassAtom ClassRanges
NonemptyClassRangesNoDash ::
ClassAtom
ClassAtomNoDash NonemptyClassRangesNoDash
ClassAtomNoDash - ClassAtom ClassRanges
ClassAtom ::
-
ClassAtomNoDash
ClassAtomNoDash ::
SourceCharacter but not one of \ ] -
\ ClassEscape
ClassEscape ::
DecimalEscape
b
CharacterEscape
CharacterClassEscape
Furthermore, the descriptions below use the following internal data structures:
• A CharSet is a mathematical set of characters.
- 132 -
• A State is an ordered pair (endIndex, captures) where endIndex is an integer and captures is an
internal array of NCapturingParens values. States are used to represent partial match states in the
regular expression matching algorithms. The endIndex is one plus the index of the last input
character matched so far by the pattern, while captures holds the results of capturing parentheses.
The nth element of captures is either a string that represents the value obtained by the nth set of
capturing parentheses or undefined if the nth set of capturing parentheses hasn't been reached yet.
Due to backtracking, many states may be in use at any time during the matching process.
• A MatchResult is either a State or the special token failure that indicates that the match failed.
• A Continuation function is an internal closure (i.e. an internal function with some arguments
already bound to values) that takes one State argument and returns a MatchResult result. If an
internal closure references variables bound in the function that creates the closure, the closure uses
the values that these variables had at the time the closure was created. The continuation attempts to
match the remaining portion (specified by the closure's already-bound arguments) of the pattern
against the input string, starting at the intermediate state given by its State argument. If the match
succeeds, the continuation returns the final State that it reached; if the match fails, the continuation
returns failure.
• A Matcher function is an internal closure that takes two arguments -- a State and a Continuation --
and returns a MatchResult result. The matcher attempts to match a middle subpattern (specified by
the closure's already-bound arguments) of the pattern against the input string, starting at the
intermediate state given by its State argument. The Continuation argument should be a closure that
matches the rest of the pattern. After matching the subpattern of a pattern to obtain a new State, the
matcher then calls Continuation on that state to test if the rest of the pattern can match as well. If it
can, the matcher returns the state returned by the continuation; if not, the matcher may try different
choices at its choice points, repeatedly calling Continuation until it either succeeds or all
possibilities have been exhausted.
• An AssertionTester function is an internal closure that takes a State argument and returns a boolean
result. The assertion tester tests a specific condition (specified by the closure's already-bound
arguments) against the current place in the input string and returns true if the condition matched or
false if not.
• An EscapeValue is either a character or an integer. An EscapeValue is used to denote the
interpretation of a DecimalEscape escape sequence: a character ch means that the escape sequence
is interpreted as the character ch, while an integer n means that the escape sequence is interpreted
as a backreference to the nth set of capturing parentheses.
15.10.2.2 Pattern
The production Pattern :: Disjunction evaluates as follows:
1. Evaluate Disjunction to obtain a Matcher m.
2. Return an internal closure that takes two arguments, a string str and an integer index, and
performs the following:
1. Let Input be the given string str. This variable will be used throughout the functions in
15.10.2.
2. Let InputLength be the length of Input. This variable will be used throughout the functions in
15.10.2.
3. Let c be a Continuation that always returns its State argument as a successful MatchResult.
4. Let cap be an internal array of NCapturingParens undefined values, indexed 1 through
NCapturingParens.
5. Let x be the State (index, cap).
6. Call m(x, c) and return its result.
an exception (except for any host-defined exceptions that can occur anywhere such as out-of-
memory).
15.10.2.3 Disjunction
The production Disjunction :: Alternative evaluates by evaluating Alternative to obtain a Matcher and
returning that Matcher.
The production Disjunction :: Alternative | Disjunction evaluates as follows:
1. Evaluate Alternative to obtain a Matcher m1.
2. Evaluate Disjunction to obtain a Matcher m2.
3. Return an internal Matcher closure that takes two arguments, a State x and a Continuation c, and
performs the following:
1. Call m1(x, c) and let r be its result.
2. If r isn't failure, return r.
3. Call m2(x, c) and return its result.
Informative comments: The | regular expression operator separates two alternatives. The pattern
first tries to match the left Alternative (followed by the sequel of the regular expression); if it fails, it
tries to match the right Disjunction (followed by the sequel of the regular expression). If the left
Alternative, the right Disjunction, and the sequel all have choice points, all choices in the sequel are
tried before moving on to the next choice in the left Alternative. If choices in the left Alternative are
exhausted, the right Disjunction is tried instead of the left Alternative. Any capturing parentheses
inside a portion of the pattern skipped by | produce undefined values instead of strings. Thus, for
example,
/a|ab/.exec("abc")
returns the result "a" and not "ab". Moreover,
/((a)|(ab))((c)|(bc))/.exec("abc")
returns the array
["abc", "a", "a", undefined, "bc", undefined, "bc"]
and not
["abc", "ab", undefined, "ab", "c", "c", undefined]
15.10.2.4 Alternative
The production Alternative :: [empty] evaluates by returning a Matcher that takes two arguments, a
State x and a Continuation c, and returns the result of calling c(x).
The production Alternative :: Alternative Term evaluates as follows:
1. Evaluate Alternative to obtain a Matcher m1.
2. Evaluate Term to obtain a Matcher m2.
3. Return an internal Matcher closure that takes two arguments, a State x and a Continuation c, and
performs the following:
1. Create a Continuation d that takes a State argument y and returns the result of calling m2(y,
c).
2. Call m1(x, d) and return its result.
Informative comments: Consecutive Terms try to simultaneously match consecutive portions of the
input string. If the left Alternative, the right Term, and the sequel of the regular expression all have
choice points, all choices in the sequel are tried before moving on to the next choice in the right Term,
and all choices in the right Term are tried before moving on to the next choice in the left Alternative.
15.10.2.5 Term
The production Term :: Assertion evaluates by returning an internal Matcher closure that takes two
arguments, a State x and a Continuation c, and performs the following:
- 134 -
The production Term :: Atom evaluates by evaluating Atom to obtain a Matcher and returning that
Matcher.
The production Term :: Atom Quantifier evaluates as follows:
1. Evaluate Atom to obtain a Matcher m.
2. Evaluate Quantifier to obtain the three results: an integer min, an integer (or ∞) max, and boolean
greedy.
3. If max is finite and less than min, then throw a SyntaxError exception.
4. Let parenIndex be the number of left capturing parentheses in the entire regular expression that
occur to the left of this production expansion's Term. This is the total number of times the Atom ::
( Disjunction ) production is expanded prior to this production's Term plus the total number of
Atom :: ( Disjunction ) productions enclosing this Term.
5. Let parenCount be the number of left capturing parentheses in the expansion of this production's
Atom. This is the total number of Atom :: ( Disjunction ) productions enclosed by this
production's Atom.
6. Return an internal Matcher closure that takes two arguments, a State x and a Continuation c, and
performs the following:
1. Call RepeatMatcher(m, min, max, greedy, x, c, parenIndex, parenCount) and return its result.
The internal helper function RepeatMatcher takes eight parameters, a Matcher m, an integer min, an
integer (or ∞) max, a boolean greedy, a State x, a Continuation c, an integer parenIndex, and an
integer parenCount, and performs the following:
1. If max is zero, then call c(x) and return its result.
2. Create an internal Continuation closure d that takes one State argument y and performs the
following:
1. If min is zero and y's endIndex is equal to x's endIndex, then return failure.
2. If min is zero then let min2 be zero; otherwise let min2 be min–1.
3. If max is ∞, then let max2 be ∞; otherwise let max2 be max–1.
4. Call RepeatMatcher(m, min2, max2, greedy, y, c, parenIndex, parenCount) and return its
result.
3. Let cap be a fresh copy of x's captures internal array.
4. For every integer k that satisfies parenIndex < k and k ≤ parenIndex+parenCount, set cap[k] to
undefined.
5. Let e be x's endIndex.
6. Let xr be the State (e, cap).
7. If min is not zero, then call m(xr, d) and return its result.
8. If greedy is true, then go to step 12.
9. Call c(x) and let z be its result.
10. If z is not failure, return z.
11. Call m(xr, d) and return its result.
12. Call m(xr, d) and let z be its result.
13. If z is not failure, return z.
14. Call c(x) and return its result.
Informative comments: An Atom followed by a Quantifier is repeated the number of times specified
by the Quantifier. A quantifier can be non-greedy, in which case the Atom pattern is repeated as few
times as possible while still matching the sequel, or it can be greedy, in which case the Atom pattern is
repeated as many times as possible while still matching the sequel. The Atom pattern is repeated rather
than the input string that it matches, so different repetitions of the Atom can match different input
substrings.
- 135 -
If the Atom and the sequel of the regular expression all have choice points, the Atom is first matched
as many (or as few, if non-greedy) times as possible. All choices in the sequel are tried before moving
on to the next choice in the last repetition of Atom. All choices in the last (n th) repetition of Atom are
tried before moving on to the next choice in the next-to-last (n–1) st repetition of Atom; at which point
it may turn out that more or fewer repetitions of Atom are now possible; these are exhausted (again,
starting with either as few or as many as possible) before moving on to the next choice in the (n-1) st
repetition of Atom and so on.
Compare
/a[a-z]{2,4}/.exec("abcdefghi")
which returns "abcde" with
/a[a-z]{2,4}?/.exec("abcdefghi")
which returns "abc".
Consider also
/(aa|aabaac|ba|b|c)*/.exec("aabaac")
which, by the choice point ordering above, returns the array
["aaba", "ba"]
and not any of:
["aabaac", "aabaac"]
["aabaac", "c"]
The above ordering of choice points can be used to write a regular expression that calculates the
greatest common divisor of two numbers (represented in unary notation). The following example
calculates the gcd of 10 and 15:
"aaaaaaaaaa,aaaaaaaaaaaaaaa".replace(/^(a+)\1*,\1+$/,"$1")
which returns the gcd in unary notation "aaaaa".
Step 4 of the RepeatMatcher clears Atom's captures each time Atom is repeated. We can see its
behaviour in the regular expression
/(z)((a+)?(b+)?(c))*/.exec("zaacbbbcac")
which returns the array
["zaacbbbcac", "z", "ac", "a", undefined, "c"]
and not
["zaacbbbcac", "z", "ac", "a", "bbb", "c"]
because each iteration of the outermost * clears all captured strings contained in the quantified Atom,
which in this case includes capture strings numbered 2, 3, and 4.
Step 1 of the RepeatMatcher's closure d states that, once the minimum number of repetitions has been
satisfied, any more expansions of Atom that match the empty string are not considered for further
repetitions. This prevents the regular expression engine from falling into an infinite loop on patterns
such as:
/(a*)*/.exec("b")
or the slightly more complicated:
/(a*)b\1+/.exec("baaaac")
which returns the array
["b", ""]
- 136 -
15.10.2.6 Assertion
The production Assertion :: ^ evaluates by returning an internal AssertionTester closure that takes a
State argument x and performs the following:
1. Let e be x's endIndex.
2. If e is zero, return true.
3. If Multiline is false, return false.
4. If the character Input[e–1] is one of the line terminator characters <LF>, <CR>, <LS>, or <PS>,
return true.
5. Return false.
The production Assertion :: $ evaluates by returning an internal AssertionTester closure that takes a
State argument x and performs the following:
1. Let e be x's endIndex.
2. If e is equal to InputLength, return true.
3. If multiline is false, return false.
4. If the character Input[e] is one of the line terminator characters <LF>, <CR>, <LS>, or <PS>,
return true.
5. Return false.
The production Assertion :: \ b evaluates by returning an internal AssertionTester closure that takes
a State argument x and performs the following:
1. Let e be x's endIndex.
2. Call IsWordChar(e–1) and let a be the boolean result.
3. Call IsWordChar(e) and let b be the boolean result.
4. If a is true and b is false, return true.
5. If a is false and b is true, return true.
6. Return false.
The production Assertion :: \ B evaluates by returning an internal AssertionTester closure that takes
a State argument x and performs the following:
1. Let e be x's endIndex.
2. Call IsWordChar(e–1) and let a be the boolean result.
3. Call IsWordChar(e) and let b be the boolean result.
4. If a is true and b is false, return false.
5. If a is false and b is true, return false.
6. Return true.
The internal helper function IsWordChar takes an integer parameter e and performs the following:
1. If e == –1 or e == InputLength, return false.
2. Let c be the character Input[e].
3. If c is one of the sixty-three characters in the table below, return true.
a b c d e f g h i j k l m n o p q r s t u v w x y z
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
0 1 2 3 4 5 6 7 8 9 _
4. Return false.
15.10.2.7 Quantifier
The production Quantifier :: QuantifierPrefix evaluates as follows:
1. Evaluate QuantifierPrefix to obtain the two results: an integer min and an integer (or ∞) max.
2. Return the three results min , max, and true.
- 137 -
15.10.2.8 Atom
The production Atom :: PatternCharacter evaluates as follows:
1. Let ch be the character represented by PatternCharacter.
2. Let A be a one-element CharSet containing the character ch.
3. Call CharacterSetMatcher(A, false) and return its Matcher result.
The production Atom :: \ AtomEscape evaluates by evaluating AtomEscape to obtain a Matcher and
returning that Matcher.
The production Atom :: CharacterClass evaluates as follows:
1. Evaluate CharacterClass to obtain a CharSet A and a boolean invert.
2. Call CharacterSetMatcher(A, invert) and return its Matcher result.
4. Let s be a fresh string whose characters are the characters of Input at positions xe
(inclusive) through ye (exclusive).
5. Set cap[parenIndex+1] to s.
6. Let z be the State (ye, cap).
7. Call c(z) and return its result.
2. Call m(x, d) and return its result.
The production Atom :: ( ? : Disjunction ) evaluates by evaluating Disjunction to obtain a
Matcher and returning that Matcher.
The production Atom :: ( ? = Disjunction ) evaluates as follows:
1. Evaluate Disjunction to obtain a Matcher m.
2. Return an internal Matcher closure that takes two arguments, a State x and a Continuation c, and
performs the following:
1. Let d be a Continuation that always returns its State argument as a successful MatchResult.
2. Call m(x, d) and let r be its result.
3. If r is failure, return failure.
4. Let y be r's State.
5. Let cap be y's captures internal array.
6. Let xe be x's endIndex.
7. Let z be the State (xe, cap).
8. Call c(z) and return its result.
The internal helper function CharacterSetMatcher takes two arguments, a CharSet A and a boolean
flag invert, and performs the following:
1. Return an internal Matcher closure that takes two arguments, a State x and a Continuation c, and
performs the following:
1. Let e be x's endIndex.
2. If e == InputLength, return failure.
3. Let c be the character Input[e].
4. Let cc be the result of Canonicalize(c).
5. If invert is true, go to step 8.
6. If there does not exist a member a of set A such that Canonicalize(a) == cc, then return
failure.
7. Go to step 9.
8. If there exists a member a of set A such that Canonicalize(a) == cc, then return failure.
9. Let cap be x's captures internal array.
10. Let y be the State (e+1, cap).
11. Call c(y) and return its result.
The internal helper function Canonicalize takes a character parameter ch and performs the following:
1. If IgnoreCase is false, return ch.
2. Let u be ch converted to upper case as if by calling String.prototype.toUpperCase on
the one-character string ch.
3. If u does not consist of a single character, return ch.
4. Let cu be u's character.
- 139 -
5. If ch's code point value is greater than or equal to decimal 128 and cu's code point value is less
than decimal 128, then return ch.
6. Return cu.
Informative comments: Parentheses of the form ( Disjunction ) serve both to group the
components of the Disjunction pattern together and to save the result of the match. The result can be
used either in a backreference (\ followed by a nonzero decimal number), referenced in a replace
string, or returned as part of an array from the regular expression matching function. To inhibit the
capturing behaviour of parentheses, use the form (?: Disjunction ) instead.
The form (?= Disjunction ) specifies a zero-width positive lookahead. In order for it to succeed,
the pattern inside Disjunction must match at the current position, but the current position is not
advanced before matching the sequel. If Disjunction can match at the current position in several ways,
only the first one is tried. Unlike other regular expression operators, there is no backtracking into a
(?= form (this unusual behaviour is inherited from Perl). This only matters when the Disjunction
contains capturing parentheses and the sequel of the pattern contains backreferences to those captures.
For example,
/(?=(a+))/.exec("baaabac")
matches the empty string immediately after the first b and therefore returns the array:
["", "aaa"]
To illustrate the lack of backtracking into the lookahead, consider:
/(?=(a+))a*b\1/.exec("baaabac")
This expression returns
["aba", "a"]
and not:
["aaaba", "a"]
The form (?! Disjunction ) specifies a zero-width negative lookahead. In order for it to succeed,
the pattern inside Disjunction must fail to match at the current position. The current position is not
advanced before matching the sequel. Disjunction can contain capturing parentheses, but
backreferences to them only make sense from within Disjunction itself. Backreferences to these
capturing parentheses from elsewhere in the pattern always return undefined because the negative
lookahead must fail for the pattern to succeed. For example,
/(.*?)a(?!(a+)b\2c)\2(.*)/.exec("baaabaac")
looks for an a not immediately followed by some positive number n of a's, a b, another n a's
(specified by the first \2) and a c. The second \2 is outside the negative lookahead, so it matches
against undefined and therefore always succeeds. The whole expression returns the array:
["baaabaac", "ba", undefined, "abaac"]
In case-insignificant matches all characters are implicitly converted to upper case immediately before
they are compared. However, if converting a character to upper case would expand that character into
more than one character (such as converting "ß" (\u00DF) into "SS"), then the character is left as-
is instead. The character is also left as-is if it is not an ASCII character but converting it to upper case
would make it into an ASCII character. This prevents Unicode characters such as \u0131 and
\u017F from matching regular expressions such as /[a-z]/i, which are only intended to match
ASCII letters. Furthermore, if these conversions were allowed, then /[^\W]/i would match each of
a, b, …, h, but not i or s.
15.10.2.9 AtomEscape
The production AtomEscape :: DecimalEscape evaluates as follows:
1. Evaluate DecimalEscape to obtain an EscapeValue E.
2. If E is not a character then go to step 6.
- 140 -
Informative comments: An escape sequence of the form \ followed by a nonzero decimal number n
matches the result of the nth set of capturing parentheses (see 15.10.2.11). It is an error if the regular
expression has fewer than n capturing parentheses. If the regular expression has n or more capturing
parentheses but the nth one is undefined because it hasn't captured anything, then the backreference
always succeeds.
15.10.2.10 CharacterEscape
The production CharacterEscape :: ControlEscape evaluates by returning the character according to
the table below:
ControlEscape Unicode Value Name Symbol
t \u0009 horizontal tab <HT>
n \u000A line feed (new line) <LF>
v \u000B vertical tab <VT>
f \u000C form feed <FF>
r \u000D carriage return <CR>
The internal helper function CharacterRange takes two CharSet parameters A and B and performs the
following:
1. If A does not contain exactly one character or B does not contain exactly one character then throw
a SyntaxError exception.
2. Let a be the one character in CharSet A.
3. Let b be the one character in CharSet B.
4. Let i be the code point value of character a.
5. Let j be the code point value of character b.
6. If I > j then throw a SyntaxError exception.
7. Return the set containing all characters numbered i through j, inclusive.
15.10.2.16 NonemptyClassRangesNoDash
The production NonemptyClassRangesNoDash :: ClassAtom evaluates by evaluating ClassAtom to
obtain a CharSet and returning that CharSet.
The production NonemptyClassRangesNoDash :: ClassAtomNoDash NonemptyClassRangesNoDash
evaluates as follows:
1. Evaluate ClassAtomNoDash to obtain a CharSet A.
2. Evaluate NonemptyClassRangesNoDash to obtain a CharSet B.
3. Return the union of CharSets A and B.
Informative comments: ClassRanges can expand into single ClassAtoms and/or ranges of two
ClassAtoms separated by dashes. In the latter case the ClassRanges includes all characters between
the first ClassAtom and the second ClassAtom, inclusive; an error occurs if either ClassAtom does not
represent a single character (for example, if one is \w) or if the first ClassAtom's code point value is
greater than the second ClassAtom's code point value.
Even if the pattern ignores case, the case of the two ends of a range is significant in determining
which characters belong to the range. Thus, for example, the pattern /[E-F]/i matches only the
letters E, F, e, and f, while the pattern /[E-f]/i matches all upper and lower-case ASCII letters as
well as the symbols [, \, ], ^, _, and `.
A - character can be treated literally or it can denote a range. It is treated literally if it is the first or
last character of ClassRanges, the beginning or end limit of a range specification, or immediately
follows a range specification.
15.10.2.17 ClassAtom
The production ClassAtom :: - evaluates by returning the CharSet containing the one character -.
- 143 -
The production ClassEscape :: b evaluates by returning the CharSet containing the one character
<BS> (Unicode value 0008).
The production ClassEscape :: CharacterEscape evaluates by evaluating CharacterEscape to obtain a
character and returning a one-element CharSet containing that character.
The production ClassEscape :: CharacterClassEscape evaluates by evaluating CharacterClassEscape
to obtain a CharSet and returning that CharSet.
Informative comments: A ClassAtom can use any of the escape sequences that are allowed in the rest
of the regular expression except for \b, \B, and backreferences. Inside a CharacterClass, \b means
the backspace character, while \B and backreferences raise errors. Using a backreference inside a
ClassAtom causes an error.
15.10.3 The RegExp Constructor Called as a Function
15.10.3.1 RegExp(pattern, flags)
If pattern is an object R whose [[Class]] property is "RegExp" and flags is undefined, then return R
unchanged. Otherwise call the RegExp constructor (15.10.4.1), passing it the pattern and flags
arguments and return the object constructed by that constructor.
15.10.4 The RegExp Constructor
When RegExp is called as part of a new expression, it is a constructor: it initialises the newly created
object.
15.10.4.1 new RegExp(pattern, flags)
If pattern is an object R whose [[Class]] property is "RegExp" and flags is undefined, then let P be
the pattern used to construct R and let F be the flags used to construct R. If pattern is an object R
whose [[Class]] property is "RegExp" and flags is not undefined, then throw a TypeError
exception. Otherwise, let P be the empty string if pattern is undefined and ToString(pattern)
otherwise, and let F be the empty string if flags is undefined and ToString(flags) otherwise.
The global property of the newly constructed object is set to a Boolean value that is true if F
contains the character “g” and false otherwise.
The ignoreCase property of the newly constructed object is set to a Boolean value that is true if F
contains the character “i” and false otherwise.
The multiline property of the newly constructed object is set to a Boolean value that is true if F
contains the character “m” and false otherwise.
If F contains any character other than “g”, “i”, or “m”, or if it contains the same one more than
once, then throw a SyntaxError exception.
- 144 -
If P's characters do not have the form Pattern, then throw a SyntaxError exception. Otherwise let the
newly constructed object have a [[Match]] property obtained by evaluating ("compiling") Pattern.
Note that evaluating Pattern may throw a SyntaxError exception. (Note: if pattern is a StringLiteral,
the usual escape sequence substitutions are performed before the string is processed by RegExp. If
pattern must contain an escape sequence to be recognised by RegExp, the “\” character must be
escaped within the StringLiteral to prevent its being removed when the contents of the StringLiteral
are formed.)
The source property of the newly constructed object is set to an implementation-defined string
value in the form of a Pattern based on P.
The lastIndex property of the newly constructed object is set to 0.
The [[Prototype]] property of the newly constructed object is set to the original RegExp prototype
object, the one that is the initial value of RegExp.prototype.
The [[Class]] property of the newly constructed object is set to "RegExp".
15.10.5 Properties of the RegExp Constructor
The value of the internal [[Prototype]] property of the RegExp constructor is the Function prototype
object (15.3.4).
Besides the internal properties and the length property (whose value is 2), the RegExp constructor has
the following properties:
15.10.5.1 RegExp.prototype
The initial value of RegExp.prototype is the RegExp prototype object (15.10.6).
This property shall have the attributes { DontEnum, DontDelete, ReadOnly }.
15.10.6 Properties of the RegExp Prototype Object
The value of the internal [[Prototype]] property of the RegExp prototype object is the Object prototype.
The value of the internal [[Class]] property of the RegExp prototype object is "Object".
The RegExp prototype object does not have a valueOf property of its own; however, it inherits the
valueOf property from the Object prototype object.
In the following descriptions of functions that are properties of the RegExp prototype object, the phrase
“this RegExp object” refers to the object that is the this value for the invocation of the function; a
TypeError exception is thrown if the this value is not an object for which the value of the internal
[[Class]] property is "RegExp".
15.10.6.1 RegExp.prototype.constructor
The initial value of RegExp.prototype.constructor is the built-in RegExp constructor.
15.10.6.2 RegExp.prototype.exec(string)
Performs a regular expression match of string against the regular expression and returns an Array
object containing the results of the match, or null if the string did not match
The string ToString(string) is searched for an occurrence of the regular expression pattern as follows:
1. Let S be the value of ToString(string).
2. Let length be the length of S.
3. Let lastIndex be the value of the lastIndex property.
4. Let i be the value of ToInteger(lastIndex).
5. If the global property is false, let i = 0.
6. If I < 0 or I > length then set lastIndex to 0 and return null.
7. Call [[Match]], giving it the arguments S and i. If [[Match]] returned failure, go to step 8;
otherwise let r be its State result and go to step 10.
8. Let i = i+1.
9. Go to step 6.
10. Let e be r's endIndex value.
- 145 -
15.10.6.3 RegExp.prototype.test(string)
Equivalent to the expression RegExp.prototype.exec(string) != null.
15.10.6.4 RegExp.prototype.toString()
Let src be a string in the form of a Pattern representing the current regular expression. src may or
may not be identical to the source property or to the source code supplied to the RegExp
constructor; however, if src were supplied to the RegExp constructor along with the current regular
expression's flags, the resulting regular expression must behave identically to the current regular
expression.
toString returns a string value formed by concatenating the strings "/", src, and "/"; plus "g" if
the global property is true, "i" if the ignoreCase property is true, and "m" if the multiline
property is true.
NOTE
An implementation may choose to take advantage of src being allowed to be different from the source
passed to the RegExp constructor to escape special characters in src. For example, in the regular
expression obtained from new RegExp("/"), src could be, among other possibilities, "/" or
"\/". The latter would permit the entire result ("/\//") of the toString call to have the form
RegularExpressionLiteral.
15.10.7 Properties of RegExp Instances
RegExp instances inherit properties from their [[Prototype]] object as specified above and also have the
following properties.
15.10.7.1 source
The value of the source property is string in the form of a Pattern representing the current regular
expression. This property shall have the attributes { DontDelete, ReadOnly, DontEnum }.
15.10.7.2 global
The value of the global property is a Boolean value indicating whether the flags contained the
character “g”. This property shall have the attributes { DontDelete, ReadOnly, DontEnum }.
15.10.7.3 ignoreCase
The value of the ignoreCase property is a Boolean value indicating whether the flags contained the
character “i”. This property shall have the attributes { DontDelete, ReadOnly, DontEnum }.
15.10.7.4 multiline
The value of the multiline property is a Boolean value indicating whether the flags contained the
character “m”. This property shall have the attributes { DontDelete, ReadOnly, DontEnum }.
15.10.7.5 lastIndex
The value of the lastIndex property is an integer that specifies the string position at which to start
the next match. This property shall have the attributes { DontDelete, DontEnum }.
- 146 -
If the argument message is not undefined, the message property of the newly constructed object is
set to ToString(message).
15.11.7.5 Properties of the NativeError Constructors
The value of the internal [[Prototype]] property of a NativeError constructor is the Function prototype
object (15.3.4).
Besides the internal properties and the length property (whose value is 1), each NativeError
constructor has the following property:
15.11.7.6 NativeError.prototype
The initial value of NativeError.prototype is a NativeError prototype object (15.11.7.7). Each
NativeError constructor has a separate prototype object.
This property has the attributes { DontEnum, DontDelete, ReadOnly }.
15.11.7.7 Properties of the NativeError Prototype Objects
Each NativeError prototype object is an Error object (its [[Class]] is "Error").
The value of the internal [[Prototype]] property of each NativeError prototype object is the Error
prototype object (15.11.4).
15.11.7.8 NativeError.prototype.constructor
The initial value of the constructor property of the prototype for a given NativeError constructor
is the NativeError constructor function itself (15.11.7).
15.11.7.9 NativeError.prototype.name
The initial value of the name property of the prototype for a given NativeError constructor is the
name of the constructor (the name used instead of NativeError).
15.11.7.10 NativeError.prototype.message
The initial value of the message property of the prototype for a given NativeError constructor is an
implementation-defined string.
NOTE
The prototypes for the NativeError constructors do not themselves provide a toString function,
but instances of errors will inherit it from the Error prototype object.
15.11.7.11 Properties of NativeError Instances
NativeError instances have no special properties beyond those inherited from the Error prototype
object.
- 149 -
16 Errors
An implementation should report runtime errors at the time the relevant language construct is evaluated. An
implementation may report syntax errors in the program at the time the program is read in, or it may, at its
option, defer reporting syntax errors until the relevant statement is reached. An implementation may report
syntax errors in eval code at the time eval is called, or it may, at its option, defer reporting syntax errors
until the relevant statement is reached.
An implementation may treat any instance of the following kinds of runtime errors as a syntax error and
therefore report it early:
• Improper uses of return, break, and continue.
• Using the eval property other than via a direct call.
• Errors in regular expression literals.
• Attempts to call PutValue on a value that is not a reference (for example, executing the assignment
statement 3=4).
An implementation shall not report other kinds of runtime errors early even if the compiler can prove that a
construct cannot execute without error under any circumstances. An implementation may issue an early
warning in such a case, but it should not report the error until the relevant construct is actually executed.
An implementation shall report all errors as specified, except for the following:
• An implementation may extend program and regular expression syntax. To permit this, all operations (such
as calling eval, using a regular expression literal, or using the Function or RegExp constructor) that
are allowed to throw SyntaxError are permitted to exhibit implementation-defined behaviour instead of
throwing SyntaxError when they encounter an implementation-defined extension to the program or regular
expression syntax.
• An implementation may provide additional types, values, objects, properties, and functions beyond those
described in this specification. This may cause constructs (such as looking up a variable in the global
scope) to have implementation-defined behaviour instead of throwing an error (such as ReferenceError).
• An implementation is not required to detect EvalError. If it chooses not to detect EvalError, the
implementation must allow eval to be used indirectly and/or allow assignments to eval.
• An implementation may define behaviour other than throwing RangeError for toFixed,
toExponential, and toPrecision when the fractionDigits or precision argument is outside the
specified range.
- 150 -
- 151 -
Annex A
(informative)
Grammar Summary
A.3 Expressions
PrimaryExpression : See 11.1
this
Identifier
Literal
ArrayLiteral
ObjectLiteral
( Expression )
- 158 -
A.4 Statements
Statement : See clause 12
Block
VariableStatement
EmptyStatement
ExpressionStatement
IfStatement
IterationStatement
ContinueStatement
BreakStatement
ReturnStatement
WithStatement
LabelledStatement
SwitchStatement
ThrowStatement
TryStatement
Annex B
(informative)
Compatibility
Syntax
NumericLiteral ::
DecimalLiteral
HexIntegerLiteral
OctalIntegerLiteral
OctalIntegerLiteral ::
0 OctalDigit
OctalIntegerLiteral OctalDigit
Semantics
• The MV of NumericLiteral :: OctalIntegerLiteral is the MV of OctalIntegerLiteral.
• The MV of OctalDigit :: 0 is 0.
• The MV of OctalDigit :: 1 is 1.
• The MV of OctalDigit :: 2 is 2.
• The MV of OctalDigit :: 3 is 3.
• The MV of OctalDigit :: 4 is 4.
• The MV of OctalDigit :: 5 is 5.
• The MV of OctalDigit :: 6 is 6.
• The MV of OctalDigit :: 7 is 7.
• The MV of OctalIntegerLiteral :: 0 OctalDigit is the MV of OctalDigit.
• The MV of OctalIntegerLiteral :: OctalIntegerLiteral OctalDigit is (the MV of OctalIntegerLiteral times 8) plus
the MV of OctalDigit.
B.1.2 String Literals
The syntax and semantics of 7.8.4 can be extended as follows:
Syntax
EscapeSequence ::
CharacterEscapeSequence
OctalEscapeSequence
HexEscapeSequence
UnicodeEscapeSequence
- 170 -
OctalEscapeSequence ::
OctalDigit [lookahead ∉ DecimalDigit]
ZeroToThree OctalDigit [lookahead ∉ DecimalDigit]
FourToSeven OctalDigit
ZeroToThree OctalDigit OctalDigit
ZeroToThree :: one of
0 1 2 3
FourToSeven :: one of
4 5 6 7
Semantics
• The CV of EscapeSequence :: OctalEscapeSequence is the CV of the OctalEscapeSequence.
• The CV of OctalEscapeSequence :: OctalDigit [lookahead ∉ DecimalDigit] is the character whose code point value is
the MV of the OctalDigit.
• The CV of OctalEscapeSequence :: ZeroToThree OctalDigit [lookahead ∉ DecimalDigit] is the character whose code
point value is (8 times the MV of the ZeroToThree) plus the MV of the OctalDigit.
• The CV of OctalEscapeSequence :: FourToSeven OctalDigit is the character whose code point value is (8 times
the MV of the FourToSeven) plus the MV of the OctalDigit.
• The CV of OctalEscapeSequence :: ZeroToThree OctalDigit OctalDigit is the character whose code point value
is (64 (that is, 82) times the MV of the ZeroToThree) plus (8 times the MV of the first OctalDigit) plus the MV
of the second OctalDigit.
• The MV of ZeroToThree :: 0 is 0.
• The MV of ZeroToThree :: 1 is 1.
• The MV of ZeroToThree :: 2 is 2.
• The MV of ZeroToThree :: 3 is 3.
• The MV of FourToSeven :: 4 is 4.
• The MV of FourToSeven :: 5 is 5.
• The MV of FourToSeven :: 6 is 6.
• The MV of FourToSeven :: 7 is 7.
./”
then go to step 13.
8. If Result(6), is less than 256, go to step 11.
9. Let S be a string containing six characters “%uwxyz” where wxyz are four hexadecimal digits encoding
the value of Result(6).
10. Go to step 14.
11. Let S be a string containing three characters “%xy” where xy are two hexadecimal digits encoding the
value of Result(6).
12. Go to step 14.
13. Let S be a string containing the single character Result(6).
14. Let R be a new string value computed by concatenating the previous value of R and S.
15. Increase k by 1.
16. Go to step 5.
NOTE
The encoding is partly based on the encoding described in RFC1738, but the entire encoding specified in
this standard is described above without regard to the contents of RFC1738.
B.2.2 unescape (string)
The unescape function is a property of the global object. It computes a new version of a string value in
which each escape sequence of the sort that might be introduced by the escape function is replaced with
the character that it represents.
When the unescape function is called with one argument string, the following steps are taken:
1. Call ToString(string).
2. Compute the number of characters in Result(1).
3. Let R be the empty string.
4. Let k be 0.
5. If k equals Result(2), return R.
6. Let c be the character at position k within Result(1).
7. If c is not %, go to step 18.
8. If k is greater than Result(2)−6, go to step 14.
9. If the character at position k+1 within Result(1) is not u, go to step 14.
10. If the four characters at positions k+2, k+3, k+4, and k+5 within Result(1) are not all hexadecimal
digits, go to step 14.
11. Let c be the character whose code point value is the integer represented by the four hexadecimal digits
at positions k+2, k+3, k+4, and k+5 within Result(1).
12. Increase k by 5.
13. Go to step 18.
14. If k is greater than Result(2)−3, go to step 18.
15. If the two characters at positions k+1 and k+2 within Result(1) are not both hexadecimal digits, go to
step 18.
16. Let c be the character whose code point value is the integer represented by two zeroes plus the two
hexadecimal digits at positions k+1 and k+2 within Result(1).
17. Increase k by 2.
18. Let R be a new string value computed by concatenating the previous value of R and c.
19. Increase k by 1.
20. Go to step 5.
2. Call ToInteger(start).
3. If length is undefined, use +∞; otherwise call ToInteger(length).
4. Compute the number of characters in Result(1).
5. If Result(2) is positive or zero, use Result(2); else use max(Result(4)+Result(2),0).
6. Compute min(max(Result(3),0), Result(4)–Result(5)).
7. If Result(6) ≤ 0, return the empty string “”.
8. Return a string containing Result(6) consecutive characters from Result(1) beginning with the
character at position Result(5).
B.2.6 Date.prototype.toGMTString ( )
NOTE
The property toUTCString is preferred. The toGMTString property is provided principally for
compatibility with old code. It is recommended that the toUTCString property be used in new
ECMAScript code.
The Function object that is the initial value of Date.prototype.toGMTString is the same Function
object that is the initial value of Date.prototype.toUTCString.
Free printed copies can be ordered from:
ECMA
114 Rue du Rhône
CH-1204 Geneva
Switzerland
Fax: +41 22 849.60.01
Internet: [email protected]
Files of this Standard can be freely downloaded from our ECMA web site (www.ecma.ch). This site gives full
information on ECMA, ECMA activities, ECMA Standards and Technical Reports.
ECMA
114 Rue du Rhône
CH-1204 Geneva
Switzerland
See inside cover page for obtaining further soft or hard copies.