Академический Документы
Профессиональный Документы
Культура Документы
Purpose
The first and primary purpose of this document is to serve as a complete technical specification
of Automatic Reference Counting. Given a core Objective-C compiler and runtime, it should be
possible to write a compiler and runtime which implements these new semantics.
The secondary purpose is to act as a rationale for why ARC was designed in this way. This
should remain tightly focused on the technical design and should not stray into marketing
speculation.
Background
This document assumes a basic familiarity with C.
Blocks are a C language extension for creating anonymous functions. Users interact with and
transfer block objects using block pointers, which are represented like a normal pointer. A block
may capture values from local variables; when this occurs, memory must be dynamically
allocated. The initial allocation is done on the stack, but the runtime provides a Block_copy
function which, given a block pointer, either copies the underlying block object to the heap,
setting its reference count to 1 and returning the new block pointer, or (if the block object is
already on the heap) increases its reference count by 1. The paired function is Block_release,
which decreases the reference count by 1 and destroys the object if the count reaches zero and is
on the heap.
Objective-C is a set of language extensions, significant enough to be considered a different
language. It is a strict superset of C. The extensions can also be imposed on C++, producing a
language called Objective-C++. The primary feature is a single-inheritance object system; we
briefly describe the modern dialect.
Objective-C defines a new type kind, collectively called the object pointer types. This kind has
two notable builtin members, id and Class; id is the final supertype of all object pointers. The
validity of conversions between object pointer types is not checked at runtime. Users may define
classes; each class is a type, and the pointer to that type is an object pointer type. A class may
have a superclass; its pointer type is a subtype of its superclasss pointer type. A class has a set of
ivars, fields which appear on all instances of that class. For every class T theres an associated
metaclass; it has no fields, its superclass is the metaclass of Ts superclass, and its metaclass is a
global class. Every class has a global object whose class is the classs metaclass; metaclasses
have no associated type, so pointers to this object have type Class.
A class declaration (@interface) declares a set of methods. A method has a return type, a list of
argument types, and a selector: a name like foo:bar:baz:, where the number of colons
corresponds to the number of formal arguments. A method may be an instance method, in which
case it can be invoked on objects of the class, or a class method, in which case it can be invoked
Page 1
This looks in the dynamic class of the receiver for a method with this name, then in that classs
superclass, etc., until it finds something it can execute. The receiver expression may also be the
name of a class, in which case the actual receiver is the class object for that class, or (within
method definitions) it may be super, in which case the lookup algorithm starts with the static
superclass instead of the dynamic class. The actual methods dynamically found in a class are not
those declared in the @interface, but those defined in a separate @implementation declaration;
however, when compiling a call, typechecking is done based on the methods declared in the
@interface.
Method declarations may also be grouped into protocols, which are not inherently associated
with any class, but which classes may claim to follow. Object pointer types may be qualified
with additional protocols that the object is known to support.
Class extensions are collections of ivars and methods, designed to allow a classs @interface to
be split across multiple files; however, there is still a primary implementation file which must see
the @interfaces of all class extensions. Categories allow methods (but not ivars) to be declared
post hoc on an arbitrary class; the methods in the categorys @implementation will be
dynamically added to that classs method tables which the category is loaded at runtime,
replacing those methods in case of a collision.
In the standard environment, objects are allocated on the heap, and their lifetime is manually
managed using a reference count. This is done using two instance methods which all classes are
expected to implement: retain increases the objects reference count by 1, whereas release
decreases it by 1 and calls the instance method dealloc if the count reaches 0. To simplify certain
operations, there is also an autorelease pool, a thread-local list of objects to call release on later;
an object can be added to this pool by calling autorelease on it.
Block pointers may be converted to type id; block objects are laid out in a way that makes them
compatible with Objective-C objects. There is a builtin class that all block objects are considered
to be objects of; this class implements retain by adjusting the reference count, not by calling
Block_copy.
Evolution
ARC is under continual evolution, and this document must be updated as the language
progresses.
If a change increases the expressiveness of the language, for example by lifting a restriction or
by adding new syntax, the change will be annotated with a revision marker, like so:
Page 2
General
Automatic Reference Counting implements automatic memory management for Objective-C
objects and blocks, freeing the programmer from the need to explicitly insert retains and releases.
It does not provide a cycle collector; users must explicitly manage the lifetime of their objects,
breaking cycles manually or with weak or unsafe references.
ARC may be explicitly enabled with the compiler flag -fobjc-arc. It may also be explicitly
disabled with the compiler flag -fno-objc-arc. The last of these two flags appearing on the
compile line wins.
If ARC is enabled, __has_feature(objc_arc) will expand to 1 in the preprocessor. For more
information about __has_feature, see the language extensions document.
block pointers (formed by applying the caret (^) declarator sigil to a function type)
Page 3
The type system must reliably identify which objects are to be managed. An int* might be
a pointer to a malloced array, or it might be an interior pointer to such an array, or it
might point to some field or local variable. In contrast, values of the retainable object
pointer types are never interior.
The type system must reliably indicate how to manage objects of a type. This usually
means that the type must imply a procedure for incrementing and decrementing retain
counts. Supporting single-ownership objects requires a lot more explicit mediation in the
language.
There must be reliable conventions for whether and when ownership is passed between
caller and callee, for both arguments and return values. Objective-C methods follow such
a convention very reliably, at least for system libraries on Mac OS X, and functions
always pass objects at +0. The C-based APIs for Core Foundation objects, on the other
hand, have much more varied transfer semantics.
Page 4
The behavior of these methods is constrained in the following ways. The term high-level
semantics is an intentionally vague term; the intent is that programmers must implement these
methods in a way such that the compiler, modifying code in ways it deems safe according to
these constraints, will not violate their requirements. For example, if the user puts logging
statements in retain, they should not be surprised if those statements are executed more or less
often depending on optimization settings. These constraints are not exhaustive of the
optimization opportunities: values held in local variables are subject to additional restrictions,
described later in this document.
It is undefined behavior if a computation history featuring a send of retain followed by a send of
release to the same object, with no intervening release on that object, is not equivalent under the
high-level semantics to a computation history in which these sends are removed. Note that this
implies that these methods may not raise exceptions.
It is undefined behavior if a computation history features any use whatsoever of an object
following the completion of a send of release that is not preceded by a send of retain to the same
object.
The behavior of autorelease must be equivalent to sending release when one of the autorelease
pools currently in scope is popped. It may not throw an exception.
When the semantics call for performing one of these operations on a retainable object pointer, if
that pointer is null then the effect is a no-op.
All of the semantics described in this document are subject to additional optimization rules
which permit the removal or optimization of operations based on local knowledge of data flow.
The semantics describe the high-level behaviors that the compiler implements, not an exact
sequence of operations that a program will be compiled into.
Page 5
Rationale
While this might seem uncontroversial, it is actually unsafe when multiple expressions are
evaluated in parallel, as with binary operators and calls, because (for example) one expression
might load from an object while another writes to it. However, C and C++ already call this
undefined behavior because the evaluations are unsequenced, and ARC simply exploits that here
to avoid needing to retain arguments across a large number of calls.
The remainder of this section describes exceptions to these rules, how those exceptions are
detected, and what those exceptions imply semantically.
Consumed parameters
A function or method parameter of retainable object pointer type may be marked as consumed,
signifying that the callee expects to take ownership of a +1 retain count. This is done by adding
the ns_consumed attribute to the parameter declaration, like so:
void foo(__attribute((ns_consumed)) id x);
- (void) foo: (id) __attribute((ns_consumed)) x;
This attribute is part of the type of the function or method, not the type of the parameter. It
controls only how the argument is passed and received.
When passing such an argument, ARC retains the argument prior to making the call.
When receiving such an argument, ARC releases the argument at the end of the function, subject
to the usual optimizations for local values.
Rationale
This formalizes direct transfers of ownership from a caller to a callee. The most common
scenario here is passing the self parameter to init, but it is useful to generalize. Typically, local
optimization will remove any extra retains and releases: on the caller side the retain will be
merged with a +1 source, and on the callee side the release will be rolled into the initialization of
the parameter.
The implicit self parameter of a method may be marked as consumed by adding
__attribute__((ns_consumes_self)) to the method declaration. Methods in the init family are
treated as if they were implicitly marked with this attribute.
It is undefined behavior if an Objective-C message send to a method with ns_consumed
parameters (other than self) is made with a null receiver. It is undefined behavior if the method to
iOS Helping Hands
Page 6
Page 7
(__bridge_retained T) op casts the operand, which must have retainable object pointer
type, to the destination type, which must be a non-retainable pointer type. ARC retains
the value, subject to the usual optimizations on local values, and the recipient is
responsible for balancing that +1.
Page 8
Page 9