A Derivation of Special Relativity
from Causal Sets
arXiv:1005.4172v2 [math-ph] 29 Aug 2010
Kevin H. Knuth
Departments of Physics and Informatics
University at Albany (SUNY)
Albany NY 12222, USA
Newshaw Bahreyni
Department of Physics
University at Albany (SUNY)
Albany NY 12222, USA
October 29, 2018
Abstract
We present a derivation of special relativity based on the quantification
of causally-ordered events. We postulate that events are fundamental, and
that some events have the potential to influence other events, but not vice
versa. This leads to the concept of a partially-ordered set of events, which
is called a causal set. Quantification proceeds by selecting two chains of
coordinated events, each of which represents an observer, and assigning
a valuation to each chain. An event can be projected onto each chain by
identifying the earliest event on the chain that can be informed about the
event. In this way, events can be quantified by a pair of numbers, referred
to as a pair, that derives from the valuations on the chains. Pairs can
be decomposed into a sum of symmetric and antisymmetric pairs, which
correspond to time-like and space-like coordinates. From this pair, we
derive a scalar measure and show that this is the Minkowski metric. The
Lorentz transformations follow, as well as the fact that speed is a relevant
quantity relating two inertial frames, and that there exists a maximal
speed, which is invariant in all inertial frames. Furthermore, the form
of the Lorentz transformation in this picture offers a glimpse into the
origin of spin. All results follow directly from the event postulate and the
adopted quantification scheme.
1
Introduction
In the early 1900s, Albert Einstein radically altered our picture of the universe
by doing away with the Newtonian concepts of absolute space and time [6], and
replacing them with relativity and the space-time continuum. Since then, we
have come to imagine space-time to be the fundamental fabric out of which
the universe is constructed, yet at the same time we appreciate that different
observers can interpret events differently with respect to space and time. The
1
former perspective provides us with a fundamental framework reminiscent of
classical physics, whereas the latter perspective places the observer in a central
role similar to what we see in quantum mechanics.
We consider a picture of the universe as being described by a set of events.
We do not need to specify precisely what these events refer to, although we visualize them as representing some degree of distinguishability along a chain, which
can be used to represent a physical object. Most importantly, these events do
not happen in a space-time. Instead, the events themselves are considered to be
fundamental. Put simply, events happen. We assume only minimal additional
structure, and assert that some events have the potential to be influenced by
other events. However, this potential is not reciprocal. That is, if an event
A can be influenced by event B, then it is not possible that event B can be
influenced by event A. The result is that events can be partially ordered. We
stress that we make no assumptions about positions of events in space or time,
no assumptions about velocities or angles; we merely assert that some events
can be ordered and others cannot.
What follows is a theory describing the physics of events. That is, we investigate what can be said about events given only their relationships to one
another. The set of events, in conjunction with the ordering relation, gives rise
to a partially ordered set. The theory itself originates from the quantification
of the partially ordered set. This is done by selecting and quantifying a distinguished chain of events called an observer. By introducing two observers, we can
quantify other events in the poset by projecting them onto the two chains and
assigning each event with a pair of numbers. We show that the assigned pairs
can be decomposed into a symmetric part, which is related to chains, and an
anti-symmetric part, which is related to anti-chains. The result is a decomposition into one-dimensional time and n-dimensional space. This decomposition
naturally results in the Minkowski metric as a measure of distance between
events. Furthermore, we derive Lorentz transformations by considering changes
of perspective induced by considering alternate observer chains. Rather than
being fundamental, we find that space-time arises as a construct made to make
chains of events look simple.
2
Partially Ordered Sets of Events
Our approach relies on the
Event Postulate: Events are fundamental. Some events have the
potential to be influenced by other events. However, this potential
is not reciprocal. That is, if event A can be influenced by event B,
then it is not possible that event B can be influenced by event A.
This potential to be influenced can be viewed as a binary ordering relation,
which relates pairs of events and enables one to impose a partial order. If event
A has the potential to be influenced by event B, we say that A includes B and
write A ≥ B. This notion of inclusion is transitive, so that if A ≥ B and B ≥ C,
then it is also true that A ≥ C. Given any pair of events, it is not necessarily
true that one can be informed about the other. In this case, we say that the
events are incomparable and write A||C. The relationships A ≥ B and B ≥ A
can only hold simultaneously if A = B.
2
Figure 1: Both diagrams represent the same poset of events. In these diagrams,
there is no meaning to the horizontal or vertical spacing of the events. We have
the freedom to draw the diagram so as to make chains look simple, but this is
merely illustrative. More importantly, we find that we have the similar freedom
to make chains look simple quantitatively. Note that these are not exactly Hasse
diagrams as more connections than just the covers are displayed.
Taken together, a set of events and the described ordering relation results
in a partially-ordered set, or poset, of events. Such a poset of events is called a
causal set [2]. However, we note that the ordering relation need not assume a
strict causal relationship, but only the potential for influence. Causal sets have
been employed in approaches to quantum gravity, and are typically endowed
with, or embedded within, a Minkowski geometry exhibiting Lorentz invariance
[3].
We approach the problem from another direction entirely. Given that the
poset is considered to be fundamental, we aim to derive a means to quantify
events.
Additional structure is introduced to the poset by identifying a distinguished
set of events called an observer. An observer is a chain of events, which means
that the events are totally ordered so that they occur in succession. That is, a
chain is a set of events P such that for all events x and y in P, we have that
either x ≤ y or y ≤ x. The events describing an observer reflect distinguishable
units of change. Physically, one can imagine them to be generated by a clock.
As Figure 1 illustrates, depending on how the poset is displayed, chains
can be made to look complicated or simple. The overall goal is to develop a
description of events, and we shall do this in such a way to make chains look
simple. Since, at this point, we have no notion of an interval either in space or
time, we are at liberty to stretch and squeeze the poset so that certain events in
a chain of our choice are drawn at equally-spaced intervals. While stretching or
squeezing a diagram is merely illustrative, we have the similar freedom to make
chains look simple quantitatively.
3
3
Quantification
We introduce quantification by assigning a valuation to a chain. First we select
a subset of events on the chain that we will use for quantification. Not all
events on the chain need be used, nor will we display additional events in the
subsequent figures. Events on the chain that are to be used for quantification
are assigned a real number such that for any two of these events x and y on the
chain P related by x ≤ y we assign real numbers px ≤ py . We are free to adopt
any valuation we please. To make chains look simple, we assign a valuation,
such that for successive quantifying events x ≺ y, py = px + c where c is a
positive real number. Without loss of generality, we choose c = 1 and label
the quantifying events with successive integers. From now on, we will refer to
events on the chain using their label, so that event px is assigned a value of
px , where from the context it will be apparent whether px refers to the poset
element representing the event or its valuation.
An event x can be projected onto a chain P if there exists an event p ∈ P
such that x ≤ p. Since any event p+ ≥ p on the chain also includes x by
transitivity, and the chain is finite, there must exist a least event px ∈ P such
that px ≥ x. The projection of x onto the chain P is given by the least event
px on the chain P such that x ≤ px . If one considers the sub-poset consisting
only of the element x and the elements comprising the chain P , then in this
sub-poset px covers x, px ≻ x. If the projection exists, the element x can then
be “quantified” by assigning to the element x the numeric label assigned to the
element px ∈ P . Note that this quantification scheme does not ensure that all
events in the poset will be quantified. For example, if the observer cannot be
informed about the event, then the event does not project to the chain and thus
will not be quantified.
Quantification can be made more rich by introducing another observer. We
implement this by identifying a second chain Q, and endowing it with a valuation
of its own. Events used for quantification are selected carefully so that they are
synchronized to the quantifying events of the first chain P. This can be done
in the standard way by considering projections of events on P onto Q, and vice
versa, and requiring that successive quantifying events on one chain project
to successive quantifying events on the other. Note that a consequence of the
synchronization requirement is that not all chains qualify as observers.
3.1
Interval Pair (Pair)
We want to quantify relationships between events. This requires that we focus
on the difference in the way that a pair of events is projected onto a pair
of chains. We begin by identifying one way to quantify a pair of events. In
the follow section, we will introduce a second technique and insist that it be
consistent with the first.
The first method involves forming a pair of numbers from the direct product
of the independent measures obtained by projecting an event onto each of the
two reference chains P and Q. The result is that an event x is quantified by
the pair of numbers, (px , qx ). To quantify an interval, we designate one event
as the origin 0, and comparing its projection to the projection of another event,
4
A
B
qx
px
x
P
P
Q
Q
Figure 2: A) The projection of an event x onto a chain is the minimal event on
the chain that can be potentially influenced by x, such that for all pw ≤ px we
have that x ∨ pw = px . B) Chains can be synchronized by selecting quantifying
events on the chains such that successive quantifying events on one chain project
to successive quantifying events on the other and vice versa.
which we will label as 1. The result is the pair
(∆p, ∆q) = (p1 , q1 ) − (p0 , q0 ) = (p1 − p0 , q1 − q0 ).
(1)
From now on, we will suppress the deltas in the notation, and refer to such a
pair of differences as an interval pair, or more simply as a pair.
Note that some pairs of events project in such a way that both chains agree
as to the order in which they are informed about the events (Figure 3, right);
whereas other pairs of events project in such a way that the order in which one
chain is informed is reverse that of the other chain (Figure 3, left). This fact
suggests a convenient decomposition. Given a pair (p, q), we can decompose it
into the sum of a symmetric pair and an antisymmetric pair, such that
p + q p + q p − q q − p
(p, q) =
,
,
+
.
(2)
2
2
2
2
This decomposition, which we call the symmetric/antisymmetric decomposition, distinguishes between the two distinct relationships involving differences
between pairs of events and the reference chains: that of chains and antichains
(Figure 4).
3.2
Scalar Measures
The second method of quantification involves taking the direct product of the
chains themselves and forming the unique scalar measure on the product lattice.
Differences are handled by defining the origin of the valuation on each chain to
be the minimal projected event of the pair of events. We aim to identify a
unique scalar measure that is a non-trivial function of the pair. To do this, we
define the function f as an unknown map from a pair to a real scalar, and insist
5
p2
q1
p2
p1
q2
p1
1
q2
q1
2
2
1
Q
P
Q
P
p2
q1
q2
q1
q2
p2
p1
q2
p2
q1
2
p1
2
1
2
1
p1
1
P
Q
P
Q
P
Q
Figure 3: This figure illustrates five classes of relationships between two events
and the observer chains. (Top Left) These events form an antichain and are
recorded in opposite order by the two chains and are interpreted by the observers
as being separated only in space. (Top Right) In contrast, these events form a
chain and are observed to occur in the same order with respect to the two chains.
They are interpreted by the observers as being separated in time. (Bottom Left)
These events are interpreted as being time-like separated. (Bottom Center)
These events are interpreted as being light-like separated. (Bottom Right) These
events are interpreted as being space-like separated.
that the scalar obeys the symmetric/antisymmetric decomposition (2)
f (a, b) = f
(a + b) (a + b)
(a − b) −(a − b)
,
,
+f
.
2
2
2
2
(3)
This functional equation has several solutions:
F 1. f (a, b) =
a
(4)
F 2. f (a, b) =
F 3. f (a, b) =
b
ab
(5)
(6)
F 4. f (a, b) =
F 5. f (a, b) =
(a + b)n n ∈ odd
a2 + b 2
(7)
(8)
We gain some valuable insight by recognizing that this is a special case of
the functional equation
f (a1 + b1 , a2 + b2 ) = f (a1 , a2 ) + f (b1 , b2 ).
6
(9)
p2
q2
q ª Dq = q2 - q1
2
p ª Dp = p2 - p1
q1
p1
1
P
Q
Figure 4: The interval defined by the events, quantified by the pair of numbers
(p, q), can be decomposed into symmetric and antisymmetric parts where the
symmetric part is chain-like and the antisymmetric part is antichain-like.
where a = a1 + b1 and b = a2 + b2 with a1 = a2 and b1 = −b2 . We call (9)
the Orthogonality Relation, and note that this represents a mapping from a real
pair to a real scalar, such that when one adds two pairs, the resulting scalar is
also arrived at by simple addition.
Rather than taking the direct product of measures from the two chains and
transforming them to a scalar, we can quantify events with a scalar measure
assigned to the direct product of the chains. Consistency requires that the two
approaches agree with one another. The lattice product is associative, which
means that the scalar measure also must obey the associativity equation [1, 11]
g(f (a, b)) = g(a) + g(b),
(10)
where g is an arbitrary function.
The result is that there are two remaining solutions. The first solution,
f (a, b) = a + b, is given by F 4 with n = 1 and g(·) equal to the identity. This
solution is proportional to the symmetric component of the decomposition, and
is referred to as the symmetric scalar. The symmetric scalar trivially satisfies
additivity under the symmetric/antisymmetric decomposition. However, while
the antisymmetric component does satisfy additivity, it does not satisfy associativity and therefore it is not a consistent measure for the interval. The second
solution is F 3, where we have f (a, b) = ab with g(·) = log(·), so that the scalar
associated with the pair (a, b) is ab. We refer to this as the interval scalar and
denote the interval scalar with the symbol ∆s2
∆s2 = (pb − pa )(qb − qa ).
keeping in mind that, with respect to the pair, nothing is really being squared.
It is straightforward to verify that the interval scalar obeys additivity under this
decomposition, since
p + q p + q p − q q − p
pq =
+
,
(11)
2
2
2
2
7
which can be rewritten as
p + q 2
pq =
2
−
p − q 2
2
.
(12)
Furthermore, it is important to note that any power of the scalar measure
can be written in the same form
pk + q k 2 pk − q k 2
pk q k =
−
,
(13)
2
2
where
(pk , q k ) =
4
pk + q k pk + q k pk − q k q k − pk
+
.
,
,
2
2
2
2
(14)
Coordinates
We have shown that the interval pair can be used to form two scalar measures:
the interval scalar and the symmetric scalar. Here we explore the relationships
between these scalar measures. We begin by selecting an event 0 to serve as the
origin. The interval between event a and the origin is quantified by forming the
pair (pa −p0 , qa −q0 ). The symmetric scalar, and its antisymmetric counterpart,
which is not a proper measure, can be used to define coordinates for event a by
ta
=
xa
=
(pa − p0 ) + (qa − q0 )
2
(pa − p0 ) − (qa − q0 )
,
2
so that the pair can be written as (ta + xa , ta − xa ) and decomposed into the
sum of two pair
(∆p, ∆q) = (pa − p0 , qa − q0 ) = (ta + xa , ta − xa ) = (ta , ta ) + (xa , −xa ),
each of which depends only on one of the two coordinates. We can construct
similar coordinates for event b,
tb
=
xb
=
(pb − p0 ) + (qb − q0 )
2
(pb − p0 ) − (qb − q0 )
,
2
so that
(pb − p0 , qb − q0 ) = (tb + xb , tb − xb ) = (tb , tb ) + (xb , −xb ).
It is easily verified that the interval between events a and b can be quantified by
taking the differences of their respective pairs formed with respect to the origin.
The result is that
(pb − pa , qb − qa ) = (tb − ta , tb − ta ) + (xb − xa , −(xb − xa )),
so scalar comparisons can be made simply by constructing the difference between
their coordinates. By denoting such differences in general as
∆t
∆x
= tb − ta
= xb − xa ,
8
s1
s2
p1
r1
p2
q1
r2
q2
1
P
2
Q
R
S
Figure 5: This figure illustrates two events that are quantified appropriately by
chains Q and R, but not by the pair of chains P and Q or the pair of chains R
and S, both of which view the events as simply being distinct in time.
the pair can be written simply as
(∆p, ∆q) = (∆t + ∆x, ∆t − ∆x) = (∆t, ∆t) + (∆x, −∆x).
(15)
Given the coordinate representation of the pair (15), we can write the interval
scalar as
∆s2 = ∆p∆q = ∆t2 − ∆x2 ,
(16)
which we recognize immediately as the Minkowski metric.
5
Consistency
Given a set of three or more mutually synchronized chains, quantification of an
interval via projection to one pair of chains in the set need not necessarily agree
with the quantification obtained by projecting to another pair of chains in the
same set. Figure 5 illustrates an interval that is bounded by some pairs of chains,
but not others. This particular situation is characterized by the fact that some
pairs of chains result in a unique time-like quantification, whereas other pairs
of chains will result in a consistent quantification. This time-like quantification
can be envisioned by imagining that you and a friend are looking along the
same line-of-sight and you each observe two distant flashes, one occurring after
another. In this case it is impossible to determine whether these two flashes
originated from the same place at different times, or different places at the
same time, or any other situation in between.
Another situation that can occur is one in which the event projects to a
set of synchronized chains in such a way that the event is first observed by one
chain, followed by its neighboring chains, and so on. In this case, we must resort
to an additional decomposition.
5.1
The Pythagorean Decomposition
What follows is a derivation of the Pythagorean theorem, which describes how
the space coordinate can be further decomposed into additional coordinates.
9
Y
D
2
Dd
Dy
X
1
Dx
X
3
D
Y
Figure 6: Here we illustrate how one can decompose the space-like aspect of
an interval ∆d into components ∆x and ∆y. This figure represents an ‘aerial
view’ of the poset looking down on the reference chains, which are indicated by
cross-hairs. The original interval between events 2 and 3 has been ‘decomposed’
by carefully selecting a third event labeled 1.
Here we derive a decomposition into two spatial coordinates and note that
subsequent decompositions into additional spatial dimensions proceed similarly.
Consider Figure 6 where a pair of events labeled 2, 3 that have been quanti¯ and found to have identical time coordinates t = t ,
fied by two chains D̄ and D̄
2
3
so that the interval pair is (d¯3 − d¯2 , d¯3 − d¯2 ) = (d3 − d2 , −(d3 − d2 )), where d¯i
¯ , d¯ represents the projection of event
represents the projection of event i onto D̄
i
i onto D̄, and di represents the spatial (antisymmetric) coordinate assigned to
event i. We then select a special event 1 such that its time coordinate is iden¯ , which quantify
tical to the others t1 = t2 = t3 . We introduce chains X̄ and X̄
¯
the interval between events 1 and 3, and chains Ȳ and Ȳ, which quantify the
interval between events 1 and 2. Furthermore, event 1 is selected so that the
three intervals satisfy the orthogonality equation (9). We no longer expect the
coordinates themselves to sum in a pair-wise fashion since they refer to distinct
sets of chains, but we can select an event 1 so that the scalar interval sums
¯3 − x̄¯1 )(x̄3 − x̄1 ) + (ȳ¯2 − ȳ¯1 )(ȳ2 − ȳ1 ).
(d¯3 − d¯2 )(d¯3 − d¯2 ) = (x̄
which requires that the x and y coordinates of event 1 are given by
x1
y1
= d3 − x3
= d2 − y2 ,
since the time coordinates of the three events are equal and events 2 and 3 are
independent of one another. Writing the scalar interval in terms of the new
coordinates immediately results in
(d3 − d2 )2 = (x3 − x1 )2 + (y2 − y1 )2 ,
10
which we recognize as the Pythagorean theorem
∆d2 = ∆x2 + ∆y 2 .
(17)
The result is that the Minkowski metric
∆s2 = ∆t2 − ∆d2 ,
can be further decomposed into
∆s2 = ∆t2 − ∆x2 − ∆y 2 ,
or
∆s2 = ∆t2 − ∆x2 − ∆y 2 − ∆z 2 ,
as necessary.
5.2
Time and Space
At this point we have derived that the two classes of coordinates, which we
call space and time, are related to the interval scalar via the Minkowski metric.
Rather than forming a fabric or arena where events take place, space and time
are quantifications assigned to make relationships between events and chains
of events look simple. We see that the concepts of time and space arise from
the symmetric and antisymmetric decomposition, which originates from the
orthogonality of chains and antichains, respectively. Time, being related to
chains, is necessarily one-dimensional. Space, being related to antichains, can
be decomposed into multiple dimensions.
6
Relating Observers
We could have chosen another pair of synchronized chains as a basis of quantification. That is, instead of choosing synchronized chains P and Q, we could
have chosen two other synchronized chains P′ and Q′ , (Figure 7) such that
successive events in P′ and Q′ each comprise an interval which nave non-zero
projections ∆p = m and ∆q = n. We say that chains P and P′ are coordinated,
and refer to each pair of chains as an inertial frame of reference, or a frame for
short.
6.1
Invariance of the Interval Scalar
We say that the pair of synchronized observers P and Q comprise frame 1, and
will refer to them from now on as P1 and Q1. Similarly, the synchronized pair
P′ and Q′ comprise frame 2, where we will refer to them as P2 and Q2. Choosing two events, we denote the interval scalar measured between the events in
frame 1 as ∆s21 and the interval scalar measured in frame 2 as ∆s22 . The interval
scalar measured in each frame must somehow be related. More specifically, the
interval scalar measured in reference frame 1 must be a function of the interval
scalar measured in reference frame 2, as well as the only other possible quantities relating the frames, the projections m and n. We also must allow for the
11
n
m
P
Q
P’
P
Q’
Q
Figure 7: On the left we illustrate another possible relationship among chains.
The new chain is coordinated to the original pair such that successive events
result in projections ∆p = m and ∆q = n. This new chain can be used to
construct a second pair of observers and thus define another frame of reference.
fact that observers can choose different scales for labeling their events. Thus we
have the functional relationship
∆s22 = σ21 2 f (ρ12 , ∆s21 ),
(18)
where ρ21 = ρ(m21 , n21 ) relates the frames by projecting successive events from
the chains comprising frame 2 onto the chains comprising frame 1, and σ21
converts the arbitrary scale from one frame to the other. We can also relate the
interval scalar measured in frame 1 to the interval scalar measured in frame 2
by
∆s21 = σ12 2 f (ρ12 , ∆s22 ),
(19)
where ρ12 is a different ratio relating frame 1 to frame 2. Taking the derivative
of (19) with respect to ∆s21 , we find that
df df
= 1.
σ12 2 σ21 2
d∆s21 d∆s22
Since the choice of scale is independent of the transformation, we have that
σ12 2 =
and
1
σ21 2
df df
= 1.
d∆s21 d∆s22
(20)
(21)
By introducing a third observer and relating three observers to one another
in a pairwise fashion, we have
df df df df df df
=
=
= 1,
d∆s21 d∆s22
d∆s22 d∆s23
d∆s21 d∆s23
12
and by comparing pairs of equalities, we find that
df
= ±1,
d∆s2
(22)
for any interval ∆s2 . The second derivative is
d2 f
= 0,
d∆s2
(23)
which means that the function f is linear in its second argument. We can then
rewrite the original transformation as
∆s22 = σ21 2 ∆s21 g(ρ21 ) + C,
where g is a function of ρ. However, we know that in the special case where
∆s21 = 0, then it is always true that ∆s22 = 0, so C = 0 leaving us with
∆s22 = σ21 2 ∆s21 g(ρ21 ).
Furthermore, since the first derivative was equal to ±1, we have that
g(ρ) = ±1.
However,g(ρ) = −1 would change the sign of the interval violating the partial
order. The result is that for all coordinated observers the scalar interval is
invariant, up to an arbitrary observer-selected scale,
∆s22 = σ21 2 ∆s21 .
6.2
(24)
Transformations
We can use the invariance of the interval scalar to relate quantification of an
interval in one frame to quantification in another. Without loss of generality,
we can assume that the arbitrary scale is σ12 2 = σ21 2 = 1. The projection p2
can be written as some function of p1 , so that
p2 = f (p1 , ρ21 ),
where f is an unknown function depending on the only possible variables relating
the projections: p1 and ρ21 . We can write the projection q2 in terms of q1
similarly
q2 = g(q1 , ρ21 ),
so that the pair transforms as
(p2 , q2 ) = (f (p1 , ρ21 ), g(q1 , ρ21 )).
We begin by rewriting the interval scalar (24) in terms of the projections
p1 q1 = p2 q2 = f (p1 , ρ21 )g(q1 , ρ21 )
(25)
where f and g are unknown functions. Taking the second derivative with respect
to p1
d2 (p1 q1 )
d2 f (p1 , ρ21 )
=
0
=
g(q
,
ρ
)
1
21
dp1 2
dp1 2
13
we find that f must be linear in p
f (p1 , ρ21 ) = cp1 φ(ρ21 ) + b,
where φ is a function of ρ alone. In the special case where the frames are
synchronized with one another ρ21 = 1, which implies that b = 0
f (p1 , ρ21 ) = cp1 φ(ρ21 ).
(26)
Similarly, by taking the second derivative with respect to q1 , we can show that
g(q1 , ρ21 ) = kq1 γ(ρ21 ).
(27)
By transforming from frame 1 to frame 2 and back again, we have
p1 = c2 p1 φ(ρ12 )φ(ρ21 ).
(28)
c = ±1
(29)
φ(ρ12 ) = φ(ρ21 )−1 .
(30)
This implies that
and
Similarly, considering q1 we have that
k = ±1
(31)
γ(ρ12 ) = γ(ρ21 )−1 .
(32)
and
Note that k and c must be of like sign, so that ck = 1 and the scalar interval
does not change sign.
Rewriting (25)
p1 q1 = p1 q1 φ(ρ21 )γ(ρ21 ),
(33)
which implies that, in general,
φ(ρ) = γ −1 (ρ).
(34)
Last, we observe that the order in which the chains are represented in the
pair is irrelevant, but that interchanging P and Q results in inverting the ratio
ρ. This implies that
φ(ρ) = γ(ρ−1 ).
(35)
Equating (34) and (35) we have that
γ −1 (ρ) = γ(ρ−1 ).
(36)
Taking the derivative with respect to ρ, we find that
γ −2 (ρ) = ρ−2 ,
(37)
γ(ρ) = ±ρ
(38)
φ(ρ) = ±ρ−1 ,
(39)
which implies that
and from (34) that
14
where the two functions must be the same sign.
The transformation of pairs of projections is given by the simple relation
(p′ , q ′ ) = (pρ−1 , qρ).
(40)
We now determine the function ρ = ρ(m, n) by considering a special case.
Consider an interval defined by two successive events on chain Q′ . These events
have some projection q ′ onto Q′ , the same projection p′ = q ′ onto P′ and
projections m and n onto chains P and Q, respectively. This results in the
relation
(q ′ , q ′ ) = (mρ−1 (m, n), nρ(m, n)).
(41)
Equating the elements of the pair on the right side results in
mρ−1 (m, n) = nρ(m, n),
which results in
ρ(m, n) = ±
r
n
m
since neither of the projections m nor n are zero.
The final pair transformation from one frame to another is
r
r
m
n
,q
).
(p′ , q ′ ) = ±(p
n
m
(42)
(43)
(44)
The fundamental nature of the pair of projections is manifest in the simplicity
of this transformation. We observe that this is related to the Bondi k-calculus
[4], and that one can write this relation as a pair-wise multiplication between
(ρ21 −1 , ρ21 ), which has a scalar measure of unity, and (p2 , q2 ) as in Kauffman’s
iterant algebra [9].
Changing variables to the coordinates, mixes the pair resulting in a linear
transformation
(∆t2 + ∆x2 , ∆t2 − ∆x2 ) = ((∆t1 + ∆x1 )ρ21 −1 , (∆t1 − ∆x1 )ρ21 ),
(45)
which can be represented by a matrix multiplication. Solving for ∆t2 and ∆x2 ,
we find that
∆t2
=
∆x2
=
ρ21 + ρ21 −1
ρ21 − ρ21 −1
∆t1 +
∆x1
2
2
−1
−1
ρ21 − ρ21
ρ21 + ρ21
∆t1 +
∆x1
2
2
(46)
(47)
By defining
β21 =
ρ21 2 − 1
,
ρ21 2 + 1
(48)
we find the Lorentz transformation in coordinate form
∆t2
=
∆x2
=
−β21
1
p
∆t1 + p
∆x1
2
1 − β21
1 − β21 2
−β21
1
p
∆t1 + p
∆x1 .
2
1 − β21
1 − β21 2
15
(49)
(50)
7
Speed
The derivation above reveals that relevant quantity relating two inertial frames
is β, which we recognize as the speed. It is more clearly written in terms of the
projections m and n or in terms of the coordinates
β=
∆x
m−n
=
.
m+n
∆t
(51)
We see that if the two pairs of frames are synchronized with one another, then
m = n = 1 so that β = 0. In this case we say that the frames are at rest
with respect to one another. The maximal speed is attained in the limit where
m → 0 or n → 0, which results in β → ±1. Since the interval scalar is given by
ds2 = ∆p∆q = mn, we have in this case that ds2 = 0. Given that the interval
scalar is invariant, if |β| is unity (maximal) in one frame, it is unity (maximal) in
all frames. We have found that there is an ultimate speed limit that is invariant
for all inertial frames.
8
Conclusion
We present a picture of the universe where events are fundamental. By asserting
that some events have the potential to be influenced by other events, but that
this potential is not reciprocal, we can describe the set of all events as a partially
ordered set or poset, which is typically known as a causal set. Quantification of
the poset proceeds by distinguishing particular events on pairs of chains, such
that successive quantifying events on one chain project to successive quantifying
events on another. Events can then be labeled by their projections onto the
quantifying events of the two reference chains. This results in a quantification
scheme that consists of pairs of numbers, which we show can be mapped to two
possible scalar measures; one of which is the interval scalar. The interval scalar
under the symmetric/antisymmetric decomposition gives rise to the Minkowski
metric. The Lorentz transformations are derived, as well as the fact that speed
is a relevant quantity that has a maximal value invariant in all inertial frames.
We emphasize that all this is derived without assuming the existence of space
or time, motion, constancy of the speed of light, or the principle of relativity.
All follow from the Event Postulate and the adopted quantification scheme.
The ordering of events leads to both the notion of a one-dimensional time and
a multi-dimensional space. Time is distinguished from ordering in the sense that
time includes a measure of closeness, which is a result of quantification. Time
is symmetric in the sense that two events separated only in time are observed
to occur in the same order by all observers comprising a given inertial frame.
The antisymmetry of space arises from the fact that the order in which two
events are observed can be interchanged when considering different observers
in the same inertial frame. In this picture, time is related to chains and space
is related to antichains. The rich mathematics of quaternions and geometric
algebra all follow from this simple fact.
In addition to the Event Postulate, we have made an assumption about the
poset structure. Specifically, we assume that events are sufficiently dense so
that we are able to construct synchronized chains of events as well as identify
events that enable us to decompose intervals into orthogonal components. While
16
such an assumption leads to special relativity and Minkowski space-time, it may
require modification to obtain cosmological expansion or gravity.
The simplicity of the Lorentz transformation when expressed in terms of
pairs has been noted before by Bondi [4] and later explored by Kauffman [9].
Though while attaining a similar formalism, these authors worked within the
usual space-time framework. We approached the problem with the goal of consistently quantifying a partially-ordered set of events, and consequently arrive
at space-time via a convenient decomposition. After submission of the first
version of this paper to the arXiv, we have been introduced to the work of Giacomo Mauro D’Ariano [5] who showed how the Lorentz transformations can
be, in principle, derived from event-counting performed by an observer within
a causal network implemented by a quantum computer. D’Ariano’s approach is
similar in spirit to ours in that causality plays a central role, however it differs
in that it lies within a quantum mechanical framework. We find that the only
necessary feature is the causal relationship, which we represent by a partially
ordered set of events.
It is surprising that arbitrary powers of the interval scalar can be decomposed into the same Minkowskian form. The fact that the Lorentz transformation depends on the square roots of the projections rather than the values
of the projections themselves suggests that square roots of projections are of
fundamental importance. If one rotates the spatial coordinates by a given angle, the projections rotate by the same amount so that a rotation of 2π brings
the coordinates and the projections back to the initial state. However, if one
instead considers quantities dependent on the square roots of projections, these
will rotate at one half the rate so that a rotation of 4π is necessary to return
them to the original state. This suggests that the square roots of projections
are described by the spin group Spin(n), which is a double-cover of the special
orthogonal group SO(n) of rotations. This does not affect the Lorentz transformations since the square roots of projections appear as ratios where the effect of
any rotation is canceled. Therefore the fact that these fundamental quantities
require that one complete two full rotations rather than a single full rotation
does not affect the mechanics and is not readily physically apparent.
It may be of interest to the reader to note that while solution F3 of the
Orthogonality Relation (9) along with the symmetric/antisymmetric decomposition gives rise to the square in the metric, solution F5 gives rise to the square in
the Born Rule of quantum mechanics. More importantly, our recent explorations
into the foundations of quantum mechanics postulated that pairs of numbers
are required to quantify quantum amplitudes [8]. Here we find pairs of numbers
again playing a critical role in the fundamental formulation of special relativity.
It is expected that these results will provide new insights into the meaning of
the pair in quantum mechanics. It is entirely possible that the pair in quantum
mechanics is comprised of square roots of projections rather than the projections
themselves. This would lead to a natural description of Fermions (spin-1/2 particles). Indeed, we already have recognized connections between the proposed
method of poset quantification and the Feynman checkerboard [7] where the
right and left moves correspond to projections (or more likely, the square roots
of projections) onto the two chains. Furthermore, the more fundamental understanding of the metric introduced here has the potential to facilitate the union
of quantum mechanics and gravity resulting in quantum gravity. Already in
our previous work, we showed that quantum theory is more fundamental than
17
space [8]. Here we show that space-time itself is not fundamental, but rather is
a convenient construct chosen to make events look simple.
Given the belief that physical law reflects an underlying order, one would
expect that given this underlying order, one ought to be able to derive the most
fundamental aspects of physical law. This fundamental principle is outlined in
an earlier work [10], and has been recently demonstrated to provide insights into
the foundations of quantum mechanics [8]. Here we apply this methodology to
understand the physics of events and derive specific notions of time, space and
motion.
9
Acknowledgements
Kevin Knuth would like to thank Philip Goyal, Keith Earle, Ariel Caticha, John
Skilling, Seth Chaiken, Adom Giffin, Jeff Scargle and Jeffrey Jewell for many
insightful discussions and comments. He would also like to thank Rockne, Ann
and Emily Knuth for their support, and Henry Knuth for suggesting that he
‘try using a J’. Newshaw Bahreyni would like to thank Shahram Pourmand for
his helpful discussions and Mahshid Zahiri, Mohammad Bahreyni and Shima
Bahreyni for their continued support. The authors would also like to thank
Giacomo Mauro D’Ariano, Alessandro Tosini, Cristi Stoica and Patrick O’Keefe
for valuable comments that have improved the quality of this work.
References
[1] J. Aczél, Lectures on functional equations and their applications, Academic
Press, New York, 1966.
[2] L. Bombelli, J.-H. J.-H. Lee, D. Meyer, and R. Sorkin, Space-time as a
causal set, Phys. Rev. Lett. 59 (1987), 521–524.
[3] L. Bombelli and D.A. Meyer, The origin of lorentzian geometry, Physics
Lett. A 141 (1989), 226–228.
[4] H. Bondi, Relativity and common sense, Dover, New York, 1980.
[5] G. M. D’Ariano, On the “principle of the quantumness”, the quantumness of relativity, and the computational grand-unification, (2010),
arXiv:1001.1088 [quant-ph].
[6] A. Einstein, Zur elektrodynamik bewegter körper, Annalen der Physik. 17
(1905), 891–921.
[7] R. P. Feynman and A. R. Hibbs, Quantum mechanics and path integrals,
McGraw-Hill, New York, 1965.
[8] P. Goyal, K. H. Knuth, and J. Skilling, Origin of complex quantum amplitudes and Feynman’s rules, Phys. Rev. A 81 (2010), 022109,
arXiv:0907.0909 [quant-ph].
[9] L. H. Kauffman, Transformations in special relativity, Int. J. Theo. Phys.
24 (1985), no. 3, 223–236.
18
[10] K. H. Knuth, Deriving laws from ordering relations., Bayesian Inference
and Maximum Entropy Methods in Science and Engineering, Jackson Hole
WY, USA, August 2003 (New York) (G. J. Erickson and Y. Zhai, eds.),
AIP Conference Proceedings, no. 707, American Institute of Physics, 2004,
arXiv:physics/0403031v1 [physics.data-an], pp. 204–235.
[11]
, Measuring on lattices, Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Oxford, MS, USA, 2009 (New
York) (P. Goggans and C.-Y. Chan, eds.), AIP Conference Proceedings
1193, American Institute of Physics, 2009, arXiv:0909.3684v1 [math.GM],
pp. 132–144.
19