While these seemed to work well for everyday phenomena involving solid objects, light was still problematic. Newton believed that light was "corpuscular," but later physicists found that a transeverse wave model of light was more useful. Mechanical waves travel in a medium, an so it was assumed for light. This hypothetical medium was called the "luminiferous aether." It seemed to have some conflicting properties, such as as being extremely stiff, to account for the high speed of light, while at the same time being insubstantial, so as not to slow down the Earth as it passes through. The idea of an aether seemed to reintroduce the idea of an absolute frame of reference, one that is stationary with respect to the aether.
In the early 19th century, light, electricity, and magnitism began to be understood as aspects of the electromagnetic field. Maxwell's equations showed that accelerating a charge produced electromagnetic radiation which always traveled at the speed of light. The equations showed that the speed of the radiation did not change based upon the speed of the source. This is consistent with analogies to mechanical waves. Presumably, however, the speed of the radiation would change based on the speed of the observer. Physicists tried to use this idea to measure the speed of the Earth with respect to the aether. The most famous attempt was the Michelson-Morley experiment. While these experiments were controversial for some time, a consensus emerged that the speed of light did not vary because of the speed of the observer, and since according to Maxwell's equations it did not vary because of the speed of the source, the speed of light must be invariant for all observers.
Before Special Relativity, Hendrik Lorentz and others had already noted that Electromagnetic forces differed depending on the observer. For example, one observer might see nomagnetic field in a particular area while another moving relative to the first does. Lorentz suggested an aether theory in which objects and observers travelling with respect to a stationary aether underwent a physical shortening (Lorentz-Fitzgerald contraction) and a change in temporal rate (time dilation). This allowed what appeared at the time to be a reconciliation of Electromagnetics and Newtonian physics by replacing the Galilean transformations. When the velocities involved are much less than speed of light, the resulting laws simplify to the Galilean transformations. The theory, known as Lorentz Ether Theory (LET) was criticized, even by Lorentz himself, because of its ad hoc nature.
While Lorentz suggested the Lorentz transformation equations, Einstein's contribution was, inter alia, to derive these equations from a more fundamental theory, which theory did not require the presence of an aether. Einstein wanted to know what was invariant (the same) for all observers. Under Special Relativity, the seemingly complex transformations of Lorentz and Fitgerald derived cleanly from simple geometry and the Pythagorean theorem. The original title for his theory was (translated from German) "Theory of Invariants". It was Max Planck who suggested the term "relativity" to highlight the notion of transforming the laws of physics between observers moving relative to one another.
Special relativity is usually concerned with the behaviour of objects and observers which remain at rest or are moving at a constant velocity. In this case, the observer is said to be in an inertial frame of reference. Comparison of the position and time of events as recorded by different inertial observers can be done by using the Lorentz transformation equations. A common misstatement about relativity is that SR cannot be used to
handle the case of objects and observers who are undergoing acceleration (non-inertial reference frames), but this is incorrect. For an example, see the relativistic rocket problem. SR can correctly predict the behaviour of accelerating bodies in the presence of a constant or zero gravitational field, or those in a rotating reference frame. It is not capable of accurately describing motion in varying gravitational fields.
The speed of light in vacuum is the same to all inertial observers. This postulate has been verified experimentally.
Second Postulate
Observation of physical phenomena by more than one inertial observer must result in agreement between the observers as to the nature of reality. Or, the nature of the universe must not change for an observer if their inertial state changes.
Every physical theory should look the same mathematically to every inertial observer.
Two events that occur simultaneously in different places in one frame of reference may occur at different times in another frame of reference (lack of simultaneity).
The dimensions (e.g. length) of an object as measured by one observer may differ from the results of measurements of the same object made by another observer. (See Lorentz transformation equations)
The twin paradox concerns a twin who flies off in a spaceship travelling near the speed of light. When he returns he discovers that his twin has aged much more rapidly than he has (or he aged more slowly).
There are a couple of (equivalent) ways to define momentum and energy in SR. One method uses conservation laws. If these laws are to remain valid in SR they must be true in every possible reference frame. However, if one does some simple thought experiments using the Newtonian definitions of momentum and energy one sees that these quantities are not conserved in SR. One can rescue idea of conservation by making some small modifications to the definitions to account for relativistic velocities. It is these new definitions which are taken as the correct ones for momentum and energy in SR.
Given an object of mass m traveling at velocity v the energy and momentum are given by
where γ is given by
and c is the speed of light. The term γ occurs frequently in relativity, and comes from the Lorentz transformation equations. The energy and momentum can be related through the formula
which is referred to as the relativistic energy-momentum equation.
For velocities much smaller than those of light γ can be approximated
using a series expansion and one finds that
Barring the first term in the energy expression (discussed below), these formulas agree exactly with the standard definitions of Newtonian kinetic energy and momentum. This is as it should be, for special relativity must agree with Newtonian mechanics at low velocities.
Looking at the above formulas for energy, one sees that when an object is at rest (v = 0 and γ = 1) there is a non-zero energy remaining:
This energy is referred to as rest energy. The rest energy does not cause any conflict with the Newtonian theory because it is a constant and, as far as kinetic energy is concerned, it is only differences in energy which matter.
Taking this formula at face value, we see that in relativity, mass is simply another form of energy. That is, there may be processes by which mass (in the form of rest energy) may be converted to other forms of energy such as kinetic energy, heat, or light. That these processes do, in fact, occur has been demonstrated vividly in the form of nuclear reactions. The implications of this formula on 20th century life has made it one of the most famous equations in all of science.
Since γ increases with velocity so does the relativistic mass. This definition is convenient for some purposes. In particular, one can write the equations for energy and momentum as
which are valid in all reference frames. If the velocity is zero the relativistic mass and the rest mass become equal.
Neither definition is right or wrong, it is simply a matter of convenience. It turns out that in applications to general relativity and quantum field theory it is the invariant mass which is more useful. Thus many physicists simply refer to the mass when they actually mean the invariant mass.
SR uses a 'flat' 4 dimensional Minkowski space, usually referred to as space-time. This space, however, is very similar to the standard 3 dimensional Euclidean space, and fortunately by that fact, very easy to work with.
The differential of distance(ds) in cartesian 3D space is defined as:
where are the differentials of the three spatial dimensions. In the geometry of special relativity, a fourth dimension, time, is added, with units of c, so that the equation for the differential of distance becomes:
In many situations it may be convenient to treat time as imaginary (e.g. it may simplify equations), in which case in the above equation is replaced by , and the metric becomes
If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space,
We see that the null geodesics lie along a dual-cone:
defined by the equation
, or
Which is the equation of a circle with r=c*dt.
If we extend this to three spatial dimensions, the null geodesics are
continuous concentric spheres, with radius = distance = c*(+ or -)time.
This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the stars and say "The light from that star which I am receiving is X years old.", we are looking down this line of sight: a null geodesic. We are looking at an event meters away and d/c seconds in the past. For this reason the null dual cone is also known as the 'light cone'. (The point in the lower left of the picture below represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".)
The cone in the -t region is the information that the point is 'receiving', while the cone in the +t section is the information that the point is 'sending'.