Paraxial ray propagation in a spherical lens system and the Lagrange invariant
A ray in the paraxial region has a very small angle θ with respect to the optical axis of the optical system, and all of its paths pass very close to the optical axis. In this case, the following approximation holds for θ:
sin θ = θ, tan θ = θ, cos θ = 1
Therefore, limiting the area to the paraxial region makes ray tracing calculations extremely simple and is useful for understanding the structure of the optical system. Here, we will show how to derive the calculation formula.
table of contents:
・Derivation of Abbe's invariants
When a ray of light passes through the boundary surface (spherical shape) of a material, the ray of light in the paraxial region can be illustrated as follows:
(i) When u > 0, u' > 0,

Figure 1-1
(ii) When u > 0, u' < 0,

Figure 1-2
*For dimensional notation, refer to Appendix: Definitions of direction and sign for one‑dimensional vectors and angles
The parameters in the above figure are defined as follows:
n : Refractive index of the material before incidence on the boundary surface
n' : Refractive index of the material after exiting the boundary surface
r : Radius of curvature of the boundary surface
u: The angle between the incident ray on the boundary surface and the central axis (counterclockwise is positive)
u': Angle between the exit ray from the boundary surface and the central axis (counterclockwise is positive)
i: The angle between the incident light on the boundary surface and the normal to the boundary surface (counterclockwise is positive)
i': The angle between the light emitted from the boundary surface and the normal to the boundary surface (counterclockwise is positive)
φ: The angle between the normal at the incident position of the ray on the boundary surface and the central axis (counterclockwise is positive)
h: Height of the ray at the boundary surface relative to the central axis
s: The distance from the boundary surface to the point where the incident light intersects with the central axis
s': Distance from the boundary surface to the point where the light emitted from the boundary surface intersects with the central axis
Here, the angles u , u' , φ , i , i' and the height h are assumed to be sufficiently small. Also, the color coding for each parameter is as follows: the amount given by the setting is in blue, and the amount given by the calculation formula is in black (in the recurrence formula model shown later).
In this case, the conditions shown in the figure can be expressed as the following equations in both (i) and (ii).

Here, the fact that both (i) and (ii) are expressed by the same equation means that they are both forms of the same model, and it can be said that it is sufficient to consider either one of them.
From (1)-①abc, the following is derived.

Furthermore, according to Snell's law , the following can be said in the paraxial region:
From (1)-①'ab and (1)-②, the Abbe's invariant shown below can be derived.

(1)-③ (Abbe's invariant) describes a ray traveling from point P to point P', and it can be seen that u, u', and h are not included in it. From this, it can be said that in the paraxial region, as shown in Figure 1-3, any ray passing through point P travels toward point P' (i.e., points P and P' are conjugate).

Figure 1-2
・Derivation of Lagrange invariant
When the entire system is rotated around point O (the center of the sphere on the boundary surface), point P moves to point Q and point P' moves to point Q'. Points Q and Q' are also conjugate, but since the boundary surface remains completely unchanged before and after rotating the entire system, it can be said that points Q and Q' were also conjugate before rotating the entire system.
Here, in the paraxial region, arcs PQ and P'Q' can be considered as straight lines perpendicular to the central axis, so the vectors PQ and P'Q' can be treated as vectors perpendicular to the central axis.

Figure 2-1
For each parameter, those shown in Figures 1-1 and 1-2 will be followed, and new parameters will be defined as follows.
y: object height
y': image height
ω: Angle of the ray from the object height y toward the intersection of the boundary surface and the optical axis
ω': Angle of the ray from the intersection of the boundary surface and the optical axis toward image height y'
In this case, the conditions shown in the figure can be expressed as the following equations:

Furthermore, according to Snell's law, the following can be said in the paraxial region:
From (1)-①a, (2)-①ab, and (2)-②, the following relationship can be obtained.

This relation is called the Lagrange invariant or the Helmholtz-Lagrange invariant.
・Recurrence relations and matrix notation for paraxial calculations
Consider a system in which a ray passes through the 0th surface (object surface), the 1st surface, the 2nd surface, ..., the j-th surface, ..., and the k-th surface (image surface) in that order. Here, each surface is assumed to be spherical (flat surfaces are given with an infinite radius of curvature). In this case, if we illustrate the ray in the paraxial region before and after passing through the j-th surface, it looks like this:

Figure 3-1
*For dimensional notation, refer to Appendix: Definitions of direction and sign for one‑dimensional vectors and angles
The subscript j of each parameter means the j-th boundary surface.
For each parameter in the above diagram, those shown in Figures 1-1 and 1-2 will be retained, and new ones will be defined as follows.
dj : Distance from the j-th surface to the (j+1)-th surface
Here again, we assume that the angles uj , u'j , and height hj are sufficiently small. Also, for each parameter, the values given by the settings are shown in blue, and the values given by the calculation formula are shown in black .
First, from (1)-①a, the following can be said about the j-th plane.

Furthermore, from (1)-③ (Abbe's invariant), the following can also be said for the j-th plane.

From (3)-①,②,

Furthermore, the following can be said from Figure 3-1.
