Volume 4, Number 4, Article 9, Pages 329-351 doi:10.1167/4.4.9 http://journalofvision.org/4/4/9/ ISSN 1534-7362
Accuracy and precision of objective refraction from wavefront aberrations
Larry N. Thibos
School of Optometry, Indiana University, Bloomington, IN, USA
[home] [e-mail]
Xin Hong
School of Optometry, Indiana University, Bloomington, IN, USA
Arthur Bradley
School of Optometry, Indiana University, Bloomington, IN, USA
[home] [e-mail]
Raymond A. Applegate
College of Optometry, University of Houston, Houston, TX, USA
[home] [e-mail]
Abstract

We determined the accuracy and precision of 33 objective methods for predicting the results of conventional, sphero-cylindrical refraction from wavefront aberrations in a large population of 200 eyes. Accuracy for predicting defocus (as specified by the population mean error of prediction) varied from –0.50 D to +0.25 D across methods. Precision of these estimates (as specified by 95% limits of agreement) ranged from 0.5 to 1.0 D. All methods except one accurately predicted astigmatism to within ±1/8D. Precision of astigmatism predictions was typically better than precision for predicting defocus and many methods were better than 0.5D. Paraxial curvature matching of the wavefront aberration map was the most accurate method for determining the spherical equivalent error whereas least-squares fitting of the wavefront was one of the least accurate methods. We argue that this result was obtained because curvature matching is a biased method that successfully predicts the biased endpoint stipulated by conventional refractions. Five methods emerged as reasonably accurate and among the most precise. Three of these were based on pupil plane metrics and two were based on image plane metrics. We argue that the accuracy of all methods might be improved by correcting for the systematic bias reported in this study. However, caution is advised because some tasks, including conventional refraction of defocus, require a biased metric whereas other tasks, such as refraction of astigmatism, are unbiased. We conclude that objective methods of refraction based on wavefront aberration maps can accurately predict the results of subjective refraction and may be more precise. If objective refractions are more precise than subjective refractions, then wavefront methods may become the new gold standard for specifying conventional and/or optimal corrections of refractive errors.




History
Received September 28, 2003; published April 23, 2004; corrected April 28, 2005
Citation
Thibos, L. N., Hong, X., Bradley, A., & Applegate, R. A. (2004). Accuracy and precision of objective refraction from wavefront aberrations. Journal of Vision, 4(4):9, 329-351, http://journalofvision.org/4/4/9/, doi:10.1167/4.4.9.
Keywords
visual optics, optical aberrations, refraction, metrics of optical quality
for related articles by these authors

for papers that cite this paper


Introduction
The purpose of a conventional, ophthalmic refraction of the eye is to determine that combination of spherical and cylindrical lenses which optimizes visual acuity for distant objects. The underlying assumption of refraction is that visual acuity is maximized when the quality of the retinal image is maximized. Furthermore, it is commonly assumed that retinal image quality is maximized when the image is optimally focused. For these reasons, the endpoint of a subjective refraction is taken as an operational definition of the term “best correction” as applied to eyes.
This paper is concerned with the problem of objectively determining the best correction of an eye from measurements of wavefront aberrations. Aberrometers measure all of the eye’s monochromatic aberrations and display the result in the form of an aberration map that describes the variation in optical path length from source to retinal image through each point in the pupil. Zernike expansion of an aberration map includes the second order aberrations of defocus and astigmatism. Thus, one obvious strategy for objective refraction is to prescribe correcting lenses based on Zernike coefficients of the second-order.
Unfortunately, the problem is not solved so easily. Several studies have shown that eliminating the second-order Zernike aberrations does not necessarily optimize the subjective impression of best-focus nor the objective measurement of visual performance (Applegate, Ballentine, Gross, Sarver, & Sarver, 2003; Applegate, Marsack, Ramos, & Sarver, 2003; Guirao & Williams, 2003; Thibos, Hong, Bradley, & Cheng, 2002). Eliminating second-order Zernike aberrations is equivalent to minimizing the root mean squared (RMS) wavefront error, but this minimization does not necessarily optimize the quality of the retinal image (King, 1968; Mahajan, 1991). Thus a search has begun for alternative metrics of optical quality that are optimized by subjective refraction when higher-order aberrations are present.
A variety of problems must be solved when converting an aberration map into a prescription for corrective lenses or refractive surgery. One of the most important is a correction for the eye’s chromatic aberration. Objective aberrometers typically use infrared light, for which the eye has relatively low refractive power compared to visible light. Optical models of longitudinal chromatic aberration (Thibos, Ye, Zhang, & Bradley, 1992) can be extrapolated to estimate the difference in optical power of the eye between the measurement wavelength and some visible wavelength, but it is unclear what wavelength should be chosen as a reference for any given eye. Furthermore, since only one wavelength can be in-focus at a time, some method is needed to factor in the relative contribution of all wavelengths, each with a different amount of defocus and a different luminance, in order to objectively refract an eye for polychromatic objects.
Another sticky problem is the lack of a universally-accepted metric of image quality that could be used to establish objectively the state of optimum-focus for an aberrated eye. One purpose of this paper is to describe a variety of such metrics based on general principles described elsewhere (Cheng, Thibos, & Bradley, 2003; Williams, Applegate, & Thibos, 2004). Assuming that consensus agreement could be achieved for a metric of choice, one still needs to deal with the fact that identifying the best correction is a multi-dimensional problem in optimization. Guirao & Williams (Guirao & Williams, 2003) have described an iterative method for finding the optimum sphere, cylinder and axis parameters that optimize a metric of image quality. Other possibilities include an objective version of the clinical technique of refraction by successive elimination. A first approximation would eliminate the bulk of defocus error by correcting the eye with a spherical lens of power M, the so-called spherical equivalent. Next, the eye’s astigmatism is corrected with a cylindrical lens, followed by a fine-tuning of the spherical lens power if necessary. This is the basis of most of the methods described below.
A different kind of problem is to incorporate into the method the refractionist’s rule “maximum plus to best visual acuity” (Borisch, 1970). According to this clinical maxim, the spherical refractive error of myopic eyes should be deliberately under-corrected. The amount of under-correction is not enough to diminish visual acuity, but it is sufficient to minimize unnecessary accommodation and to maximize the usable depth of focus (DOF) at distance and near. These twin goals are achieved by prescribing a spherical lens power that is slightly less negative (in the case of myopia) or slightly more positive (in the case of hyperopia) than the lens required to make the retina conjugate to infinity. Instead, the prescribed lens conjugates the retina with a plane at the hyperfocal distance, which is the nearest distance the retina can focus on without significantly reducing visual performance for a target located at infinity (Campbell, 1957). Consequently, the eye is left in a slightly myopic state (Figure 1B), compared to an optimum correction that would place the retina conjugate to infinity (Figure 1A). Note that the diagram in Figure 1 has been simplified by assuming that any astigmatism has already been fully corrected using the appropriate cylindrical lens.
fig01.gif
Figure 1. Two criteria for refracting the eye. (A) An optimum refraction conjugates the retina with infinity. In this case the ideal correcting lens images infinity at the eye’s far point (). (B) A conventional refraction conjugates the fovea with the eye’s hyperfocal point (), which lies closer to the eye by an amount equal to half the depth-of-field (DOF). In this case the correcting lens images infinity at a point (o) slightly beyond the eye’s far point and therefore the eye remains slightly myopic.
Yet another issue is the extent to which neural factors need to be taken into account when converting an aberration map into a prescription. One such neural factor is the angular sensitivity of cone photoreceptors (Enoch & Lakshminarayanan, 1991) which is commonly modeled optically by an apodization filter in the pupil plane (Bradley & Thibos, 1995; Metcalf, 1965). Post-receptoral neural processing of the retinal image affects the processing of blurred retinal images in a manner that can be modeled as a mathematical convolution of the optical point-spread function with a neural point-spread function (Thibos & Bradley, 1995). This too may be construed as a form of apodization since the effect of the convolution will be to attenuate the remote tails of a blurred point-spread function (PSF).
Recently Guirao and Williams (Guirao & Williams, 2003) described a variety of methods for quantifying the optical quality of an eye based on (1) analysis of wavefront aberrations using pupil-plane metrics and (2) analysis of retinal image quality using image-plane metrics. They reported that all five image plane metrics they considered were more accurate than two pupil-plane metrics in predicting the optimum subjective refraction for a polychromatic target for a small population of 6 eyes. Further testing was done on a large population of 146 eyes for which aberration data for a fixed, 5.7 mm pupil were available in the literature. Unfortunately, a variety of uncontrolled conditions precluded strong conclusions from this large population (e.g. possible fluctuations of accommodation, unknown pupil size during subjective refraction, binocular refractions that likely yielded sub-optimal acuity endpoints) but nevertheless the authors found a close correlation between subjective and objective refractions computed from image-plane metrics. Although visual performance during refraction presumably depended on some combination of optical and neural factors, they found that optimizing the optical image without considering neural factors led to accurate prediction of the outcome of subjective refraction. However, no assessment of the precision of these predictions was reported.
The purpose of our study was to evaluate two general approaches to converting an aberration map into a conventional sphero-cylindrical prescription. The first approach is a surface-fitting procedure designed to find the nearest sphero-cylindrical approximation to the actual wavefront aberration map. The second approach involves a virtual through-focus experiment in which the computer adds or subtracts various amounts of spherical or cylindrical wavefronts to the aberration map until the optical quality of the eye is maximized. Preliminary accounts of this work have been presented (Thibos, Bradley, & Applegate, 2002; Thibos, Hong, & Bradley, 2001).
Methods
Refraction based on the principle of equivalent quadratic
We define the equivalent quadratic of a wavefront aberration map as that quadratic (i.e. a sphero-cylindrical) surface which best represents the map. This idea of approximating an arbitrary surface with an equivalent quadratic is a simple extension of the common ophthalmic technique of approximating a sphero-cylindrical surface with an equivalent sphere. Two methods for determining the equivalent quadratic from an aberration map are presented next.
Least-squares fitting
One common way to fit an arbitrarily aberrated wavefront with a quadratic surface is to minimize the sum of squared deviations between the two surfaces. This least-squares fitting method is the basis for Zernike expansion of wavefronts. Because the Zernike expansion employs an orthogonal set of basis functions, the least-squares solution is given by the second-order Zernike coefficients, regardless of the values of the other coefficients. These second-order Zernike coefficients can be converted to a sphero-cylindrical prescription in power vector notation using Equation 1,
eq01.gif(1)
where cnm is the nth order Zernike coefficient of meridional frequency m, and r is pupil radius. The power vector notation is a cross-cylinder convention that is easily transposed into conventional minus-cylinder or plus-cylinder formats used by clinicians (see eqns 22, 23 of (Thibos, Wheeler, & Horner, 1997).
Paraxial curvature matching
Curvature is the property of wavefronts that determines how they focus. Thus, another reasonable way to fit an arbitrary wavefront with a quadratic surface is to match the curvature of the two surfaces at some reference point. A variety of reference points could be selected, but the natural choice is the pupil center. Two surfaces that are tangent at a point and have exactly the same curvature in every meridian are said to osculate. Thus, the surface we seek is the osculating quadric. Fortunately, a closed-form solution exists for the problem of deriving the power vector parameters of the osculating quadratic from the Zernike coefficients of the wavefront (Thibos et al., 2002). This solution is obtained by computing the curvature at the origin of the Zernike expansion of the Seidel formulae for defocus and astigmatism. This process effectively collects all r2 terms from the various Zernike modes. We used the OSA definitions of the Zernike polynomials, each of which has unit variance over the unit circle (Thibos, Applegate, Schwiegerling, & Webb, 2000). The results given in Equation 2 are truncated at the sixth Zernike order but could be extended to higher orders if warranted.
eq02.gif(2)
Refraction based on maximizing optical or visual quality
An empirical way to determine the focus error of an eye (with accommodation paralyzed) is to move an object axially along the line-of-sight until the retinal image of that object appears subjectively to be well focused. This procedure is easily simulated mathematically by adding a spherical wavefront to the eye’s aberration map and then computing the retinal image using standard methods of Fourier optics as illustrated in Movie 1. The curvature of the added wavefront can be systematically varied to simulate a through-focus experiment that varies the optical quality of the eye+lens system over a range from good to bad. Given a suitable metric of optical quality, this computational procedure yields the optimum power M of that spherical correcting lens needed to maximize optical quality of the corrected eye. With this virtual spherical lens in place, the process can be repeated for through-astigmatism calculations to determine the optimum values of J0 and J45 needed to maximize image quality. If necessary, a second iteration could be used to fine-tune results by repeating the above process with these virtual lenses in place. However, the analysis reported below did not include a second iteration.
Movie 1. Dynamic simulation of the through-focus method of objective refraction. To determine the optimum value M of a spherical defocusing lens, a pre-determined sequence of M-values are used to modulate the wavefront map in the same way that a real lens alters the eye’s wavefront aberration function. From the new aberration map we compute the retinal point-spread function (PSF), optical transfer function (OTF), and retinal image of an eye chart. Scalar metrics of optical quality are used to optimize focus (M). The process is then repeated to optimize astigmatism parameters J0, J45. This example is for an eye with 0.1 mm of spherical aberration.
The computational method described above captures the essence of clinical refraction by mathematically simulating the effects of sphero-cylindrical lenses of various powers. Our method is somewhat simpler to implement than that described by Guirao & Williams (Guirao & Williams, 2003) who used an iterative searching method to determine that combination of spherical and cylindrical lenses which maximizes the eye’s optical quality. Regardless of which searching algorithm is used, a suitable metric of optical quality is required as a merit function. Guirao and Williams used 5 such metrics of image quality. In Appendix A we expand their list to 31 metrics by systematically pursuing three general approaches to quantifying optical quality: (1) wavefront quality, (2) retinal image quality for point objects, and (3) retinal image quality for grating objects. Implementation of image sharpness metrics for extended objects, such as a letter chart, (Fienup & Miller, 2003; Hultgren, 1990) have been left for future work. Several of the implemented metrics include a neural component that takes into account the spatial filtering of the retinal image imposed by the observer’s visual system. Strictly speaking, such metrics should be referred to as metrics of neuro-optical quality or visual quality, but for simplicity we use the term “optical quality metric” generically. For each of these 31 metrics we used the virtual refraction procedure described above to determine (to the nearest 1/8 D) the values of M, J0 and J45 required to maximize the metric. These objective refractions were then compared with conventional subjective refractions. A listing of acronyms for the various refraction methods is given in Table 1.
Evaluation of methods for objective refraction
To judge the success of an objective method of refraction requires a gold standard for comparison. The most clinically relevant choice is a subjective refraction performed for Sloan letter charts illuminated by white light. Accordingly, we evaluated our objective refractions against the published results of the Indiana Aberration Study (Thibos et al., 2002). That study yielded a database of aberration maps for 200 eyes that were subjectively well-corrected by clinical standards. The methodology employed avoided the problems mentioned above that limited the conclusions drawn by Guirao & Williams. A brief summary of the experimental procedure used in the Indiana Aberration Study is given next.
Subjective refractions were performed to the nearest 0.25D on 200 normal, healthy eyes from 100 subjects using the standard optometric protocol of maximum plus to best visual acuity. Accommodation was paralyzed with 1 drop of 0.5% cyclopentalate during the refraction. Optical calculations were performed for the fully dilated pupil, which varied between 6-9 mm for different eyes. The refractive correction was taken to be that sphero-cylindrical lens combination which optimally corrected astigmatism and conjugated the retina with the eye’s hyperfocal point (Figure 1b). This prescribed refraction was then implemented with trial lenses and worn by the subject during subsequent aberrometry (l=633 nm). This experimental design emphasized the effects of higher-order aberrations by minimizing the presence of uncorrected second-order aberrations. The eye’s longitudinal chromatic aberration was taken into account by the different working distances used for aberrometry and subjective refraction as illustrated in Figure 2. Assuming the eye was well focused for 570 nm when viewing the polychromatic eye chart at 4 m, the eye would also have been focused at infinity for the 633nm laser light used for aberrometry (Thibos et al., 1992).
fig02.gif
Figure 2. Schematic diagram of optical condition of the Indiana Aberration Study. Yellow light with 570 nm wavelength is assumed to be in focus during subjective refraction with a white-light target at 4 m. At the same time, 633 nm light from a target at infinity would be well focused because of the eye’s longitudinal chromatic aberration.
Since all eyes were corrected with spectacle lenses during aberrometry, the predicted refraction was M = J0 = J45 = 0. The level of success achieved by the 33 methods of objective refraction described above was judged on the basis of precision and accuracy at matching these predictions (Figure 3). Accuracy for the spherical component of refraction was computed as the population mean of M as determined from objective refractions. Accuracy for the astigmatic component of refraction was computed as the population mean of (Bullimore, Fusaro, & Adams, 1998) vectors. Precision is a measure of the variability in results and is defined for M as twice the standard deviation of the population values, which corresponds to the 95% limits of agreement (LOA) (Bland & Altman, 1986). The confidence region for astigmatism is an ellipse computed for the bivariate distribution of J0 and J45. This suggests a definition of precision as the geometric mean of the major and minor axes of the 95% confidence ellipse.
fig03.gif
Figure 3. Graphical depiction of the concepts of precision and accuracy as applied to the 1-dimensional problem of estimating spherical power (left column of diagrams) and the 2-dimensional problem of estimating astigmatism (right column of diagrams).
In our view, accuracy and precision are equally important for refraction. A method that is precise but not accurate will yield the same wrong answer every time. Conversely, a method that is accurate but not precise gives different answers every time and is correct only on average. Thus we seek a method that is both accurate and precise. However, one might argue that lack of accuracy implies a systematic bias that could be removed by a suitable correction factor applied to any individual eye. One way to obtain such a correction factor is to examine the population statistics of a large number of eyes, as we have done in this study. Any systematic bias obtained for this group could then be used as a correction factor for future refractions, assuming of course that the individual in question is well represented by the population used to determine the correction factor. Although this may be an expedient solution to the problem of objective refraction, it lacks the power of a theoretically sound account of the reasons for systematic biases in the various metrics of optical quality.
Results
Refraction based on equivalent quadratic
The two methods for determining the equivalent quadratic surface for a wavefront aberration map gave consistently different results. A frequency histogram of results for the least-squares method (Figure 4A) indicated an average spherical refractive error of M = –0.39 D. In other words, this objective method predicted the eyes were, on average, significantly myopic compared to subjective refraction. To the contrary, the method based on paraxial curvature matching (Figure 4B) predicted an average refractive error close to zero for our population. Both methods accurately predicted the expected astigmatic refraction as shown by the scatter plots and 95% confidence ellipses in Figure 5.
fig04.gif
Figure 4. Frequency distribution of results for the least-squares method for fitting the wavefront aberration map with a quadratic surface. Dilated pupil size ranged from 6 to 9 mm across the population.
fig05.gif
Figure 5. Scatter plots of (A) the least-squares fit of the wavefront over the entire pupil and (B) paraxial curvature matching methods of determining the two components of astigmatism. Circles show the results for individual eyes, green cross indicates the mean of the 2-dimensional distribution, and ellipses are 95% confidence intervals. Precision is the geometrical mean of the major and minor axes of the ellipse.
Refraction based on maximizing optical or visual quality
Computer simulation of through-focus experiments to determine that lens (either spherical or astigmatic) which optimizes image quality are computationally intensive, producing many intermediate results of interest but too voluminous to present here. One example of the type of intermediate results obtained when optimizing the pupil fraction metric PFWc (see Table 1 for a list of acronyms) is shown in Figure 6A. For each lens power over the range –1 to +1 D (in 0.125 D steps) a curve is generated relating RMS wavefront error to pupil radius. Each of these curves crosses the criterion level (l/4 in our calculations) at some radius value. That radius is interpreted as the critical radius since it is the largest radius for which the eye’s optical quality is reasonably good. The set of critical radius values can then be plotted as a function of defocus, as shown in Figure 6B. This through-focus function peaks at some value of defocus, which is taken as the optimum lens for this eye using this metric. In this way the full dataset of Figure 6 is reduced to a single number.
N
Acronym
Brief Description
1
RMSw
Standard deviation of wavefront
2
PV
Peak-valley
3
RMSs
RMSs: std(slope)
4
PFWc
Pupil fraction for wavefront (critical pupil)
5
PFWt
Pupil fraction for wavefront (tessellation)
6
PFSt
Pupil fraction for slope (tessellation)
7
PFSc
Pupil fraction for slope (critical pupil)
8
Bave
Average Blur Strength
9
PFCt
Pupil fraction for curvature (tessellation)
10
PFCc
Pupil fraction for curvature (critical pupil)
11
D50
50% width (min)
12
EW
Equivalent width (min)
13
SM
Sqrt(2nd moment) (min)
14
HWHH
Half width at half height (arcmin)
15
CW
Correlation width (min)
16
SRX
Strehl ratio in space domain
17
LIB
Light in the bucket (norm)
18
STD
Standard deviation of intensity (norm)
19
ENT
Entropy (bits)
20
NS
Neural sharpness (norm)
21
VSX
Visual Strehl in space domain
22
SFcMTF
Cutoff spat. freq. for rMTF (c/d)
23
AreaMTF
Area of visibility for rMTF (norm)
24
SFcOTF
Cutoff spat. freq. for rOTF (c/d)
25
AreaOTF
Area of visibility for rOTF (norm)
26
SROTF
Strehl ratio for OTF
27
VOTF
OTF vol/ MTF vol
28
VSOTF
Visual Strehl ratio for OTF
29
VNOTF
CS*OTF vol/ CS*MTF vol
30
SRMTF
Strehl ratio for MTF
31
VSMTF
Visual Strehl ratio for MTF
32
LSq
Least squares fit
33
Curve
Curvature fit
Table 1. Listing of acronyms for refraction methods. Ordering is that used in correlation matrices (Figures 8, A8).
fig06.gif
Figure 6. Rank ordering (based on accuracy) of 33 methods for predicting spherical refractive error. Red symbols indicate means for metrics based on wavefront quality. Black symbols indicate mean for metrics based on image quality. Error bars indicate ± 1 standard deviation of the population. Numerical data are given in Table 2.
Similar calculations were then repeated for other eyes in the population to yield 200 estimates of the refractive error using this particular metric. A frequency histogram of these 200 values similar to those in Figure 4 was produced for inspection by the experimenters. Such histograms were then summarized by a mean value, which we took to be a measure of accuracy, and a standard deviation, which (when doubled) was taken as a measure of precision.
The accuracy and precision of the 31 methods for objective refraction based on optimizing metrics of optical quality, plus the two methods based on wavefront fitting, are displayed in rank order in Figure 7. Mean accuracy varied from –0.50 D to +0.25 D. The 14 most accurate methods predicted M to within 1/8 D and 24 methods were accurate to within 1/4 D. The method of paraxial curvature matching was the most accurate method, closely followed by the through-focus method for maximizing the wavefront quality metrics PFWc and PFCt. Least-squares fitting was one of the least accurate methods (mean error = -0.39 D).
fig07.gif
Figure 7. An example of intermediate results for the through-focus calculations needed to optimize the pupil fraction metric PFWc. (A) The RMS value is computed as a function of pupil radius for a series of defocus values added to the wavefront aberration function of this eye. The pupil size at the intersection points of each curve with the criterion level of RMS are plotted as a function of lens power in (B). The optimum correcting lens for this eye is the added spherical power that maximized the critical pupil diameter (and therefore maximized PFWc) which in this example is +0.125 D.
Precision of estimates of M ranged from 0.5 to 1.0 D. A value of 0.5 D means that the error in predicting M for 95 percent of the eyes in our study fell inside the confidence range given by the mean ± 0.5 D. The most precise method was PFSc (±0.49D), which was statistically significantly better than the others (F-test for equality of variance, 5% significance level). Precision of the next 14 methods in rank ranged from ±0.58D to ±0.65D. These values were statistically indistinguishable from each other. This list of the 15 most precise methods included several examples from each of the three categories of wavefront quality, point-image quality, and grating-image quality. Rank ordering of all methods for predicting defocus is given in Table 2.

Accuracy
Precision
Rank
Metric
Mean
Metric
2xSTD
1
PFCc
0.2406
PFSc
0.4927
2
Curv
-0.006
AreaOTF
0.5803
3
PFWc
-0.0063
VSOTF
0.5806
4
PFCt
-0.0425
PFWc
0.5839
5
SFcMTF
-0.0425
LIB
0.5951
6
LIB
-0.0681
NS
0.5961
7
VSX
-0.0731
VSMTF
0.5987
8
SFcOTF
-0.0737
EW
0.6081
9
CW
-0.0912
SRX
0.6081
10
EW
-0.1006
AreaMTF
0.6112
11
SRX
-0.1006
PFCt
0.6213
12
VSMTF
-0.1131
STD
0.63
13
NS
-0.1144
SFcMTF
0.6343
14
VOTF
-0.125
VSX
0.6391
15
PFSc
-0.1281
D50
0.6498
16
VNOTF
-0.1575
CW
0.6558
17
AreaMTF
-0.165
PFWt
0.6575
18
STD
-0.1656
PFSt
0.6577
19
VSOTF
-0.1794
RMSw
0.6702
20
SROTF
-0.1875
SFcOTF
0.6786
21
HWHH
-0.200
SRMTF
0.6888
22
PFSt
-0.2162
SROTF
0.69
23
AreaOTF
-0.2269
ENT
0.6987
24
SRMTF
-0.2544
LSq
0.7062
25
D50
-0.2825
HWHH
0.7115
26
PFWt
-0.3231
RMSs
0.7159
27
ENT
-0.3638
Curv
0.7202
28
RMSw
-0.3831
SM
0.7315
29
LSq
-0.3906
VNOTF
0.7486
30
RMSs
-0.425
Bave
0.7653
31
SM
-0.4319
PV
0.7725
32
PV
-0.4494
VOTF
0.8403
33
Bave
-0.4694
PFCc
0.9527
Table 2. Rank ordering of methods for predicting spherical equivalent M based on accuracy and precision. Acronyms in red type are wavefront quality methods. Brief descriptions of acronyms are given in Table 1. Detailed descriptions are in Appendix. Units are diopters.
A similar process was used to determine the accuracy for estimating astigmatism. We found that all methods except one (PFCc) had a mean error across the population of less than 1/8 D. This accuracy is the best we could reasonably expect, given that the subjective refractions and the virtual refractions used to predict subjective refractions were both quantized at 1/8 D of cross-cylinder power. Precision of astigmatism predictions was typically better than precision for predicting defocus. The precision of all metrics for predicting astigmatism ranged from ±0.32D to ±1.0D and the 15 best methods were better than ±0.5D. Rank ordering of all methods for predicting astigmatism is given in Table 3.

Accuracy
Precision
Rank
Metric
Mean
Metric
2xSTD
1
HWHH
0.0155
LSq
0.3235
2
LIB
0.0164
PFSc
0.3315
3
PFCt
0.0192
Bave
0.3325
4
AreaMTF
0.0258
RMSs
0.3408
5
ENT
0.0273
RMSw
0.3429
6
NS
0.0281
Curv
0.3568
7
VSX
0.03
PFWc
0.3639
8
PFSt
0.0305
PV
0.4278
9
AreaOTF
0.0313
VSMTF
0.4387
10
EW
0.0343
AreaMTF
0.4423
11
SRX
0.0343
NS
0.4544
12
SRMTF
0.038
PFCt
0.4715
13
VSMTF
0.0407
STD
0.4752
14
STD
0.0422
PFWt
0.4923
15
CW
0.0576
SM
0.4967
16
RMSs
0.0589
SRMTF
0.5069
17
VSOTF
0.0594
EW
0.5181
18
PFSc
0.0608
SRX
0.5181
19
D50
0.0665
CW
0.5287
20
SM
0.0668
LIB
0.535
21
Bave
0.0685
AreaOTF
0.5444
22
SROTF
0.0724
SFcMTF
0.5659
23
PFWc
0.0745
VSX
0.5813
24
VOTF
0.0787
VSOTF
0.6796
25
LSq
0.0899
HWHH
0.6796
26
RMSw
0.0909
SROTF
0.7485
27
Curv
0.0913
PFSt
0.7555
28
PV
0.098
SFcOTF
0.7821
29
PFWt
0.1039
VNOTF
0.816
30
VNOTF
0.1059
D50
0.8416
31
SFcOTF
0.113
ENT
0.8751
32
SFcMTF
0.1218
VOTF
0.9461
33
PFCc
0.8045
PFCc
1.0005
Table 3. Rank ordering of methods for predicting astigmatism parameters J0 and J45 jointly. Acronyms in red type are wavefront quality methods. Brief descriptions of acronyms are given in Table 1. Detailed descriptions are in Appendix. Units are diopters.
In comparing the precision for predicting defocus and astigmatism we found that 7 metrics were in the top-15 list for both types of prediction. Five of these were also accurate to within 1/8 D for predicting both defocus and astigmatism. Thus 5 metrics (PFSc, PFWc, VSMTF, NS, and PFCt) emerged as reasonably accurate and among the most precise. Three of these successful metrics were pupil plane metrics and two were image plane metrics. These results demonstrate that accurate predictions of subjective refractions are possible with pupil plane metrics. However, such metrics do not include the process of image formation that occurs in the eye, a process that must influence subjective image quality. For this reason, image-plane metrics of visual quality are more germane to vision models of the refraction process that seek to capture the subjective notion of a well-focused retinal image (Williams, Applegate, & Thibos, 2004).
Correlation between multiple objective refractions for the same eye
One implication of the results presented above is that different methods of objective refraction that yield similar refractions on average are likely to be statistically correlated. We tested this prediction by computing the correlation coefficient between all possible pairs of methods for predicting M. The resulting correlation matrix is visualized in Figure 8. For example, the left-most column of tiles in the matrix represents the Pearson correlation coefficient r between the first objective refraction method in the list (RMSw) and all other methods in the order specified in Table 1. Notice that the values of M predicted by optimizing RMSw are highly correlated with the values returned by methods 3 (RMSs), 8 (Bave), 19 (ENT), and 32 (least-squares fit). As predicted, all of these metrics are grouped at the bottom of the ranking in Figure 7. To the contrary, refractions using RMSw are poorly correlated with values returned by methods 4 (PFWc), 9 (PFCt), 21 (VSX), 24 (SFcOTF), and 33 (Curvature fit). All of these metrics are grouped at the top of the ranking in Figure 7, which further supports this connection between accuracy and correlation. A similar analysis of the correlation matrix for astigmatism parameters is not as informative because there was very little difference between the various methods for predicting J0 and J45.
fig08.gif
Figure 8. Correlation matrix for values of M determined by objective refraction. Metric number is given in Table 1.
Another interesting feature of Figure 8 is that some refraction methods (e.g. PFCc, VOTF, VNOTF) are very poorly correlated with all other methods. This result for metric PFCc is explained by the fact that PFCc was the only metric to produce hyperopic refractions in the vicinity of M=+0.25D. However, this argument does not apply to the other two examples that are poorly correlated with most other metrics even though these other metrics produced similar refractions on average (e.g. 20 (NS), 7 (PFSc), and 23 (AreaMTF)). This result suggests that maximizing metrics VOTF and VNOTF optimizes a unique aspect of optical and visual quality that is missed by other metrics. In fact, these two metrics were specifically designed to capture infidelity of spatial phase in the retinal image.
Discussion
The least-squares method for fitting an aberrated wavefront with a spherical wavefront is the basis of Zernike expansion to determine the defocus coefficient. The failure of this method to accurately predict the results of subjective refraction implies that the Zernike coefficient for defocus is an inaccurate indicator of the spherical equivalent of refractive error determined by conventional subjective refractions. On average, this metric predicted that eyes in our study were myopic by -0.39D when in fact they were well corrected.
To the contrary, matching paraxial curvature accurately predicted the results of subjective refraction. This method is closely related to the Seidel expansion of wavefronts because it isolates the purely parabolic (r2) term. It also corresponds to a paraxial analysis since the r2 coefficient is zero when the paraxial rays are well focused. Although this method was one of the least accurate methods for predicting astigmatism, it nevertheless was accurate to within 1/8D. The curvature method was one of the most precise methods for predicting astigmatism but was significantly less precise than some other methods for predicting defocus. For this reason it was eliminated from the list of 5 most precise and accurate methods.
Figure 7 may be interpreted as a table of correction factors that could potentially make all of the predictions of defocus equally accurate. While this might seem a reasonable approach to improving accuracy, it may prove cumbersome in practice if future research should show that the correction factors vary with pupil diameter, age, or other conditions.
We do not know why the various metrics have different amounts of systematic bias, but at least two possibilities have already been mentioned. First, to undertake the data analysis we needed to make an assumption about which wavelength of light was well focused on the retina during subjective refraction with a polychromatic stimulus. We chose 570 nm as our reference wavelength based on theoretical and experimental evidence (Charman & Tucker, 1978; Thibos & Bradley, 1999) but the actual value is unknown. Changing this reference wavelength by just 20 nm to 550 nm would cause a 0.1 D shift in defocus, which is a significant fraction of the differences in accuracy between the various metrics.
A second source of bias may be attributed to the difference between optimal and conventional refraction methods. The objective refraction procedures described in this paper are designed to determine the optimum refraction (Figure 1a) whereas the subjective refractions were conventional (Figure 1b). The difference between the two endpoints is half the depth-of-focus (DOF) of the eye. The DOF for subjects in the Indiana Aberration Study is unknown, but we would anticipate a value of perhaps ±0.25D (Atchison, Charman, & Woods, 1997) which is about half the total range of focus values spanned in Figure 7. Accordingly, we may account for the results in Figure 7 by supposing that the curvature matching technique happens to locate the far end of the DOF interval (which is located at optical infinity in a conventional refraction) whereas some middle-ranking metric (such as VSOTF) locates the middle of the DOF, located at the hyperfocal distance. This inference is consistent with the fact that most eyes in the Indiana Aberration Study had positive spherical aberration. Such eyes have less optical power for paraxial rays than for marginal rays. Consequently, the retina will appear to be conjugate to a point that is beyond the hyperfocal point if the analysis is confined to the paraxial rays.
The preceding arguments suggest that the superior accuracy of the curvature method for determining the spherical equivalent of a conventional refraction is due to a bias in this method that favors the far end of the eye’s DOF. In short, curvature matching (and several other metrics with similar accuracy) is a biased method that successfully predicts a biased endpoint. By the same argument, the biased curvature method is not expected to predict astigmatism accurately because conventional refractions are unbiased for astigmatism. Although this line of reasoning explains why the paraxial curvature method will locate a point beyond the hyperfocal point, we lack a convincing argument for why the located point should lie specifically at infinity. Perhaps future experiments that include measurement of the DOF as well as the hyperfocal distance will clarify this issue and at the same time help identify objective methods for determining the hyperfocal distance.
Pursuing the above line of reasoning suggests that some metric near the bottom of the accuracy ranking, such as RMSw, locates the near end of the DOF. This accounting is consistent with the findings of Guirao and Williams (Guirao & Williams, 2003) and of Cheng et al. (Cheng, Bradley, & Thibos, 2004) that the optimum focus lies somewhere between the more distant paraxial focus and the nearer RMS focus. Taken together, the least-squares and curvature fitting methods would appear to locate the two ends of the DOF interval. While perhaps a mere coincidence, if this intriguing result could be substantiated theoretically then it might become a useful method to compute the DOF from the wavefront aberration map for individual eyes.
A variety of other factors may also contribute to the range of inaccuracies documented in Figure 7. For example, all of the image quality metrics reported in this paper are based on monochromatic light. Generalizing these metrics to polychromati