Frame camera models =================== Ames Stereo Pipeline supports a generic Pinhole camera model with several lens distortion models which cover common calibration methods, and also the somewhat more complicated panoramic (*optical bar*) camera model. .. _pinholemodels: Pinhole models -------------- Overview -------- The generic Pinhole model uses the following parameters: - *fu* = The focal length in horizontal pixel units. - *fv* = The focal length in vertical pixel units. - *cu* = The horizontal offset of the principal point of the camera in the image plane in pixel units, from 0,0. - *cv* = The vertical offset of the principal point of the camera in the image plane in pixel units, from 0,0. - *pitch* = The size of each pixel in the units used to specify the four parameters listed above. This will usually either be 1.0 if they are specified in pixel units or alternately the size of a pixel in millimeters or meters. The focal length is sometimes known as the *principal distance*. The value :math:`cu` is usually approximately half the image width in pixels times the pitch, while :math:`cv` is often the image height in pixels times the pitch, though there are situations when these can be quite different. A few sample Pinhole models are shown later in the text. The underlying mathematical model is described in :numref:`pinholemath`. Along with the basic Pinhole camera parameters, a lens distortion model can be added. Note that the units used in the distortion model must match the units used for the parameters listed above. For example, if the camera calibration was performed using units of millimeters the focal lengths etc. must be given in units of millimeters and the pitch must be equal to the size of each pixel in millimeters. Alternatively, units of meter can be used, and the choice of unit must be documented by the creators of the models. The following lens distortion models are currently supported. (The formulas below may miss some small details; the implementation in ``LensDistortion.cc`` in VisionWorkbench should be the final reference.) Note that the values below change drastically depending on whether the model creator chooses pixel units, or if measuring in millimeters or meters. In either case, all lengths must be consistent and the units documented. * **Null** = A placeholder model that applies no distortion. * **Tsai** = A common distortion model :cite:`tsai1987` similar to the one used by OpenCV and Theia. This model uses the following parameters: *K1, K2* = Radial distortion parameters. *P1, P2* = Tangential distortion parameters. The following equations describe the distortion, starting with the undistorted pixel :math:`(Px, Py)`. .. math:: (x, y) = \left(\frac{Px - cu}{fu}, \frac{Py-cv}{fv}\right) r^{2} = x^{2} + y^{2} x_{dist} = Px + x\left(K_{1}r^{2} + K_{2}r^{4} + 2P_{1}y + P_{2}\left(\frac{r^{2}}{x} + 2x\right)\right) y_{dist} = Py + y\left(K_{1}r^{2} + K_{2}r^{4} + 2P_{2}x + P_{1}\left(\frac{r^{2}}{y} + 2y\right)\right) * **Adjustable Tsai** = A variant of the Tsai model where any number of *K* terms and a skew term (alpha) can be used. Can apply the AgiSoft Lens calibration parameters. * **Brown-Conrady** = An older model based on a centering angle :cite:`brown1966,brown1971`. This model uses the following parameters: *K1, K2, K3* = Radial distortion parameters. *P1, P2* = Tangential distortion parameters. *xp, yp* = Principal point offset. *phi* = Tangential distortion angle in radians. The following equations describe the distortion, note that this model uses non-normalized pixel units, so they can be in millimeters or meters: .. math:: x = x_{dist} - xp y = y_{dist} - yp r^{2} = x^{2} + y^{2} dr = K_{1}r^{3} + K_{2}r^{5} + K_{3}r^{7} x_{undist} = x + x\frac{dr}{r} - (P_{1}r^{2} +P_{2}r^{4})\sin(phi) y_{undist} = y + y\frac{dr}{r} + (P_{1}r^{2} +P_{2}r^{4})\cos(phi) * **Photometrix** = A model matching the conventions used by the Australis software from Photometrix. *K1, K2, K3* = Radial distortion parameters. *P1, P2* = Tangential distortion parameters. *xp, yp* = Principal point offset. *B1, B2* = Unused parameters. The following equations describe the distortion, note that this model uses non-normalized pixel units, so they are in mm. .. math:: x = x_{dist} - xp y = y_{dist} - yp r^{2} = x^{2} + y^{2} dr = K_{1}r^{3} + K_{2}r^{5} + K_{3}r^{7} x_{undist} = x + x\frac{dr}{r} + P_{1}(r^{2} +2x^{2}) + 2P_{2}xy y_{undist} = y + y\frac{dr}{r} + P_{2}(r^{2} +2y^{2}) + 2P_{1}xy * **RPC** = A rational polynomial coefficient model. In this model, one goes from distorted coordinates :math:`(x, y)` to undistorted coordinates via the formula .. math:: x_{undist} = \frac{P_1(x, y)}{Q_1(x, y)} y_{undist} = \frac{P_2(x, y)}{Q_2(x, y)} The functions in the numerator and denominator are polynomials in :math:`x` and :math:`y` with certain coefficients. The degree of polynomials can be any positive integer. RPC distortion models can be generated as approximations to other pre-existing models with the tool ``convert_pinhole_model`` (:numref:`convert_pinhole_model`). This tool also creates RPC to speed up the reverse operation, of going from undistorted to distorted pixels, and those polynomial coefficients are also saved as part of the model. -------------- .. _file_format: File formats ~~~~~~~~~~~~ ASP Pinhole model files are written in an easy to work with plain text format using the extension ``.tsai``. A sample file is shown below. :: VERSION_4 PINHOLE fu = 28.429 fv = 28.429 cu = 17.9712 cv = 11.9808 u_direction = 1 0 0 v_direction = 0 1 0 w_direction = 0 0 1 C = 266.943 -105.583 -2.14189 R = 0.0825447 0.996303 -0.0238243 -0.996008 0.0832884 0.0321213 0.0339869 0.0210777 0.9992 pitch = 0.0064 Photometrix xp = 0.004 yp = -0.191 k1 = 1.31024e-04 k2 = -2.05354e-07 k3 = -5.28558e-011 p1 = 7.2359e-006 p2 = 2.2656e-006 b1 = 0.0 b2 = 0.0 The first half of the file is the same for all Pinhole models: * ``VERSION_X`` = A header line used to track the format of the file. * ``PINHOLE`` = The type of camera model, so that other types can be stored with the .tsai extension. * ``fu, fv, cu, cv`` = The first four intrinsic parameters described in the previous section. * ``u, v, and w_direction`` = These lines allow an additional permutation of the axes of the camera coordinates. By default, the positive column direction aligns with x, the positive row direction aligns with y, and downward into the image aligns with z. * ``C`` = The location of the camera center, usually in the geocentric coordinate system (GCC/ECEF). * ``R`` = The rotation matrix describing the camera's absolute pose in the coordinate system (:numref:`pinholemath`). * ``pitch`` = The pitch intrinsic parameter described in the previous section. The second half of the file describes the lens distortion model being used. The name of the distortion model appears first, followed by a list of the parameters for that model. The number of parameters may be different for each distortion type. Samples of each format are shown below: * **Null** :: NULL * **Tsai** :: TSAI k1 = 1.31024e-04 k2 = -2.05354e-07 p1 = 0.5 p2 = 0.4 * **Adjustable Tsai** :: AdjustableTSAI Radial Coeff: Vector3(1.31024e-04, 1.31024e-07, 1.31024e-08) Tangential Coeff: Vector2(-2.05354e-07, 1.05354e-07) Alpha: 0.4 * **Brown-Conrady** :: BrownConrady xp = 0.5 yp = 0.4 k1 = 1.31024e-04 k2 = -2.05354e-07 k3 = 1.31024e-08 p1 = 0.5 p2 = 0.4 phi = 0.001 * **Photometrix** :: Photometrix xp = 0.004 yp = -0.191 k1 = 1.31024e-04 k2 = -2.05354e-07 k3 = -5.28558e-011 p1 = 7.2359e-006 p2 = 2.2656e-006 b1 = 0.0 b2 = 0.0 * **RPC** :: RPC rpc_degree = 1 image_size = 5760 3840 distortion_num_x = 0 1 0 distortion_den_x = 1 0 0 distortion_num_y = 0 0 1 distortion_den_y = 1 0 0 undistortion_num_x = 0 1 0 undistortion_den_x = 1 0 0 undistortion_num_y = 0 0 1 undistortion_den_y = 1 0 0 This sample RPC lens distortion model represents the case of no distortion, when the degree of the polynomials is 1, and both the distortion and undistortion formula leave the pixels unchanged, that is, the distortion transform is .. math:: (x, y) \to (x, y) = \left(\frac{ 0 + 1\cdot x + 0\cdot y}{1 + 0\cdot x + 0\cdot y}, \frac{0 + 0\cdot x + 1\cdot y)}{1 + 0\cdot x + 0\cdot y}\right). In general, if the degree of the polynomials is :math:`n`, there are :math:`2(n+1)(n+2)` coefficients. The zero-th degree coefficients in the denominator are always set to 1. For several years Ames Stereo Pipeline generated Pinhole files in the binary ``.pinhole`` format. That format is no longer supported. Also in the past Ames Stereo Pipeline has generated a shorter version of the current file format, also with the extension ``.tsai``, which only supported the TSAI lens distortion model. Existing files in that format can still be used by ASP. Note that the ``orbitviz`` tool can be useful for checking the formatting of ``.tsai`` files you create and to estimate the position and orientation. To inspect the orientation use the optional ``.dae`` model file input option and observe the rotation of the 3D model. .. _pinholemath: How the pinhole model is applied ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ As mentioned in :numref:`file_format`, the ASP Pinhole models store the focal length as :math:`fu` and :math:`fv`, the optical center :math:`(cu, cv)` (which is the pixel location at which the ray coming from the center of the camera is perpendicular to the image plane, in units of the pixel pitch), the vector :math:`C` which is the camera center in world coordinates system, and the matrix :math:`R` that is the transform from camera to world coordinates. To go in more detail, a point :math:`Q` in the camera coordinate system gets transformed to a point :math:`P` in the world coordinate system via: .. math:: P = RQ + C Hence, to go from world to camera coordinates one does: .. math:: Q = R^{-1} P - R^{-1} C From here the ``undistorted`` pixel location is computed as: .. math:: \frac{1}{p} \left(fu \frac{Q_1}{Q_3} + cu, fv \frac{Q_2}{Q_3} + cv\right) where :math:`p` is the pixel pitch. Next, a distortion model may be applied, as discussed earlier. .. _panoramic: Panoramic Camera Model ---------------------- ASP also supports a simple panoramic/optical bar camera model for use with images such as the declassified Corona KH4 and Keyhole KH9 images. It implements the model from :cite:`schenk2003rigorous` with the motion compensation from :cite:`sohn2004mathematical`. Such a model looks as follows: :: VERSION_4 OPTICAL_BAR image_size = 110507 7904 image_center = 55253.5 3952 pitch = 7.0e-06 f = 0.61000001430511475 scan_time = 0.5 forward_tilt = -0.261799 iC = -1047140.9611702315 5508464.4323527571 3340425.4078937685 iR = -0.96635634448923746 -0.16918164442572045 0.1937343197650008 -0.23427205529446918 0.26804084264169648 -0.93448954557235941 0.10616976770014927 -0.94843643849513648 -0.29865750042675621 speed = 7700 mean_earth_radius = 6371000 mean_surface_elevation = 4000 motion_compensation_factor = 1.0 scan_dir = left Here, the image size and center are given in pixels, with the width followed by the height. The pixel pitch and focal length ``f`` are in meters. The scan time is seconds, the forward tilt is in radians, the speed is in meters per second, and the Earth radius and mean surface elevation are in meters. The initial camera center ``iC`` is in meters, and the rotation matrix ``iR`` stores the absolute pose. ``scan_dir`` must be set to ``left`` or ``right``. The values ``scan_dir`` and ``use_motion_compensation`` control how the sensor model accounts accounts for the motion of the satellite during the image scan. Without the benefit of detailed historical documents it may require experimentation to find the good initial values for these cameras. When using ``bundle_adjust``, the intrinsic parameters that are solved for are ``speed``, ``motion_compensation_factor``, and ``scan_time``.