Georeferencing terrestrial photography

To make effective use of the information contained in a photograph it is necessary to know what every pixel in the flat image represents in the real world. In this way spatial relationships can be found and the visual data transfered to a map. The photograph needs to be georeferenced.

Visibility

The first step is to calculate the visible portion of the digital elevation model from the point of view of the camera. For this computation we need to know the exact position of the observer (camera), and the direction and field of view. The first datum is read from a GPS and altimeter at the time of taking the photograph. The second information is derived from the actual photograph itself, by identifying precisely the location of the target or central pixel of the image. It could also be derived from accurate information of the azimuth and elevation of the camera direction of sight by using a phototheodolite, for example. Finally, the field of view is derived from the focal length and the dimensions of the film.

World to camera coordinate system transformation

Once visibility has been calculated we apply a viewing transformation that maps points in the world coordinate system (that of the DEM) to points in the camera coordinate system, representing the viewing geometry of the camera in three-dimensional space, as shown in figure 1. A viewing transformation is parameterised by a viewing direction vector N, a vector V indicating which way is up, a vector U positive in the direction of the X-axis and a viewing position C ( e.g. Fiume, 1989). This is a standard procedure for obtaining perspective views in computer graphics (e.g. Foley et al., 1990). Firstly we apply a translation transformation to set the origin at C, the camera position.

**Figure 1:** Change from world to camera coordinate system. DEM coordinates are referenced to OXYZ reference system, while the image taken by camera at point C is referenced to CUNV.

The viewing transformation is then completed by multiplying the result of this translation by the following transformation matrix, that rotates the translated coordinates according to the viewing coordinate axis:

where f is the camera focal length.

Thus, the DEM:

Plus the photograph:

Camera Box at the Crête du Plan. Photo: Uli Strasser

Through the georeferencing process:

allows mapping the DEM onto the photograph:

Green dots are ground control points and red dots the superimposed scaled perspective projection of grid cells in the original DEM.

and then renders a georeferenced map of reflectance values:

animation

for the code, please contact Javier G. Corripio

References

Corripio, J. G.: 2004, Snow surface albedo estimation using terrestrial photography, International Journal of Remote Sensing 25(24), 5705-5729. Preprint.
Fiume, E. L.: 1989, The mathematical structure of raster graphics, Academic Press, Boston.
Foley,J.D., van Dam, A., Feiner, S.K. and Hughes, J.F.: 1990, Computer graphics, principles and practice, Adison-Wesley, Reading, Massachusetts.
Slama, C. C., Theurer, C. and Henriksen, S. W. (eds): 1980, Manual of Photogrammetry, American Society of Photogrammetry, Falls Church, Va. xv, 1056 p.
Watt, A. H. and Watt, M.: 1992, Advanced animation and rendering techniques: theory and practice, ACM Press: Addison-Wesley, New York.

Javier G Corripio

!!! Dieses Dokument stammt aus dem ETH Web-Archiv und wird nicht mehr gepflegt !!!
!!! This document is stored in the ETH Web archive and is no longer maintained !!!