Build Your Own Stereoscope
Stereoscope, Stereoscopy & Stereopsis
Pocket stereoscope with original test image. Used by military to examine stereoscopic pairs of vertical aerial photographs.
Stereo card image modified for crossed eye viewing.
View of Manhattan, c. 1909
Stereoscopy, stereoscopic imaging or 3-D (three-dimensional) imaging is any technique capable of recording three-dimensional visual information or creating the illusion of depth in an image.
Stereopsis is the process in visual perception leading to perception of stereoscopic depth.
The illusion of depth in a photograph, movie, or other two-dimensional image is created by presenting a slightly different image to each eye. Many 3D displays use this method to convey images. It was first invented by Sir Charles Wheatstone in 1840.[1] Stereoscopy is used in photogrammetry and also for entertainment through the production of stereograms. Stereoscopy is useful in viewing images rendered from large multi-dimensional data sets such as are produced by experimental data. Modern industrial three dimensional photography may use 3D scanners
to detect and record 3 dimensional information. The three-dimensional
depth information can be reconstructed from two images using a computer
by corresponding the pixels in the left and right images. Solving the Correspondence problem in the field of Computer Vision aims to create meaningful depth information from two images.
Traditional stereoscopic photography consists of creating a 3-D
illusion starting from a pair of 2-D images. The easiest way to create depth perception
in the brain is to provide the eyes of the viewer with two different
images, representing two perspectives of the same object, with a minor
deviation similar to the perspectives that both eyes naturally receive
in binocular vision.
If eyestrain and distortion are to be avoided, each of the two 2-D
images preferably should be presented to each eye of the viewer so that
any object at infinite distance seen by the viewer should be perceived
by that eye while it is oriented straight ahead, the viewer's eyes
being neither crossed nor diverging. When the picture contains no
object at infinite distance, such as a horizon or a cloud, the pictures
should be spaced correspondingly closer together.
Side-by-side
Characteristics
A stereo card intended to be viewed in a stereoscope
Little or no additional image processing is required. Under some
circumstances, such as when a pair of images is presented for crossed
or diverged eye viewing, no device or additional optical equipment is
needed.
The principal advantages of side-by-side viewers is that there is no
diminution of brightness so images may be presented at very high
resolution and in full spectrum color. The ghosting associated with
polarized projection or when color filtering is used is totally
eliminated. The images are discretely presented to the eyes and visual
center of the brain, with no co-mingling of the views. The recent
advent of wider HD and computer flat screens has made wider 3D digital
images practical in this side by side mode, which hitherto has been
used mainly with paired photos or in print form.
Stereographic cards and the stereoscope
Two separate images are printed side-by-side. When viewed without a
stereoscopic viewer the user is required to force his eyes either to
cross, or to diverge, so that the two images appear to be three. Then
as each eye sees a different image, the effect of depth is achieved in
the central image of the three.
The stereoscope offers several advantages:
- Using positive curvature (magnifying) lenses, the focus point of
the image is changed from its short distance (about 30 to 40 cm) to a
virtual distance at infinity. This allows the focus of the eyes to be
consistent with the parallel lines of sight, greatly reducing eye
strain.
- The card image is magnified, offering a wider field of view and the ability to examine the detail of the photograph.
- The viewer provides a partition between the images, avoiding a potential distraction to the user.
Stereograms cards are frequently used by orthoptists and vision therapists in the treatment of many binocular vision and accommodative disorders.
Cross converged viewing, with new "masking" glasses
By exchanging the right and left views, and the opposite eye
converged to the shifted images, it is possible to obtain a true color
3-D effect with some effort, without glasses or a viewer. Very
recently, low cost glasses became available to aid the viewer in cross
convergence viewing. An unusual effect of these optical glasses is to
substantially widen the field of view to include a part of the
peripheral area not visible to both eyes. There is a wrap-around effect
produced, without the ghosting and the depth perspective can be greatly
deepened. These new glasses provide acrylic lenses, plastic frames,
that integrate a set of adjustable masking elements, that channel the
view to only one image per eye. This allows full use of the screen
width in the way that live vision treats stereo. The cross view image
can easily fill any screen including 16 by 9 HDTV screens, working like
a mirror to allow a dual perspective & wider view, as the images
are mentally processed. Several non-commercial demonstration sites,
such as bymal.com[2]
are showing extended 3D, sound, perfect color slide-shows, of subjects
such as the Renaissance Pleasure Faire. They can be viewed with the
glasses or without by masking with two hands turned upward, one per
side. The technique is easily applied to full motion video as well.
There are examples of the ultra wide aspect cross glasses images, as
well as narrower "portrait" format, which fills a 4Χ3 computer screen
efficiently.
Narrow paired images for parallel view, or with masking 3D glasses
Cross viewing without glasses
To view the crossed-eye view shown here, the viewer should move
slightly back from his or her normal viewing distance and place his
viewpoint on a line perpendicular to the center of the image. A finger
should be placed halfway between the eyes and the image, then the
finger should be viewed. The three bright spots between the pictures
should become four spots, and the two images become three. If the focus
of the eyes is now allowed to drift to the surface of the screen
without uncrossing the eyes, a three dimensional depth illusion will
appear in the central image. The finger may now be removed from the
view. A viewer may find that the extra side images become unimportant
once in-depth view of the central image is stable. This is a popular
way of presenting images on computers but it is difficult to learn and
for many viewers the method produces substantial eye-strain, and is not
comfortable enough for extended viewing. Another disadvantage is that
after prolonged viewing, the eyes may become accustomed to
"close-convergence", as it requires the ability to direct the eyes as
if viewing an object about eight inches away. This very close angle may
lead to momentary double-vision. It also offers few of the advantages
enumerated above that are provided by the stereoscope or Pokescope.
When images are presented as for the stereoscope, with the image to be
viewed by the left eye on the left, they can be viewed by diverging the
eyes. This gives a different kind of "naked eye stress" than crossing
the eyes (known as "wall-eyed divergence") but may require a smaller
adjustment of focus, but can be even harder to learn. Without the use
of viewing equipment, the size of a stereoscopic image viewable is
significantly limited by one's eye-spacing and the inability of one's
eyes to diverge painlessly. The major advantage of cross-eye viewing is
that the images can be more than twice the area, and no glasses are
needed by those who have the viewing knack. Prismatic cross glasses,
with built-in masking, make the convergence very easy for most people,
but they tend to be expensive, something like 5 times the cost of the
simpler arcylic masking glasses.
Vintage Stereoscopic Picture (for parallel viewing, quite hard to see with no equipment)
Vintage Stereoscopic Picture (for cross viewing, requires no special equipment)
A printable cut and fold cross viewer can be used. 
Transparency viewers
Stereoscope and case during WWII this tool was used by Allied photo
interpreters to analyze images shot from aerial photo reconnaissance
platforms.
In the 1940s, a modified and miniaturized variation of this technology was introduced as the View-Master.
Pairs of stereo views are printed on translucent film which is then
mounted around the edge of a cardboard disk, images of each pair being
diametrically opposite. A lever is used to move the disk so as to
present the next image pair. A series of seven views can thus be seen
on each card when it was inserted into the View-Master viewer. These
viewers were available in many forms both non-lighted and self-lighted
and may still be found today. One type of material presented is
children's fairy tale story scenes or brief stories using popular cartoon
characters. These use photographs of three dimensional model sets and
characters. Another type of material is a series of scenic views
associated with some tourist destination, typically sold at gift shops
located at the attraction.
Another important development in the late 1940s was the introduction of the Stereo Realist
camera and viewer system. Using color slide film, this equipment made
stereo photography available to the masses and caused a surge in its
popularity. The Stereo Realist and competing products can still be
found (in estate sales and elsewhere) and utilized today.
Low-cost folding cardboard viewers with plastic lenses have been
used to view images from a sliding card and have been used by computer
technical groups as part of annual convention proceedings. These have
been supplanted by the DVD recording and display on a television
set. By exhibiting moving images of rotating objects a three
dimensional effect is obtained through other than stereoscopic means.
An advantage offered by transparency viewing is that a wider field
of view may be presented since images, being illuminated from the rear,
may be placed much closer to the lenses. Note that with simple viewers
the images are limited in size as they must be adjacent and so the
field of view is determined by the distance between each lens and its
corresponding image.
Good quality wide angle lenses are quite expensive and they are not found in most stereo viewers.
Head-mounted displays
-
an HMD with a separate video source displayed in front of each eye to achieve a stereoscopic effect
The user typically wears a helmet or glasses with two small LCD or OLED
displays with magnifying lenses, one for each eye. The technology can
be used to show stereo films, images or games, but it can also be used
to create a virtual display. Head-mounted displays may also be
coupled with head-tracking devices, allowing the user to "look around"
the virtual world by moving their head, eliminating the need for a
separate controller. Performing this update quickly enough to avoid
inducing nausea in the user requires a great amount of computer image
processing. If six axis position sensing (direction and position) is
used then wearer may move about within the limitations of the equipment
used. Owing to rapid advancements in computer graphics and the
continuing miniaturization of video and other equipment these devices
are beginning to become available at more reasonable cost.
Head-mounted or wearable glasses may be used to view a see-through
image imposed upon the real world view, creating what is called augmented reality.
This is done by reflecting the video images through partially
reflective mirrors. The real world view is seen through the mirrors'
reflective surface. Experimental systems have been used for gaming,
where virtual opponents may peek from real windows as a player moves
about. This type of system is expected to have wide application in the
maintenance of complex systems, as it can give a technician what is
effectively "x-ray vision" by combining computer graphics rendering of
hidden elements with the technician's natural vision. Additionally,
technical data and schematic diagrams may be delivered to this same
equipment, eliminating the need to obtain and carry bulky paper
documents.
Augmented stereoscopic vision is also expected to have applications
in surgery, as it allows the combination of radiographic data (CAT scans and MRI imaging) with the surgeon's vision.
3D glasses
Liquid Crystal shutter glasses
-
Glasses containing liquid crystal that will let light through in synchronization with the images on the computer display, using the concept of alternate-frame sequencing. See also Time-division multiplexing.
Linearly polarized glasses
-
To present a stereoscopic motion picture, two images are projected superimposed onto the same screen through orthogonal polarizing
filters. It is best to use a silver screen so that polarization is
preserved. The projectors can receive their outputs from a computer
with a dual-head graphics card. The viewer wears low-cost eyeglasses
which also contain a pair of orthogonal polarizing filters. As each
filter only passes light which is similarly polarized and blocks the
orthogonally polarized light, each eye only sees one of the images, and
the effect is achieved. Linearly polarized glasses require the viewer
to keep his head level, as tilting of the viewing filters will cause
the images of the left and right channels to bleed over to the opposite
channel on the other hand, viewers learn very quickly not to tilt
their heads. In addition, since no head tracking is involved, several
people can view the stereocopic images at the same time.
There are several commercial systems offering products like the
above, and one can also put one together by oneself using instructions
on the GeoWall Consortium site they refer to such a system as a GeoWall.
Circularly polarized glasses
-
To present a stereoscopic motion picture, two images are projected superimposed onto the same screen through circular polarizing filters of opposite handedness.
The viewer wears low-cost eyeglasses which contain a pair of analyzing
filters (circular polarizers mounted in reverse) of opposite
handedness. Light that is left-circularly polarized is extinguished by
the right-handed analyzer; while right-circularly polarized light is
extinguished by the left-handed analyzer. The result is similar to that
of steroscopic viewing using linearly polarized glasses; except the
viewer can tilt his head and still maintain left/right separation.
Real D Cinema
System (used recently with the sterescopic Disney movie, "Chicken
Little 3D") uses electronically driven circular polarizers that
alternate between left- and right- handedness, and does so in sync with
the left or right image being displayed by the (digital) movie
projector.
Two-color anaglyph
Full color Anachrome red (left eye)
and cyan (right eye) filters
-
Main article: Anaglyph image
Anaglyph images have seen a recent resurgence due to the
presentation of images on the internet. Where traditionally, this has
been a largely black & white format, recent digital camera and
processing advances have brought very acceptable color images to the
internet and DVD field. With the online availability of low cost paper
glasses with improved red-cyan filters, and even better plastic framed
glasses, the field is growing fast. Scientific images, where depth
perception is useful, include the presentation of complex
multi-dimensional data sets and stereographic images from (for example)
the surface of Mars,
but due to recent release of 3D DVDs, they are increasingly used for
entertainment. Anaglyph images are much easier to view than either
parallel sighting or crossed eye stereograms, although the latter types
offer bright and accurate color rendering, which is not quite
obtainable with even good color anaglyphs.
Compensating anaglyph glasses
Simple sheet or uncorrected molded glasses do not compensate for the
250 nanometer difference in the wave lengths of the red-cyan filters.
With simple glasses, the red filter image can be blurry when viewing a
close computer screen or printed image since the retinal focus differs
from the cyan filtered image, which dominates the eyes' focusing.
Better quality molded plastic glasses employ a compensating
differential diopter power to equalize the red filter focus shift
relative to the cyan. The direct view focus on computer monitors has
been recently improved by manufacturers providing secondary paired
lenses fitted and attached inside the red-cyan primary filters of some
high end anaglyph glasses. They are used where very high resolution is
required, including science, stereo macros, and animation studio
applications. They also use carefully balanced cyan (blue-green)
acrylic lenses, which pass a minute percentage of red to improve skin
tone perception. Simple red/blue glasses work well with black and
white, but are very unsuitable for human skin in color.
ColorCode 3-D
ColorCode 3-D is a new patented 3-D Stereo system. It is the only in
the world to reproduce 3-dimensional images in a simple way with full
color- and depth information on all display media. ColorCode 3-D is
sometimes confused with anaglyph because of the colored filters in the
ColorCodeViewer, but both the filters and the encoding process are
entirely different from the more than 150 years old anaglyph system.
Chromadepth method and glasses
The Chromadepth
procedure of American Paper Optics is based on the fact that with a
prism colors are separated by varying degrees. The ChromaDepth
eyeglasses contain special view foils, which consist of microscopically
small prisms. This causes the image to be translated a certain amount
that depends on its color. If one uses a prism foil now with one eye
but not on the other eye, then the two seen pictures depending upon
color are more or less widely separated. The brain produces the
spatial impression from this difference. The advantage of this
technology consists above all of the fact that one can regard
ChromaDepth pictures also without eyeglasses (thus two-dimensional)
problem-free (unlike with two-color anaglyph). However the colors are
only limitedly selectable, since they contain the depth information of
the picture. If one changes the color of an object, then its observed
distance will also be changed.
Anachrome "compatible" color anaglyph method
Anachrome optical diopter glasses.
A recent variation on the anaglyph technique is called "Anachrome method".[3]
This approach is an attempt to provide images that look fairly normal
without glasses as 2D images to be "compatible" for posting in
conventional websites or magazines. The 3D effect is generally more
subtle, as the images are shot with a narrower stereo base, (the
distance between the camera lenses). Pains are taken to adjust for a
better overlay fit of the two images, which are layered one on top of
another. Only a few pixels of non-registration give the depth cues. The
range of color is perhaps three times wider in Anachrome due to the
deliberate passage of a small amount of the red information through the
cyan filter. Warmer tones can be boosted, and this provides warmer skin
tones and vividness.
As of April 2007, more than 4,500 educational, or scientific images
were offered on-line in this and similar "compatible" formats. More
than 40 public photo groups on www.flickr.com, the free photo archive,
accept or feature "compatible" or more conventional anaglyph photos.
|
|
3-D view of an engine using the same technology.
|
Other display methods
Autostereograms
A random dot autostereogram encodes a 3D scene which can be "seen" with proper viewing technique
-
Main article: autostereogram
More recently, random-dot autostereograms have been created using
computers to hide the different images in a field of apparently random
noise, so that until viewed by diverging the eyes, the subject of the
image remains a mystery. A popular example of this is the Magic Eye series, a collection of stereograms based on distorted colorful and interesting patterns instead of random noise.
In the classic Pulfrich effect
paradigm a subject views, binocularly, a pendulum swinging
perpendicular to his line of sight. When a neutral density filter
(e.g., a darkened lens -like from a pair of sunglasses) is placed in
front of, say, the right eye the pendulum appears to take on an
elliptical orbit, being closer as it swings toward the right and
farther as it swings toward the left.
The widely accepted explanation of the apparent motion with depth is
that a reduction in retinal illumination (relative to the fellow eye)
yields a corresponding delay in signal transmission, imparting
instantaneous spatial disparity to moving objects. This occurs because
the eye, and hence the brain, respond more quickly to brighter objects
than to dimmer ones.[4][5][6][7]
So, if the brightness of the pendulum is greater in the left eye
than in the right, the retinal signals from the left eye will reach the
brain slightly ahead of those from the right eye making it seem like
the pendulum seen by the right eye is lagging behind its counterpart in
the left eye. This difference in position over time is interpreted by
the brain as motion with depth: No motion, no depth.
The ultimate effect of this, with appropriate scene composition, is
the illusion of motion with depth. Object motion must be maintained for
most conditions and is effective only for very limited "real-world"
scenes.
Prismatic & self masking crossview glasses
"Naked-eye" cross viewing is a skill that must be learned to be used. New prismatic
glasses now make cross-viewing easier, and also mask off the secondary
non-3D images, that otherwise show up on either side of the 3D image.
The most recent low-cost glasses mask the images down to one per eye
using integrated baffles. Images or video frames can be displayed on a
new widescreen HD or computer monitor with all available area used for
display. HDTV
wide format permits excellent color and sharpness. Cross viewing
provides true "ghost-free 3D" with maximum clarity, brightness and
color range, as does the stereopticon
and stereoscope viewer with the parallel approach. The potential depth
and brightness is maximized. A recent cross converged development, is a
new variant wide format that uses a conjoining of visual information
outside of the regular binocular
stereo window. This allows an efficient seamless visual presentation in
true wide-screen, more closely matching the focal range of the human
eyes.
Lenticular prints
-
Lenticular printing is a technique by which one places an array of lenses, with a texture much like corduroy,
over a specially made and carefully aligned print such that different
viewing angles will reveal different image slices to each eye,
producing the illusion of three dimensions, over a certain limited
viewing angle. This can be done cheaply enough that it is sometimes
used on stickers, album covers, etc. It is the classic technique for 3D
postcards.
Displays with filter arrays
The LCD is covered with an array of prisms that divert the light
from odd and even pixel columns to left and right eyes respectively. As of 2004, several manufacturers, including Sharp Corporation,
offer this technology in their notebook and desktop computers. These
displays usually cost upwards of 1000 dollars and are mainly targeted
at science or medical professionals.
Another technique, for example used by the X3D company,
is simply to cover the LCD with two layers, the first being closer to
the LCD than the second, by some millimeters. The two layers are
transparent with black strips, each strip about one millimeter wide.
One layer has its strips about ten degrees to the left, the other to
the right. This allows seeing different pixels depending on the
viewer's position.
Wiggle stereoscopy
This method, possibly the most simple stereogram viewing technique,
is to simply alternate between the left and right images of a
stereogram. In a web browser, this can easily be accomplished with an animated .gif image, flash applet or a specialized java applet. Most people can get a crude sense of dimensionality from such images, due to persistence of vision and parallax.
Closing one eye and moving the head from side-to-side helps to
understand why this works. Objects that are closer appear to move more
than those further away.
This effect may also be observed by a passenger in a vehicle or
low-flying aircraft, where distant hills or tall buildings appear in
three-dimensional relief, a view not seen by a static observer as the
distance is beyond the range of effective binocular vision.
Advantages of the wiggle viewing method include:
- No glasses or special hardware required
- Most people can "get" the effect much quicker than cross-eyed and parallel viewing techniques
- It is the only method of stereoscopic visualisation for people with limited or no vision in one eye
Disadvantages of the "wiggle" method:
- Does not provide true binocular stereoscopic depth perception
- Not suitable for print, limited to displays that can "wiggle" between the two images
- Difficult to appreciate details in images that are constantly "wiggling"
Most wiggle images use only two images, leading to an annoyingly
jerky image. A smoother image, more akin to a motion picture image
where the camera is moved back and forth, can be composed by using
several intermediate images (perhaps with synthetic motion blur) and
longer image residency at the end images to allow inspection of details.
Although the "wiggle" method is an excellent way of previewing
stereoscopic images, it cannot actually be considered a true
three-dimensional stereoscopic format. An individual looking at a
wiggling image is not at all experiencing stereoscopic viewing, they
are still only seeing a flat two-dimensional image that is "wiggling".
To experience binocular depth perception as made possible with true
stereoscopic formats, each eyeball must be presented with a different
image at the same time this is not the case with "wiggling" stereo.
The "wiggle" effect is similar to walking around one's environment
while blinking one eyes.
Taking the pictures
In the 1950s,
stereoscopic photography regained popularity when a number of
manufacturers began introducing stereoscopic cameras to the public. The
new cameras were developed to use 135 film,
which had gained popularity after the close of World War II. Many of
the conventional cameras used the film for 35mm transparency slides,
and the new stereoscopic cameras utilized the film to make stereoscopic
slides. The Stereo Realist
camera was the most popular, and the 35mm picture format became the
standard by which other stereo cameras were designed. The stereoscopic
cameras were marketed with special viewers that allowed for the use of
such slides, which were similar to View-Master reels but offered a much
larger image. With these cameras the public could easily create their
own stereoscopic memories. Although their popularity has waned
somewhat, these cameras are still in use today.
The 1980s
saw a minor revival of stereoscopic photography extent when
point-and-shoot stereo cameras were introduced. These cameras suffered
from poor optics and plastic construction, so they never gained the
popularity of the 1950s stereo cameras. Over the last few years they
have been improved upon and now produce good images.
The beginning of the 21st century marked the coming of the age of
digital photography. Stereo lenses were introduced which could turn an
ordinary film camera into a stereo camera by using a special double
lens to take two images and direct them through a single lens to
capture them side-by-side on the film. Although there are not any
out-of-the-box digital stereocameras available, it is possible to
create a twin camera rig, together with a "shepherd" device to
synchronize the shutter and flash of the two cameras. (By mounting two
cameras on a bracket, spaced a bit, with a mechanism to make both take
pictures at the same time.) Newer cameras are even being used to shoot
"step video" 3D slide shows with many pictures almost like a 3D motion
picture if viewed properly. A modern camera can take 5 pictures per
second, with images that greatly exceed HDTV resolution.
The side-by-side method is extremely simple to create, but it can be
difficult or uncomfortable to view without optical aids. One such aid
for non-crossed images is the modern Pokescope.
Traditional stereoscopes such as the Holmes can be used as well. Cross
view technique now has the simple Perfect-Chroma cross viewing glasses
to facilitate viewing.
Imaging methods
If anything is in motion within the field of view, it is necessary
to take both images at once, either through use of a specialized
two-lens camera, or by using two identical cameras, operated as close
as possible to the same moment.
A single digital camera can also be used if the subject remains
perfectly still (such as an object in a museum display). Two exposures
are required. The camera can be moved on a sliding bar for offset, or
with practice, the photographer can simply shift the camera while
holding it straight and level. In practice the hand-held method works
very well. This method of taking stereo photos is sometimes referred to
as the "Cha-Cha" method.
A good rule of thumb is to shift sideways 1/30th of the distance to
the closest subject for 'side by side' display, or just 1/60th if the
image is to be also used for color anaglyph or anachrome image display.
For example, if you are taking a photo of a person in front of a house,
and the person is 30 feet away, then you should move the camera 1 foot
between shots.
The stereo effect is not significantly diminished by slight pan or
rotation between images. In fact slight rotation inwards (also called
'toe in') can be beneficial. Bear in mind that both images should show
the same objects in the scene (just from different angles) - if a tree
is on the edge of one image but out of view in the other image, then it
will appear in a ghostly, semi-transparent way to the viewer, which is
distracting and uncomfortable. Therefore, you can either crop the
images so they completely overlap, or you can 'toe-in' the cameras so
that the images completely overlap without having to discard any of the
images. However, be a little cautious - too much 'toe-in' can cause eye
strain for reasons best described here [1].
Longer base line
For making stereo images of a distant object (e.g., a mountain with
foothills), one can separate the camera positions by a larger distance
than usual. This will enhance the depth perception of these distant
objects, but is not suitable for use when foreground objects are
present. In the red-cyan anaglyphed example at right, a ten-meter
baseline atop the roof ridge of a house was used to image the mountain.
The two foothill ridges are about 6.5 km (4 mi) distant and are
separated in depth from each other and the background. The baseline is
still too short to resolve the depth of the two more distant major
peaks from each other. Owing to various trees that appeared in only one
of the images the final image had to be severely cropped at each side
and the bottom.
In the wider image, taken from a different location, a single camera
was walked about 100 ft (30m) between pictures. The images were
converted to monochrome before combination.
Long base line image showing prominent foothill ridges; click the image for more information on the technique
|
|
Base line selection
There is a specific optimal distance for viewing of natural scenes
(not stereograms), which has been estimated by some to have the closest
object at a distance of about 30 times the distance between the eyes
(when the scene extends to infinity). An object at this distance will
appear on the picture plane,
the apparent surface of the image. Objects closer than this will appear
in front of the picture plane, or popping out of the image. All objects
at greater distances appear behind the picture plane. This interpupillar or interocular distance
will vary between individuals. If one assumes that it is 2.5 inches
(about 6.5 cm), then the closest object in a natural scene by this
criterion would be 30 Χ 2.5 = 75 inches (about 2 m). It is this ratio
(1:30) that determines the inter-camera spacing appropriate to imaging
scenes. Thus if the nearest object is 30 feet away, this ratio suggests
an inter-camera distance of one foot. It may be that a more dramatic
effect can be obtained with a lower ratio, say 1:20 (in other words,
the cameras will be spaced further apart), but with some risk of having
the overall scene appear less "natural". This unnaturalness can often
be seen in old stereoscope cards, where a landscape will have the
appearance of a stack of cardboard cutouts. Where images may also be
used for anaglyph display a narrower base, say 1:50 or 1:60 will allow
for less ghosting in the display.
References
- ^ Welling, William. Photography in America, page 23
- ^ Malcom Patterson
- ^ Anachrome Advanced Plastic Anaglyph 3D Glasses and Anachrome 3D Technology
- ^
Lit A. (1949) The magnitude of the Pulfrich stereo-phenomenon as a
function of binocular differences of intensity at various levels of
illumination. Am. J. Psychol. 62:159-181.
- ^ Rogers B.J. Anstis S.M. (1972) Intensity versus Adaptation and the Pulfrich Stereophenomenon Vision Res. 12:909-928.
- ^
Williams JM, Lit A. (1983) Luminance-dependent visual latency for the
Hess effect, the Pulfrich effect, and simple reaction time. Vision Res.
23(2):171-9.
- ^
Deihl Rolf R. (1991) Measurement of Interocular delays with Dynamic
Random-Dot stereograms. Eur. Arch. Psychiatry Clin. Neurosci.
241:115-118.
External links
Stereopsis
Stereopsis (from stereo meaning solidity, and opsis meaning vision or sight) is the process in visual perception
leading to perception of stereoscopic depth. In turn, stereoscopic
depth is the sensation of depth that emerges from the fusion of the two
slightly different projections of the world on the two retinas. The
difference between the two eyes' images, which is a result of the eyes'
horizontal separation, is usually referred to as binocular disparity or
retinal disparity. The fact that this binocular disparity is
interpreted by the brain as depth was first discovered by Charles Wheatstone, a British scientist, and described by him in a classic paper[1]
published in 1838:
the mind perceives an object of three-dimensions
by means of the two dissimilar pictures projected by it on the two
retinζ
, (Wheatstone, 1838). To prove his ideas, Wheatstone invented a
simple device which he dubbed a stereoscope.
Using his newly invented stereoscope Wheatstone was able to
convincingly show that a vivid sense of depth emerges from two
completely flat pictures depicting two different projections of the
same scene.
History of stereopsis
Stereopsis was first described by Charles Wheatstone in 1838.[2]
He recognized that because each eye views the visual world from
slightly different horizontal positions, each eye's image differs from
the other. Objects at different distances from the eyes project images
in the two eyes that differ in their horizontal positions, giving the
depth cue of horizontal disparity, also known as retinal disparity and as binocular disparity. Wheatstone showed that this was an effective depth cue by creating the illusion
of depth from flat pictures that differed only in horizontal disparity.
To display his pictures separately to the two eyes, Wheatstone invented
the stereoscope.
Leonardo da Vinci
had also realized that objects at different distances from the eyes
project images in the two eyes that differ in their horizontal
positions, but had concluded only that this made it impossible for a
painter to portray a realistic depiction of the depth in a scene from a
single canvas. Leonardo chose for his near object a column with a
circular cross section and for his far object a flat wall. Had he
chosen any other near object, he may have discovered horizontal
disparity of its features. His column was one of the few objects that
projects identical images of itself in the two eyes.
Stereopsis became popular during Victorian times with the invention of the prism stereoscope by David Brewster. This, combined with photography, meant that tens of thousands of stereograms were produced.
Until about the 1960s, research into stereosis was dedicated to exploring its limits and its relationship to singleness of vision. Researchers included Peter Ludvig Panum, Ewald Hering, Adelbert Ames Jr., and Kenneth N. Ogle.
Also in the 1960s, Bela Julesz invented random-dot stereograms.
Unlike previous stereograms, in which each half image showed
recognizable objects, each half image of the first random-dot
stereograms showed a square matrix of about 10,000 small dots, with
each dot having a 50% probability of being black or white. No
recognizable objects could be seen in either half image. The two half
images of a random-dot stereogram were essentially identical, except
that one had a square area of dots shifted horizontally by one or two
dot diameters, giving horizontal disparity. The gap left by the
shifting was filled in with new random dots, hiding the shifted square.
Nevertheless, when the two half images were viewed one to each eye, the
square area was almost immediately visible by being closer or farther
than the background. Julesz whimsically called the square a cyclopean stimulus after the mythical Cyclops
who had only one eye. This was because it was as though we have a
cyclopean eye inside our brains that can see cyclopean stimuli hidden
to each of our actual eyes. Random-dot stereograms highlighted a
problem for stereopsis, the correspondence problem.
This is that any dot in one half image can realistically be paired with
many same-coloured dots in the other half image. Our visual systems
clearly solve the correspondence problem, in that we see the intended
depth instead of a fog of false matches. Research began to understand
how.
Also in the 1960s, Horace Barlow, Colin Blakemore, and Jack Pettigrew found neurons in the cat visual cortex that had their receptive fields
in different horizontal positions in the two eyes. This established the
neural basis for stereopsis. Their findings were disputed by David Hubel and Torsten Wiesel, although they eventually conceded when they found similar neurons in the monkey visual cortex. In the 1980s, Gian Poggio and others found neurons in V2 of the monkey brain that responded to the depth of random-dot stereograms.
In the 1990s, Christopher Tyler invented autostereograms, random-dot stereograms that can be viewed without a stereoscope. This led to the popular Magic Eye pictures.
Popular culture
A stereoscope
is a device by which each eye can be presented with different images,
allowing stereopsis to be stimulated with two pictures, one for each
eye. This has led to various crazes for stereopsis, usually prompted by new sorts of stereoscopes. In Victorian times it was the prism stereoscope (allowing stereo photographs to be viewed), in the 1950s it was red-green glasses (allowing stereo movies to be viewed), in the 1970s it was polarizing glasses (allowing coloured movies to be viewed), and in the 1990s it was Magic Eye pictures (autostereograms). Magic Eye pictures did not require a stereoscope, but relied on viewers using a form of free fusion so that each eye views different images.
Geometrical basis for stereopsis
Stereopsis appears to be processed in the visual cortex in binocular cells having receptive fields
in different horizontal positions in the two eyes. Such a cell is
active only when its preferred stimulus is in the correct position in
the left eye and in the correct position in the right eye, making it a disparity detector.
When a person stares at an object, the two eyes converge so that the object appears at the center of the retina
in both eyes. Other objects around the main object appear shifted in
relation to the main object. In the following example, whereas the main
object (dolphin) remains in the center of the two images in the two
eyes, the cube is shifted to the right in the left eye's image and is
shifted to the left when in the right eye's image.
The two eyes converge on the object of attention.
|
The cube is shifted to the right in left eye's image.
|
The cube is shifted to the left in the right eye's image.
|
We see a single, Cyclopean, image from the two eyes' images.
Because each eye is in a different horizontal position, each has a slightly different perspective on a scene yielding different retinal images. Normally two images are not observed, but rather a single view of the scene, a phenomenon known as singleness of vision.
If the images are very different (such as by going cross-eyed, or by presenting different images in a stereoscope) then one image at a time may be seen, a phenomenon known as binocular rivalry.
The brain gives each point in the Cyclopean image a depth value, represented here by a grayscale depth map.
Computer stereo vision
Computer stereo vision, is a part of the field of computer vision. It is sometimes used in mobile robotics to detect obstacles.
Two cameras take pictures of the same scene, but they are separated
by a distance - exactly like our eyes. A computer compares the images
while shifting the two images together over top of each other to find
the parts that match. The shifted amount is called the disparity. The disparity at which objects in the image best match is used by the computer to calculate their distance.
For a human, the eyes change their angle according to the distance
to the observed object. To a computer this represents significant extra
complexity in the geometrical calculations (Epipolar geometry).
In fact the simplest geometrical case is when the camera image planes
are on the same plane. The images may alternatively be converted by
reprojection through a linear transformation to be on the same image plane. This is called Image rectification.
Computer stereo vision with many cameras under fixed lighting is called structure from motion. Techniques using a fixed camera and known lighting are called photometric stereo techniques, or "shape from shading".
Computer stereo display
Many attempts have been made to reproduce human stereo vision on
rapidly changing computer displays, and toward this end numerous
patents relating to 3D television and cinema have been filed in the USPTO.
At least in the US, commercial activity involving those patents has
been confined exclusively to the grantees and licensees of the patent
holders, whose interests tend to last for twenty years from the time of
filing.
Discounting 3D television and cinema (which generally require a
plurality of digital projectors whose moving images must be
synchronized by computer), several stereoscopic LCDs are going to be offered by Sharp,
which has already started shipping a notebook with a built in
stereoscopic LCD. Although older technology required the user to don
goggles or visors for viewing computer-generated images, or CGI, newer
technology tends to employ fresnel lenses or plates over the liquid crystal displays, freeing the user from the need to put on special glasses or goggles.
References
- Scott B. Steinman, Barbara A. Steinman and Ralph Philip Garzia. (2000). Foundations of Binocular Vision: A Clinical perspective. McGraw-Hill Medical. ISBN 0-8385-2670-5.
See also
External links
This article is licensed under the GNU Free Documentation License. It uses material from Wikipedia Encyclopedia article "Stereoscopy"
|