Generic Visual Perception
Generic Visual Perception
Generic Visual Perception
INTRODUCTION
While computing technology is growing in leaps and bounds, the human brain continues to be the world's fastest computer. Combine brain-power with seeing power, and you have the fastest, cheapest, most extra ordinary processor ever-the human eye. Little wonder, research labs the world over are striving to produce a nearperfect electronic eye. The 'generic visual perception processor !"##$' has been developed after %& long years of scientific effort. !eneric "isual #erception #rocessor !"##$ can automatically detect ob'ects and trac( their movement in real-time . The !"##, which crunches )& billion instructions per second *+#,$, models the human perceptual process at the hardware level by mimic(ing the separate temporal and spatial functions of the eye-to-brain system. The processor sees its environment as a stream of histograms regarding the location and velocity of ob'ects. !"## has been demonstrated as capable of learning-in-place to solve a variety of pattern recognition problems. +t boasts automatic normali-ation for varying ob'ect si-e, orientation and lighting conditions, and can function in daylight or dar(ness.
This electronic .eye. on a chip can now handle most tas(s that a normal human eye can. That includes driving safely, selecting ripe fruits, reading and recogni-ing things. ,adly, though modeled on the visual perception capabilities of the human brain, the chip is not really a medical marvel, poised to cure the blind.
+t is desirable to provide devices including combined data processing units of a similar nature, each addressing a particular parameter extracted from the video signal. +n particular, it is desirable to provide devices including multiple units for calculating histograms, or electronic spatio-temporal neuron ,T0, each processing a 12T2 2$, by a function in order to generate individually an output value. The present invention also provides a method for perception of an ob'ect using characteristics, such as its shape, its si-e or its orientation, using a device composed of a set of histogram calculation units. 2
3sing the techni/ues of the present invention, a general outline of a moving ob'ect is determined with respect to a relatively stable bac(ground, then inside this outline, elements that are characteri-ed by their tone, color, relative position etc. are determined. .
%OTI#NTIA& 'IG"T#D
The !"## was invented in 1992, by *4" founder #atric #irim . +t would be relatively simple for a C56, chip to implement in hardware the separate contributions of temporal and spatial processing in the brain. The brain-eye system uses layers of parallel-processing neurons that pass the signal through a series of preprocessing steps, resulting in real-time trac(ing of multiple moving ob'ects within a visual scene. #irim created a chip architecture that mimic(ed the wor( of the neurons, with the help of multiplexing and memory. The result is an inexpensive device that can autonomously .perceive. and then trac( up to eight user-specified ob'ects in a video stream based on hue, luminance, saturation, spatial orientation, speed and direction of motion. The !"## trac(s an .ob'ect,. defined as a certain set of hue, luminance and saturation values in a specific shape, from frame to frame in a video stream by anticipating where it7s leading and trailing edges ma(e .differences. with the bac(ground. That means it can trac( an ob'ect through varying light sources or changes in si-e, as when an ob'ect gets closer to the viewer or moves farther away. The !"##7, ma'or performance strength over current-day vision systems is its adaptation to varying light conditions. Today7s vision systems dictate uniform shadow less illumination ,and even next generation prototype systems, designed to wor( under 8normal9 lighting conditions, can be used only dawn to dus(. The !"## on the other hand, adapt to real time changes in lighting without recalibration, day or light.
'O!T(AR# A'%#CT'
6n the software side, a host operating system running on an external #C communicates with the !"##'s evaluation board via an 6, (ernel within the on-chip microprocessor. *4" dubs the neural-learning capability of its development environment .programming by seeing and doing,. because of its ease of use. The engineer needs no (nowledge of the internal wor(ings of the !"##, the company said, only application-specific domain (nowledge. .#rogramming the !"## is as simple as setting a few registers, and then testing the results to gauge the application's success,. said ,teve ;owe, *4"'s director of research and development. .6nce debugged, these tiny application programs are loaded directly into the !"##'s internal ;65.. 2pplication programs themselves can use CDD, which ma(es calls to a library of assembly language algorithms for visual perception and trac(ing of ob'ects. The 7
R#COGNITION TA'
+n applications, each pixel may be described with respect to any of the six domains of information available to it F hue, luminance, saturation, speed, direction of motion and spatial orientation. The !"## further subcategori-es pixels by ranges, for instance luminance within %& percent and B< percent, hue of blue, saturation between )& and )< percent, and moving upward in scene. 2 set of second-level pattern recognition commands permits the !"## to search for different ob'ects in different parts of the scene F for instance, to loo( for a closed eyelid only within the rectangle bordered by the corners of the eye. ,ince some applications may also re/uire multiple levels of recognition, the !"## has software hoo(s to pass along the recognition tas( from level to level. :or instance, to detect when a driver is falling asleep F a capability that could find use in California, which is about to mandate that cars sound an .alarm. when drowsy drivers begin to nod off F the !"## is first programmed to detect the driver's 8
G$%% ARC"IT#CTUR#
The chip houses 23 neural bloc(s, both temporal and spatial, each consisting of )& hardware input and output .synaptic. connections. The !"## multiplexes this neural hardware with off-chip scratchpad memory to simulate as many as %&&,&&& synaptic connections per neuron. 4ach of these synapses can be changed through the on-chip microprocessor for a combined processing total of over B.) billion synaptic connectionsGsecond. +n executing up to )& *+#, to analy-e successive frames of a video stream, the temporal neurons identify pixels that have changed over time and generate a >-bit value indicative of the magnitude of that change. The spatial-processing system analy-es the resulting .difference. histogram to calculate the speed and direction of the motion. ?istogram is a bar chart of the count of pixels of every tone of gray that occurs in the image. +t helps us analy-e, and more importantly, correct the contrast of the image. Technically, the histogram maps Luminance, which is defined from the way the human eye perceives the brightness of different colors. :or example, our eyes are most sensitive to greenH we see green as being brighter than we see blue. Luminance weighs the effect of this to indicate the actual perceived brightness of the image pixels due to the color components.
MU&TI%&# %#RC#%TION'
The vision perception processor chip processes motion images in real-time, and is able to perceive and trac( ob'ects based on combinations of hue, luminance, speed, direction of motion and spatial orientation. The chip has three functionsI temporal processing, spatial processing and histogram processing. +n applications, the vision processor called generic visual perception processor !"##$ is used with a microprocessor, flash memory and peripherals. +n a case of detection of a driver falling sleep, the processor core detects movement of the eyelids. :irst, the driver is identified, and then the microprocessor directs the vision processor to search within the corner points of a rectangular area in which the nose of the driver would be expected to be located, based on a model, and to select pixels having characteristics of the shadows of nostrils. The results are calculated to the end of 10
11
12
The !"##'s ma'or performance strength over current-day vision systems is its automatic adaptation to varying lighting conditions. Today's vision systems dictate uniform, shadow less illumination, and even next-generation prototype systems, designed to wor( under .normal. lighting conditions, can be used only from dawn to dus(. The !"##, on the other hand, adapts to real-time changes in lighting without recalibration, day or night.
DI'AD$ANTAG#'
,adly, though modeled on the visual perception capabilities of the human brain, the chip is not really a medical marvel, poised to cure the blind. The chip is an industrial, rather than medical, invention and is not aimed at con/uering human blindness. *ut with do-ens of potential applications in a myriad of industries, its inventors are confident that the !"## will /uic(ly blossom into a multibillion dollar business.
14
A%%&ICATION'
*4" lists possible applications for the !"## in process monitoring, /uality control and assemblyH automotive systems such as intelligent air bags that monitor passenger si-e and traffic congestion monitorsH pedestrian detection, license plate recognition, electronic toll collection, automatic par(ing management, automatic inspectionH and medical uses including disease identification. The chip could also prove useful in unmanned air vehicles, miniature smart weapons, ground reconnaissance and other military applications, as well as in security access using facial, iris, fingerprint, or height and gait identification
*+ Auto,otive industr?ere, !"##-type devices could (eep .watch. and trigger off an alarm if the driver were to nod off to sleep at the wheel. 6r act as a monitor for the car's progress to ma(e sure it does not get too close to the curb. 2lso with transportation, !"## could be used in developing systems for collision avoidance, automatic cruise control, smart air bag systems, license plate recognition, measurement of traffic flow, electronic toll collection, automatic cargo trac(ing, par(ing management and the inspection of crac(s in rails and tunnels. 2nother automotive application is warning erratic drivers. !"## does so by monitoring the left and right lanes of the road, signaling the driver when the vehicle deviates from the proscribed lane.
.+ Ro/otics
+n manufacturing, !"## have applications in robotics, particularly for dirty and dangerous 'obs such as feeding hot parts to forging presses, cleaning up ha-ardous waste, and spraying toxic coatings on aircraft parts.
15
2+ Militar- applications
The chip can be used in other important pattern-recognition applications, such as military target ac/uisition and fire control.. 5ilitary applications include unmanned air vehicles, automatic target detection, tra'ectory correction, ground reconnaissance and surveillance. "ision systems are useful in home security, as well, and !"##-based systems could be inexpensively developed to detect intruders and fires. +n addition to the above applications, !"## should be able to wor( as medical scanners, blood analy-ers, cardiac monitoring, ban( chec(s, bar code reading, seal and signature verification, trademar( database indexing, construction of virtual reality environment models, human motion analysis, expression understanding, cloud identification, and many other fields.
16
2rray :ormat max$I J&&?xB&&" :rame rateI &-%&& "!2 frames per second progressive-scan +nterface 5odeI 5asterG,lave 1ata ;ate max$I =& 5egapixel per second 1ynamic ;angeI %&-bits, > channels #arametersI Luminance, ?ue, ,aturation, 5otion orientation, velocity$, 6riented edges, lines, curves, corners ComputationI B= ,T0 bloc(s 5ulti-scales possibilities optional$ 5ulti-chips connections capabilities ,emi-!raphic interface visuali-ation !3+ with mouse #2L, 0T,C, "!2, visuali-ation +nternal 6, C language programmation #C+, #,), +)C, ;,)>), #W5 &.> Watt, >,>"olts for #2LG0T,C format -=&,DJ< C Complete application in one ,i# %.x%.I C-56, +mager, !"##, %&5bits ";25, % 5bits ,erial :lash 5emory,)K 5?- Cristal.
17
!UTUR# 'CO%#'
:or future, scientists are wor(ing on a 8visual mouse9 for handgesture interface to computers that ta(e advantage of that high compression ratio. :uture studies also involve using this processor as an eye of the robots, which provides tremendous applications.
CONC&U'ION'
+mitating the human eye's neural networ(s and the brain, the generic visual perception processor can handle about )& billion instructions per second, and can manage most tas(s performed by the eye. 5odeled on the visual perception capabilities of the human brain, the !"## is a single chip that can detect ob'ects in a motion video signal and then to locate and trac( them in real time far more dependably than competing systems, which cost far more, according to company scientists. This is a generic chip, and we've already identified more than %&& applications in ten or more industries. The chip could be useful across a wide swathe of industries where visual trac(ing is important.
18
R#!#RCNC#'
L1M httpIGGwww.patentstorm.us L2M httpIGGwww.techweb.com L3M *art Cos(o., 0eural networ(s and fu--y systems, 2ddison Wesley L=MhttpIGGwww.pixelin(.comGsupport L<MhttpIGGgvpp.colorado.eduGcl.res.ppt.htm
19