1994 Sensor-Based Systems Published Papers and Technical Reports

Librarian: Cathy Wiley
phone: (202) 767-0018
email:library@aic.nrl.navy.mil.

Here is a plain-text order form to order reports by mail. Some of the following reports can be downloaded as PostScript, or compressed PostScript files.

SENSOR-BASED SYSTEMS

Frank Pipitone
"Tripod Operators for Realtime Recognition of Surface Shapes in Range Images," Proceedings of the NASA Technology 2004 Symposium, Washington DC, November 1994 (NCARAI No: AIC-94-029).
Not available on-line at this time. Please see order form.

Abstract: Tripod operators (TO's) are a versatile class of feature extraction operators for surfaces. They are useful for recognition and/or localization based on range or tactile data. They extract a few sparse point samples in a regimented way, so that N sampled surface points yield only N-3 independent scalar features containing all the pose-invariant surface shape information in these points and no other information. They provide a powerful index into sets of prestored surface representations. A TO consists of three points in 3-space fixed at the vertices of an equilateral triangle and a procedure for making several "depth" measurements in the coordinate frame of the triangle, which is placed on the surface like a surveyor's tripod. TO's can be imbedded in a vision system in many ways and applied to almost any surface shape. Here the focus is an experimental study in which individual TO's are used to search a cluttered range image for one of 25 known shapes, typically in milliseconds, with very few false detections. We believe that this simple way of using TO's, in conjunction with existing triangulation range sensor technology, can be effectively applied to industrial parts recognition tasks, and with additional research to other applications.


Frank Pipitone
"Rapid Recognition of Elementary Surface Shapes in Cluttered Range Images Using Tripod Operators," Proceedings of the Workshop on Machine Vision Applications (MVA94/IAPR), Kawasaki, Japan, December 1994, (NCARAI No: AIC-94-030).
Not available on-line at this time. Please see order form.

Abstract: Tripod operators (TO's) are a versatile class of feature extraction operators for surfaces. They are useful for recognition and/or localization based on range or tactile data. They extract a few sparse point samples in a regimented way, so that N sampled surface points yield only N-3 independent scalar features containing all the pose-invariant surface shape information in these points and no other information. They provide a powerful index into sets of prestored surface representations. A TO consists of three points in 3-space fixed at the vertices of an equilateral triangle and a procedure for making several "depth" measurements in the coordinate frame of the triangle, which is placed on the surface like a surveyor's tripod. They have complete six DOF isometry invariance and can be imbedded in a vision system in many ways and applied to almost any surface shape. Here the focus is an experimental study in which TO's are used to search a cluttered range image for one of 25 known shapes, typically in milliseconds, with very few false positive detections.


Behrooz Kamgar-Parsi, Behzad Kamgar-Parsi, and John Sciortino
"Multi-Source Data Deinterleaving With Neural Networks," Proceedings of TECOM Artificial Intelligence Symposium, Aberdeen, MD, September 1994, (NCARAI No: AIC-94-028).
Not available on-line at this time. Please see order form.

Abstract: When several data sources are sending asynchronously without any multiplexing conventions, the stream of data from each source will be interleaved in an unpredictable sequence. In such a situation, it would be highly desirable to deinterleave the data streams before attempting further processing. After the application of certain signal processing techniques on the incoming interleaved data stream, one obtains a feature space in which different data sources typically form distinct clusters. It is therefore essential to have a reliable clustering technique to determine: (i) the correct number of sources, and (ii) the correct membership for each datum. The Hopfield -Kamgar neural net clustering technique appears to be the clustering technique of choice for this task. We will explain the main aspects of our technique and briefly discuss alternative neural nets and conventional methods for clustering, and in particular as applied to data deinterleaving.


Behrooz Kamgar-Parsi and Behzad Kamgar-Parsi
"Learning Natural Thresholds for Object Recognition," Internal Report, 1994, (NCARAI Report: AIC-94-042).
Not available on-line at this time. Please see order form.

Abstract: To determine whether or not an unknown object is a correct match of a given object P, current techniques define a threshold value and decide the matter by whether or not the similarity measure exceeds the threshold. The unknown object may deviate from object P in many ways. Hence, a given threshold may lead to a correct answer for certain types of deviations but not for others. Humans on the other hand appear to use thresholds that are multi-dimensional and complex. We propose an approach to develop natural thresholds for acceptance/rejection. This is done by attempting to construct decision boundaries at places where the human eye appears to "draw'' the line between acceptable (P) and unacceptable (not P). To this end we have developed a random deformation technique which is capable of automaticallygenerating an infinite number of true and false look-alikes of object P, which are then learned by the system. We have applied this technique to a real life problem, namely, distinguishing an approaching aircraft from clouds (or other objects) through its shape. The discriminating power of the system is comparable to that of the human eye.


Behrooz Kamgar-Parsi and Behzad Kamgar-Parsi
"Quantization Error in Regular Grids: Triangular Pixels," Internal Report, 1994, (NCARAI Report: AIC-94-051).
Not available on-line at this time. Please see order form.

Abstract: Quantization of the image plane into pixels introduces an error in any quantity computed from the image. Digital processing of images requires quantizing the image plane into pixels. This spatial quantization introduces an error in any quantity computed from the image. The regular polygons that tile a 2D plane are triangles, squares, and hexagons. In previous papers we treated square and hexagonal pixels. Here we derive closed-form, analytic expressions for the average error and the error distribution function due to triangular pixel quantization, for any function of an arbitrary number of independent variables in the linear approximation. These quantities are essential in examining the intrinsic sensitivity of image processing algorithms. We, also, find the result that for all possible cases 0.99


Behrooz Kamgar-Parsi and Behzad Kamgar-Parsi
Model-based Pattern Recognition with Multilayer Neural Networks: Learning from the Eye," Internal Report, 1994, (NCARAI Report: AIC-94-052).
Not available on-line at this time. Please see order form.

Abstract: We propose a model-based pattern recognition approach using multilayer neural networks to overcome certain shortcomings of the existing model-based techniques. In certain domains, the approach may allow the possibility of duplicating the discriminating power of the human eye in a network, provided that the pattern in question is meaningful to humans. To facilitate this we have developed a random deformation technique capable of generating an arbitrarily large number of true and false look-alikes of the model. The suggested approach attempts to construct decision boundaries at places where the human eye appears to "draw" the line between acceptable and unacceptable patterns. Applications of this technique to a real life problem shows a performance comparable to that of the eye.


Behzad Kamgar-Parsi, Behrooz Kamgar-Parsi and Menashe Brosh
"Distribution and Moments of the Weighted Sum of Uniform Random Variables, with Applications in reducing Monte Carlo Simulations," In press. Journal of Statistical Computation and Simulation, 1994, (NCARAI Report: AIC-94-050)
Not available on-line at this time. Please see order form.

Abstract: We derive analytical expressions for the distribution function and the moments of the weighted sum Y=7i n=1 aiXi , where Xi are independent random variables with non-identical uniform distributions, for an arbitrary number of variables N, and arbitrary coefficient values ai. These results are the generalizations ofthose for the regular sum of uniform random variables. Using the results, we examine the inadequacy of the central limit approximation for finite N. We also discuss the savings in the cost of computing properties of the weighted sum using these results vs. Monte Carlo simulations. We give an example of the application of the weighted sum to analyzing the effects of digitization error in computer vision.

1994 Publications by Section
Computational Reasoning
Intelligent M4 Systems
Interface Design And Evaluation
Machine Learning
Back to NCARAI Library


Cathy Wiley, wiley@aic.nrl.navy.mil