#### Identifier

etd-04272011-105721

#### Degree

Master of Science in Electrical Engineering (MSEE)

#### Department

Electrical and Computer Engineering

#### Document Type

Thesis

#### Abstract

Untitled Page

In this thesis we study the running of the Scale Invariant Feature Transform (SIFT) algorithm on a pipelined computational platform. The SIFT algorithm is one of the most widely used methods for image feature extraction.

We develop a tile based template for running SIFT that facilitates the analysis while abstracting away lower-level details. We formalize the computational pipeline and the time to execute any algorithm on it based on the relative times taken by the pipeline stages. In the context of the SIFT algorithm, this reduces the time to that of running the entire image through a bottlenecked stage and the time to run either the first or last tile through the remaining stages. Through an experimental study of the SIFT algorithm on a broad collection of test images, we determined image feature fraction values, that relate the sizes of the image extracts as it the computation proceeds through the stages of the SIFT algorithm.

We show that for a single chip uniprocessor pipeline, the computational stage is the bottleneck. Specifically we show that for an N x N image with n x n tiles the overall time complexity is θ ( (n+x)^{2 } p_{i } Г_{0}+αβN^{2}x^{2} Г_{1}+ (αβ+γ)n^{2}logx P_{0} Г_{2} ) ; here x is the neigborhood of the tile, p_{i }, p_{o} are the number of input, output pins of the chip, α,β,γ are the feature fractions, and Г_{0}, Г_{1}, Г_{2} are the input, compute, output clocks. The three terms in the expression represents the time complexities of input, compute and output stages. The input and output stages can be slowed down substantially without appreciate degradation of the overall performance. This slowdown can be traded off for lower power and higher signal quantity.

For multicore chips, we show that for an N x N image on a P-core chip, the overall time complexity to process the image is θ ( N^{2 } p_{i } Г_{0}+ (n^{2}w^{2} + αβn^{2}x^{2}) P Г_{1}+ (αβ+γ)n^{2}logx P_{0} Г_{2} ) ; in addition to the quantities described earlier w is the window size used for the Gaussian blurring. Overall we establish that without improvements in the input bandwidth, the power of multicore processing cannot be used efficiently for SIFT.

#### Date

2011

#### Document Availability at the Time of Submission

Release the entire work immediately for access worldwide.

#### Recommended Citation

Vinukonda, Phaneendra, "A study of the scale-invariant feature transform on a parallel pipeline" (2011). *LSU Master's Theses*. 2721.

https://digitalcommons.lsu.edu/gradschool_theses/2721

#### Committee Chair

Vaidyanathan, Ramachandran