A Meta-Top-Down Method for Large-Scale
Hierarchical Classification
ABSTRACT
With an
increasing number of images that are available in social media, image
annotation has emerged as an important research topic due to its application in
image matching and retrieval. Most studies cast image annotation into a
multi-label classification problem. The main shortcoming of this approach is
that it requires a large number of training images with clean and complete
annotations in order to learn a reliable model for tag prediction. We address
this limitation by developing a novel approach that combines the strength of
tag ranking with the power of matrix recovery. Instead of having to make a
binary decision for each tag, our approach ranks tags in the descending order
of their relevance to the given image, significantly simplifying the problem. In
addition, the proposed method aggregates the prediction models for different
tags into a matrix, and casts tag ranking into a matrix recovery problem. It
introduces the matrix trace norm to explicitly control the model complexity so
that a reliable prediction model can be learned for tag ranking even when the
tag space is large and the number of training images is limited. Experiments on
multiple well-known image datasets demonstrate the effectiveness of the
proposed framework for tag ranking compared to the state-ofthe- art approaches
for image annotation and tag ranking.
.
EXISTING
SYSTEM
cast image
annotation into a multi-label classification problem. The main shortcoming of
this approach is that it requires a large number of training images with clean
and complete annotations in order to learn a reliable model for tag prediction.
We address this limitation by developing a novel approach that combines the
strength of tag ranking with the power of matrix recovery. Instead of having to
make a binary decision for each tag, our approach ranks tags in the descending
order of their relevance to the given image, significantly simplifying the
problem
PROPOSED
SYSTEM:
the proposed method aggregates the prediction
models for different tags into a matrix, and casts tag ranking into a matrix
recovery problem. It introduces the matrix trace norm to explicitly control the
model complexity so that a reliable prediction model can be learned for tag
ranking even when the tag space is large and the number of training images is
limited. Experiments on multiple well-known image datasets demonstrate the
effectiveness of the proposed framework for tag ranking compared to the
state-ofthe- art approaches for image annotation and tag ranking.
MODULE DESCRIPTION:
Automatic image annotation.
Tag ranking.
Low-rank.
Matrix recovery.
Trace norm.
Automatic image Annotation.
Automatic image annotation aims to find a
subset of keywords/ tags that describes the visual content of an image. It
plays an important role in bridging the semantic gap between low-level features
and high-level semantic content of images. Most automatic image annotation
algorithms can be classified into three categories generative models that model
the joint distribution between tags and visual features, discriminative models
that view image annotation as a classification problem, and search based approaches.
Below, we will briefly review approaches in each category. Both mixture models
and topic models, two well known approaches in generative model, have been
successfully applied to automatic image annotation. In a Gaussian mixture model
is used to model the dependence between keywords and visual features. In kernel
density estimation is applied to model the distribution of visual features and
to estimate the conditional probability of keyword assignments given the visual
features. Topic models annotate images as samples from a specific mixture of
topics, which each topic is a joint distribution between image features and
annotation keywords. Various topic models have been developed for image
annotation, including probabilistic latent semantic analysis (pLSA) ,latent
Dirichlet allocation and hierarchical Dirichlet processes . Since a large
number of training examples are needed for estimating the joint probability
distribution over both features and keywords, the generative models are unable
to handle the challenge of large tag space with limited number of training
images
Discriminative models , views image
annotation as a multi-class classification problem, and learns one binary
classification model for either one or multiple tags. A 2D multiresolution
hidded Markov model (MHMM) is proposed to model the relationship between tags
and visual content .A structured max-margin algorithm is developed in to
exploit the dependence among tags. One problem with discriminative approaches
for image annotation is imbalanced data distribution because each binary
classifier is designed to distinguish image of one class from images of the
other classes. It becomes more severe when the number of classes/tags is large
.Another limitation of these approaches is that they are unable to capture the
correlation among classes, which is known to be important in multi-label
learning. To overcome
Tag ranking.
Tag
ranking aims to learn a ranking function that puts relevant tags in front of
the irrelevant ones. In the simplest form, it learns a scoring function that
assigns larger values to the relevant tags than to those irrelevant ones. In ,
the authors develop a classification framework for tag ranking that computes
tag scores for a test image based on the neighbor voting. It was extended in
[46] to the case where each image is represented by multiple sets of visual
features. Liu et al. utilizes the Kernel
Density Estimation (KDE) to calculate relevance scores for different tags, and
performs a randomwalk to further improve the performance of tag ranking by
exploring the correlation between tags. Similarly, Tang et al. proposed a two-stage graph-based relevance
propagation approach. In , a two-view tag weighting method is proposed to
effectively exploit both the correlation among tags and the dependence between
visual features and tags. In , a max-margin riffled independence model is
developed for tag ranking. As mentioned in the introduction section, most of
the existing algorithms for tag ranking tend to perform poorly when the tag
space is large and the number of training images is limited.
Low-rank.
In mathematics, low-rankapproximation is
a minimization problem, in which the cost
function measures the fit
between a given matrix (the data) and an approximating matrix (the optimization
variable), subject to a constraint that the approximating matrix has reduced rank. The problem is used for mathematical modeling and data
compression. The rank
constraint is related to a constraint on the complexity of a model that fits
the data. In applications, often there are other constraints on the
approximating matrix apart from the rank constraint, e.g., non-negativity and Hankel structure.
We study
the rank, trace-norm and max-norm as complexity measures of matrices, focusing
on the problem of fitting a matrix with matrices having low complexity. We
present generalization error bounds for predicting unobserved entries that are
based on these measures. We also consider the possible relations between these
measures. We show gaps between them, and bounds on the extent of such gaps.
Matrix recovery.
A common modeling assumption in many
engineering applications is that the underlying data lies (approximately) on a
low-dimensional linear subspace. This property has been widely exploited by
classical Principal Component Analysis (PCA) to achieve dimensionality
reduction. However, real-life data is often corrupted with large errors or can
even be incomplete. Although classical PCA is effective against the presence of
small Gaussian noise in the data, it is highly sensitive to even sparse errors
of very high magnitude. We propose
powerful tools that exactly and efficiently correct large errors in such
structured data. The basic idea is to formulate the problem as a matrix rank
minimization problem and solve it
efficiently by nuclear-norm minimization. Our algorithms achieve
state-of-the-art performance in low-rank matrix recovery with theoretical
guarantees. Please browse the links to the left for more information. The
introduction section provides a brief overview of the low-rank matrix recovery
problem and introduces state-of-the-art algorithms to solve. Please refer to
our papers in the references section for complete technical details, and to the
sample code section for MATLAB packages. The applications section showcases
engineering problems where our techniques have been used to achieve
state-of-the-art performance.
Trace norm.
Trace-norm
and max-norm as complexity measures of matrices, focusing on the problem of
fitting a matrix with matrices having low complexity. We present generalization
error bounds for predicting unobserved entries that are based on these
measures. We also consider the possible relations between these measures
Accelerated Gradient Algorithm
Gradient descent is a first-order optimization algorithm. To find a local
minimum of a function
using gradient descent, one takes steps proportional to
the negative of the gradient (or of the approximate gradient) of the
function at the current point. If instead one takes steps proportional to
the positive of the gradient, one approaches a local
maximum of that
function; the procedure is then known as gradient ascent.
Gradient descent is also known
as steepest descent, or the method of steepest descent. When known as
the latter, gradient descent should not be confused with the method of steepest descent for approximating integrals.

System Configuration:
HARDWARE REQUIREMENTS:
Hardware
- Pentium
Speed
- 1.1 GHz
RAM - 1GB
Hard Disk - 20 GB
Floppy Drive - 1.44 MB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor
- SVGA
SOFTWARE REQUIREMENTS:
Operating System :
Windows
Technology :
Java and J2EE
Web Technologies :
Html, JavaScript, CSS
IDE : My Eclipse
Web Server :
Tomcat
Tool kit : Android
Phone
Database : My SQL
Java Version : J2SDK1.5
CONCLUSION
Extensive experiments on
image annotation and tag ranking have demonstrated that the proposed method
significantly outperforms several state-of-the-art methods for image annotation
especially when the number of training images is limited and when many of the
assigned image tags are missing. In the future, we plan to apply the proposed
framework to the image annotation problem when image tags are acquired by
crowdsouring that tend to be noisy and incomplete.
In addition, the proposed
method aggregates the prediction models for different tags into a matrix, and
casts tag ranking into a matrix recovery problem. It introduces the matrix
trace norm to explicitly control the model complexity so that a reliable
prediction model can be learned for tag ranking even when the tag space is
large and the number of training images is limited. Experiments on multiple
well-known image datasets demonstrate the effectiveness of the proposed
framework for tag ranking compared to the state-ofthe- art approaches for image
annotation and tag ranking.
Comments
Post a Comment