1
|
Zhou B, Wang X. Feature representation for 3D object retrieval based on unconstrained multi-view. MULTIMEDIA SYSTEMS 2022; 28:1699-1711. [DOI: 10.1007/s00530-022-00939-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Accepted: 04/07/2022] [Indexed: 09/01/2023]
|
2
|
Xu D, Shen X, Lyu Y, Du X, Feng F. MC‐Net: Learning mutually‐complementary features for image manipulation localization. INT J INTELL SYST 2022. [DOI: 10.1002/int.22826] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Dengyun Xu
- College of Computer Science and Technology Jilin University Changchun Jilin China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University Changchun Jilin China
| | - Xuanjing Shen
- College of Computer Science and Technology Jilin University Changchun Jilin China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University Changchun Jilin China
| | - Yingda Lyu
- Center for Computer Fundamental Education Jilin University Changchun Jilin China
| | - Xiaoyu Du
- School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing Jiangsu China
| | - Fuli Feng
- School of Computing National University of Singapore Singapore Singapore
| |
Collapse
|
3
|
Shi Z, Chang C, Chen H, Du X, Zhang H. PR‐NET: Progressively‐refined neural network for image manipulation localization. INT J INTELL SYST 2022. [DOI: 10.1002/int.22822] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Zenan Shi
- College of Computer Science and Technology Jilin University Changchun Jilin China
- State Key Laboratory of Communication Content Cognition Beijing China
| | - Chaoqun Chang
- College of Software Jilin University Changchun Jilin China
| | - Haipeng Chen
- College of Computer Science and Technology Jilin University Changchun Jilin China
- State Key Laboratory of Communication Content Cognition Beijing China
| | - Xiaoyu Du
- State Key Laboratory of Communication Content Cognition Beijing China
- School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing Jiangsu China
| | - Hanwang Zhang
- School of Computer Science and Engineering Nanyang Technological University Singapore Singapore
| |
Collapse
|
4
|
Li C, Zhang J, Yao J. Streamer action recognition in live video with spatial-temporal attention and deep dictionary learning. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.07.148] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
5
|
Guan W, Song X, Gan T, Lin J, Chang X, Nie L. Cooperation Learning From Multiple Social Networks: Consistent and Complementary Perspectives. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:4501-4514. [PMID: 31794409 DOI: 10.1109/tcyb.2019.2951207] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
GWI survey1 has highlighted the flourishing use of multiple social networks: the average number of social media accounts per Internet user is 5.54, and among them, 2.82 are being used actively. Indeed, users tend to express their views in more than one social media site. Hence, merging social signals of the same user across different social networks together, if available, can facilitate the downstream analyses. Previous work has paid little attention on modeling the cooperation among the following factors when fusing data from multiple social networks: 1) as data from different sources characterizes the characteristics of the same social user, the source consistency merits our attention; 2) due to their different functional emphases, some aspects of the same user captured by different social networks can be just complementary and results in the source complementarity; and 3) different sources can contribute differently to the user characterization and hence lead to the different source confidence. Toward this end, we propose a novel unified model, which co-regularizes source consistency, complementarity, and confidence to boost the learning performance with multiple social networks. In addition, we derived its theoretical solution and verified the model with the real-world application of user interest inference. Extensive experiments over several state-of-the-art competitors have justified the superiority of our model.1http://tinyurl.com/zk6kgc9.
Collapse
|
6
|
Meng Q, Zuo C, Shi F, Zhu W, Xiang D, Chen H, Chen X. Three-dimensional choroid neovascularization growth prediction from longitudinal retinal OCT images based on a hybrid model. Pattern Recognit Lett 2021. [DOI: 10.1016/j.patrec.2021.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
7
|
Integrating knowledge-based sparse representation for image detection. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.09.089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
8
|
Wang H, Peng J, Jiang G, Xu F, Fu X. Discriminative feature and dictionary learning with part-aware model for vehicle re-identification. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.06.148] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
9
|
Sharma K, Rameshan R. Image Set Classification Using a Distance-Based Kernel Over Affine Grassmann Manifold. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:1082-1095. [PMID: 32275625 DOI: 10.1109/tnnls.2020.2980059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Modeling image sets or videos as linear subspaces is quite popular for classification problems in machine learning. However, affine subspace modeling has not been explored much. In this article, we address the image sets classification problem by modeling them as affine subspaces. Affine subspaces are linear subspaces shifted from origin by an offset. The collection of the same dimensional affine subspaces of [Formula: see text] is known as affine Grassmann manifold (AGM) or affine Grassmannian that is a smooth and noncompact manifold. The non-Euclidean geometry of AGM and the nonunique representation of an affine subspace in AGM make the classification task in AGM difficult. In this article, we propose a novel affine subspace-based kernel that maps the points in AGM to a finite-dimensional Hilbert space. For this, we embed the AGM in a higher dimensional Grassmann manifold (GM) by embedding the offset vector in the Stiefel coordinates. The projection distance between two points in AGM is the measure of similarity obtained by the kernel function. The obtained kernel-gram matrix is further diagonalized to generate low-dimensional features in the Euclidean space corresponding to the points in AGM. Distance-preserving constraint along with sparsity constraint is used for minimum residual error classification by keeping the locally Euclidean structure of AGM in mind. Experimentation performed over four data sets for gait, object, hand, and body gesture recognition shows promising results compared with state-of-the-art techniques.
Collapse
|
10
|
Liu X, Yang X, Wang M, Hong R. Deep Neighborhood Component Analysis for Visual Similarity Modeling. ACM T INTEL SYST TEC 2020. [DOI: 10.1145/3375787] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Learning effective visual similarity is an essential problem in multimedia research. Despite the promising progress made in recent years, most existing approaches learn visual features and similarities in two separate stages, which inevitably limits their performance. Once useful information has been lost in the feature extraction stage, it can hardly be recovered later. This article proposes a novel end-to-end approach for visual similarity modeling, called
deep neighborhood component analysis
, which discriminatively trains deep neural networks to jointly learn visual features and similarities. Specifically, we first formulate a metric learning objective that maximizes the intra-class correlations and minimizes the inter-class correlations under the neighborhood component analysis criterion, and then train deep convolutional neural networks to learn a nonlinear mapping that projects visual instances from original feature space to a discriminative and neighborhood-structure-preserving embedding space, thus resulting in better performance. We conducted extensive evaluations on several widely used and challenging datasets, and the impressive results demonstrate the effectiveness of our proposed approach.
Collapse
Affiliation(s)
- Xueliang Liu
- Hefei University of Technology, Hefei, Anhui, China
| | - Xun Yang
- National University of Singapore, Singapore, Singapore
| | - Meng Wang
- Hefei University of Technology, Hefei, Anhui, China
| | - Richang Hong
- Hefei University of Technology, Hefei, Anhui, China
| |
Collapse
|
11
|
Arc detection and recognition in pantograph–catenary system based on convolutional neural network. Inf Sci (N Y) 2019. [DOI: 10.1016/j.ins.2019.06.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
12
|
Han Z, Liu Z, Han J, Vong CM, Bu S, Chen CLP. Unsupervised Learning of 3-D Local Features From Raw Voxels Based on a Novel Permutation Voxelization Strategy. IEEE TRANSACTIONS ON CYBERNETICS 2019; 49:481-494. [PMID: 29990288 DOI: 10.1109/tcyb.2017.2778764] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Effective 3-D local features are significant elements for 3-D shape analysis. Existing hand-crafted 3-D local descriptors are effective but usually involve intensive human intervention and prior knowledge, which burdens the subsequent processing procedures. An alternative resorts to the unsupervised learning of features from raw 3-D representations via popular deep learning models. However, this alternative suffers from several significant unresolved issues, such as irregular vertex topology, arbitrary mesh resolution, orientation ambiguity on the 3-D surface, and rigid and slightly nonrigid transformation invariance. To tackle these issues, we propose an unsupervised 3-D local feature learning framework based on a novel permutation voxelization strategy to learn high-level and hierarchical 3-D local features from raw 3-D voxels. Specifically, the proposed strategy first applies a novel voxelization which discretizes each 3-D local region with irregular vertex topology and arbitrary mesh resolution into regular voxels, and then, a novel permutation is applied to permute the voxels to simultaneously eliminate the effect of rotation transformation and orientation ambiguity on the surface. Based on the proposed strategy, the permuted voxels can fully encode the geometry and structure of each local region in regular, sparse, and binary vectors. These voxel vectors are highly suitable for the learning of hierarchical common surface patterns by stacked sparse autoencoder with hierarchical abstraction and sparse constraint. Experiments are conducted on three aspects for evaluating the learned local features: 1) global shape retrieval; 2) partial shape retrieval; and 3) shape correspondence. The experimental results show that the learned local features outperform the other state-of-the-art 3-D shape descriptors.
Collapse
|
13
|
Integration of Context Information through Probabilistic Ontological Knowledge into Image Classification. INFORMATION 2018. [DOI: 10.3390/info9100252] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The use of ontological knowledge to improve classification results is a promising line of research. The availability of a probabilistic ontology raises the possibility of combining the probabilities coming from the ontology with the ones produced by a multi-class classifier that detects particular objects in an image. This combination not only provides the relations existing between the different segments, but can also improve the classification accuracy. In fact, it is known that the contextual information can often give information that suggests the correct class. This paper proposes a possible model that implements this integration, and the experimental assessment shows the effectiveness of the integration, especially when the classifier’s accuracy is relatively low. To assess the performance of the proposed model, we designed and implemented a simulated classifier that allows a priori decisions of its performance with sufficient precision.
Collapse
|
14
|
|
15
|
|
16
|
Zhou Z, Hao S. Anatomical landmark detection on 3D human shapes by hierarchically utilizing multiple shape features. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.09.131] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
17
|
Wu J, Pan S, Zhu X, Zhang C, Wu X. Positive and Unlabeled Multi-Graph Learning. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:818-829. [PMID: 28113878 DOI: 10.1109/tcyb.2016.2527239] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, we advance graph classification to handle multi-graph learning for complicated objects, where each object is represented as a bag of graphs and the label is only available to each bag but not individual graphs. In addition, when training classifiers, users are only given a handful of positive bags and many unlabeled bags, and the learning objective is to train models to classify previously unseen graph bags with maximum accuracy. To achieve the goal, we propose a positive and unlabeled multi-graph learning (puMGL) framework to first select informative subgraphs to convert graphs into a feature space. To utilize unlabeled bags for learning, puMGL assigns a confidence weight to each bag and dynamically adjusts its weight value to select "reliable negative bags." A number of representative graphs, selected from positive bags and identified reliable negative graph bags, form a "margin graph pool" which serves as the base for deriving subgraph patterns, training graph classifiers, and further updating the bag weight values. A closed-loop iterative process helps discover optimal subgraphs from positive and unlabeled graph bags for learning. Experimental comparisons demonstrate the performance of puMGL for classifying real-world complicated objects.
Collapse
|
18
|
Li X, Lv J, Wu X, Yu X. A Semi-supervised manifold alignment algorithm and an evaluation method based on local structure preservation. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.11.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
19
|
Li X, Liu Y, Jiang Y, Liu X. Identifying social influence in complex networks: A novel conductance eigenvector centrality model. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.11.123] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
20
|
Sentiment analysis for Chinese microblog based on deep neural networks with convolutional extension features. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.02.077] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
21
|
SUN X, YE J, REN F. Detecting influenza states based on hybrid model with personal emotional factors from social networks. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.01.107] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
22
|
Wang X, Liu G, Pan L, Li J. Uncovering fuzzy communities in networks with structural similarity. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.01.109] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
23
|
|
24
|
Liu Z, Wang X, Bu S. Human-Centered Saliency Detection. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2016; 27:1150-1162. [PMID: 26571539 DOI: 10.1109/tnnls.2015.2495148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
We introduce a new concept for detecting the saliency of 3-D shapes, that is, human-centered saliency (HCS) detection on the surface of shapes, whereby a given shape is analyzed not based on geometric or topological features directly obtained from the shape itself, but by studying how a human uses the object. Using virtual agents to simulate the ways in which humans interact with objects helps to understand shapes and detect their salient parts in relation to their functions. HCS detection is less affected by inconsistencies between the geometry or topology of the analyzed 3-D shapes. The potential benefit of the proposed method is that it is adaptable to variable shapes with the same semantics, as well as being robust against a geometrical and topological noise. Given a 3-D shape, its salient part is detected by automatically selecting a corresponding agent and making them interact with each other. Their adaption and alignment depend on an optimization framework and a training process. We demonstrate the detected salient parts for different types of objects together with the stability thereof. The salient parts can be used for important vision tasks, such as 3-D shape retrieval.
Collapse
|
25
|
Jiang R, Fu W, Wen L, Hao S, Hong R. Dimensionality reduction on Anchorgraph with an efficient Locality Preserving Projection. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.07.128] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
26
|
Yang Y, Wang X, Liu Q, Xu M, Yu L. A bundled-optimization model of multiview dense depth map synthesis for dynamic scene reconstruction. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2014.11.014] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
27
|
Xia Y, Zhang L, Xu W, Shan Z, Liu Y. Recognizing multi-view objects with occlusions using a deep architecture. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.01.038] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
28
|
Leng B, Du C, Guo S, Zhang X, Xiong Z. A powerful 3D model classification mechanism based on fusing multi-graph. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.05.048] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
29
|
Liu A, Wang Z, Nie W, Su Y. Graph-based characteristic view set extraction and matching for 3D model retrieval. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.042] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
30
|
Pan W, Xue J, Lu K, Zhai R, Dai S. Hybrid architecture for 3D visualization of ultrasonic data. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.03.026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
31
|
Haj Mohamed H, Belaid S. Algorithm BOSS (Bag-of-Salient local Spectrums) for non-rigid and partial 3D object retrieval. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.05.045] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
32
|
|
33
|
Lu K, He N, Xue J, Dong J, Shao L. Learning view-model joint relevance for 3D object retrieval. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2015; 24:1449-1459. [PMID: 25643404 DOI: 10.1109/tip.2015.2395961] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
3D object retrieval has attracted extensive research efforts and become an important task in recent years. It is noted that how to measure the relevance between 3D objects is still a difficult issue. Most of the existing methods employ just the model-based or view-based approaches, which may lead to incomplete information for 3D object representation. In this paper, we propose to jointly learn the view-model relevance among 3D objects for retrieval, in which the 3D objects are formulated in different graph structures. With the view information, the multiple views of 3D objects are employed to formulate the 3D object relationship in an object hypergraph structure. With the model data, the model-based features are extracted to construct an object graph to describe the relationship among the 3D objects. The learning on the two graphs is conducted to estimate the relevance among the 3D objects, in which the view/model graph weights can be also optimized in the learning process. This is the first work to jointly explore the view-based and model-based relevance among the 3D objects in a graph-based framework. The proposed method has been evaluated in three data sets. The experimental results and comparison with the state-of-the-art methods demonstrate the effectiveness on retrieval accuracy of the proposed 3D object retrieval method.
Collapse
|
34
|
Ji Z, Pang Y, He Y, Zhang H. Semi-supervised LPP algorithms for learning-to-rank-based visual search reranking. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2014.10.037] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
35
|
|
36
|
Liu AA, Xu N, Su YT, Lin H, Hao T, Yang ZX. Single/multi-view human action recognition via regularized multi-task learning. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.04.090] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
37
|
|
38
|
Wang JB, He N, Zhang LL, Lu K. Single image dehazing with a physical model and dark channel prior. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.08.005] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
39
|
Multi-View Human Action Recognition Using Wavelet Data Reduction and Multi-Class Classification. ACTA ACUST UNITED AC 2015. [DOI: 10.1016/j.procs.2015.08.540] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
40
|
Leng B, Zeng J, Yao M, Xiong Z. 3D object retrieval with multitopic model combining relevance feedback and LDA model. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2015; 24:94-105. [PMID: 25420263 DOI: 10.1109/tip.2014.2372618] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
View-based 3D model retrieval uses a set of views to represent each object. Discovering the complex relationship between multiple views remains challenging in 3D object retrieval. Recent progress in the latent Dirichlet allocation (LDA) model leads us to propose its use for 3D object retrieval. This LDA approach explores the hidden relationships between extracted primordial features of these views. Since LDA is limited to a fixed number of topics, we further propose a multitopic model to improve retrieval performance. We take advantage of a relevance feedback mechanism to balance the contributions of multiple topic models with specified numbers of topics. We demonstrate our improved retrieval performance over the state-of-the-art approaches.
Collapse
|
41
|
Wang J, Lu C, Wang M, Li P, Yan S, Hu X. Robust face recognition via adaptive sparse representation. IEEE TRANSACTIONS ON CYBERNETICS 2014; 44:2368-2378. [PMID: 25415943 DOI: 10.1109/tcyb.2014.2307067] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Sparse representation (or coding)-based classification (SRC) has gained great success in face recognition in recent years. However, SRC emphasizes the sparsity too much and overlooks the correlation information which has been demonstrated to be critical in real-world face recognition problems. Besides, some paper considers the correlation but overlooks the discriminative ability of sparsity. Different from these existing techniques, in this paper, we propose a framework called adaptive sparse representation-based classification (ASRC) in which sparsity and correlation are jointly considered. Specifically, when the samples are of low correlation, ASRC selects the most discriminative samples for representation, like SRC; when the training samples are highly correlated, ASRC selects most of the correlated and discriminative samples for representation, rather than choosing some related samples randomly. In general, the representation model is adaptive to the correlation structure that benefits from both l1-norm and l2-norm. Extensive experiments conducted on publicly available data sets verify the effectiveness and robustness of the proposed algorithm by comparing it with the state-of-the-art methods.
Collapse
|
42
|
Sun F, Li H, Hao S. Shape analysis based on feature-preserving Elastic Quadratic Patch Modeling. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2014.05.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
43
|
Zhang S, Yao H, Sun X, Wang K, Zhang J, Lu X, Zhang Y. Action recognition based on overcomplete independent components analysis. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2013.12.052] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
44
|
Lu K, Wang Q, Xue J, Pan W. 3D model retrieval and classification by semi-supervised learning with content-based similarity. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.03.079] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
45
|
Lu K, Ji R, Tang J, Gao Y. Learning-based bipartite graph matching for view-based 3D model retrieval. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2014; 23:4553-4563. [PMID: 25073171 DOI: 10.1109/tip.2014.2343460] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Distance measure between two sets of views is one central task in view-based 3D model retrieval. In this paper, we introduce a distance metric learning method for bipartite graph matching-based 3D object retrieval framework. In this method, the relationship among 3D models is formulated by a graph structure with semisupervised learning to estimate the model relevance. More specially, we model two sets of views by using a bipartite graph, on which their optimal matching is estimated. Then, we learn a refined distance metric by using the user’s relevance feedback. The proposed method has been evaluated on four data sets and the experimental results and comparison with the state-of-the-art methods demonstrate the effectiveness of the proposed method.
Collapse
|
46
|
|
47
|
Ji P, Zhao N, Hao S, Jiang J. Automatic image annotation by semi-supervised manifold kernel density estimation. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2013.09.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
48
|
|
49
|
Zhou T, Qi M, Jiang J, Wang X, Hao S, Jin Y. Person Re-identification based on nonlinear ranking with difference vectors. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.04.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
50
|
Improve the performance of co-training by committee with refinement of class probability estimations. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2014.01.039] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|