Stone accumulation in kidney is a typical disease/ sickness in most countries all around the world. Its frequency rate is continually expanding. It has been observed that, the classification of renal stones prompts an imperative decrease of the re-occurrence. The classification of stones based on particular texture, surface highlights and lab examinations are a standout amongst the most utilized strategies. In this paper we use dataset of explicitly intended for top captured pictures of 454 expelled kidney stones which extracted through urinary or surgical procedure for classification purpose. In this paper different techniques have been learned and applied the specialist’s defined framework to arrange them into defined classes then perform classification process. In this paper we use feature fusion technique to collect as much as possible features. We select VGG16, InceptionV3 and ALEX features for fusion using serial feature fusion method. We choose C-SVM and F-KNN classifier to get improved accuracy of same dataset and predict better correctness’s with the possibilities of expansion of the dataset measure. In initial testing classification accuracy recorded at 83.43%, FNR 19.25%, Precision Rate 88.48% and Sensitivity of 86.51% on CSVM, later on the best testing classification accuracy recorded at 99.5%, FNR 0.1%, Precision Rate 99.90% and Sensitivity of 99.96% on F-KNN.
Ensuring the actual presence of a genuine legitimate trait as opposed to a fake self-manufactured synthetic is a major problem in bio-metric authentication. The proposed system's objective is to improve the reliability of bio metric recognition systems through the use of image quality evaluation. The proposed technique uses general image quality features derived from a single image to distinguish between legitimate and impostor samples, making it optimal for applications with a very low degree of complexity. In the proposed method, we are using publicly available ATVSFir_ DB dataset of iris which makes it highly competitive. We have also tested the algorithm on self-generated dataset for authenticity and rigorous testing purposes. The results acquired from the experimental phase were satisfying and authentic. The proposed method is able to achieve an averaged accuracy of 99.1% for the ATVS-Fir_DB dataset and 99.9% for the self-generated dataset.
Detection of a moving human is challenging for real-time systems. Misdetection in high alert security areas may lead to heavy losses. This paper presents an optimized approach to avoid this misdetection in sensitive areas. Rotation invariant optimized correlation filters are used for detection of humans. Some pre-processing algorithms such as background subtraction and color space conversion have been linked to the correlation filters to minimize processing time and maximize the accuracy of target detection. The experimental tests of the proposed methodology validate that better accuracy can be achieved if the proposed optimized approach is utilized for moving human detection in real-time systems. In future work, the proposed approach will be extended to detect human activity at night and thermal imagery.
CPU performance is estimated from the execution of processes per unit time. The selection of the CPU scheduling algorithm in less time is a vital issue. In this paper, a novel approach has been proposed in which selection of an appropriate CPU scheduling algorithm is done through machine learning algorithms dynamically. The result of the proposed algorithm is shown in the experimental section. Through experimentation, it is found that a decision tree gives better results in terms of accuracy and computational time as compared to other machine learning algorithms.
A spatial domain optimal trade-off Maximum Average Correlation Height (SPOT-MACH) filter has been shown to have advantages over frequency domain implementations of the Optimal Trade-Off Maximum Average Correlation Height (OR-MACH) filter as it can be made locally adaptive to spatial variations in the input image background clutter and normalized for local intensity changes. This enables the spatial domain implementation to be resistant to illumination changes. The Affine Scale Invariant Feature Transform (ASIFT) is an extension of previous feature transform algorithms; its features are invariant to six affine parameters which are translation (2 parameters), zoom, rotation and two camera axis orientations. This results in it accurately matching increased numbers of key points which can then be used for matching between different images of the object being tested. In this paper a novel approach will be adopted for enhancing the performance of the spatial correlation filter (SPOT MACH filter) using ASIFT in a pre-processing stage enabling fully invariant object detection and recognition in images with geometric distortions. An optimization criterion is also be developed to overcome the temporal overhead of the spatial domain approach. In order to evaluate effectiveness of algorithm, experiments were conducted on two different data sets. Several test cases were created based on illumination, rotational and scale changes in the target object. The performance of correlation algorithms was also tested against composite images as references and it was found that this results in a well-trained filter with better detection ability even when the target object has gone through large rotational changes.
Correlation filters due to its three protuberant advantages have proven very effective for automatic target detection, biometric verification and security applications. In this paper, correlation filters are implemented in hardware FPGA keeping in view their importance in real time applications. Hardware implementation results are placed in comparison with results generated through software. These results are almost similar with a negligible variation i.e. 10-4, which is demonstrated in the experimental section, in addition to valuable time reduction. The hardware design of these filters is implemented in LabView which can be subsequently employed in real-time security applications. This design may be expanded for other advanced variants of correlation filters in future work.
Object Recognition and Tracking are one of the key research areas in image processing and computer vision. This paper presents a novel technique which efficiently recognizes an object based on full boundary detection using affine scale invariant feature transform method (ASIFT). ASIFT is an improvement to SIFT algorithm as it provides invariance up to six parameters longitude and latitude wise. The six parameters are based on translation (2 parameters), rotation, camera axis orientation (2 parameters) and zoom. Key points commonly referred to as feature points are then obtained using the mentioned parameters which will recognize the object efficiently. Furthermore a region merging technique is used for object recognition and detection in the remote scene environment using ASIFT technique. A short pictorial comparison between SIFT and ASIFT will also be presented based on feature points calculation. After the recognition using ASIFT is performed, an algorithm will be presented for tracking of the recognized object using modified particle filter. The particle filter will use a proximal gradient (PG) approach for tracking of the recognized object in subsequent images. In case an object drastically varies its position w.r.t any of the six parameters mentioned above, ASIFT will again be called for object recognition.
In a remote scene environment consisting of multiple objects and miscellaneous scenarios, detecting an object of interest is a troublesome task especially while tracking the object over successive frames. Numerous methods have been proposed over the years for efficient detection of object of interest in a remote scene environment while in he meanwhile discarding all those which aren’t of interest and thus considered as noise. It is still one of the most actively researched areas in the field of image processing and computer vision. In this paper, a method is proposed which will not only detect a fixed shape object in a remote scene environment but it will also track it over successive frames. However, an additional methodology is also proposed which will detect the object in case of change of viewing angles e.g. scenario’s like rotation of object, zooming etc. First, Scale Invariant Feature Transform (SIFT) will be presented which will provide invariance up to four different parameters i.e. rotation, translation and zoom. In the second phase, ASIFT will be used which will provide invariance up to six different parameters i.e. translation, rotation, zoom and camera axis orientations. After both algorithms are presented, a detailed comparison between both is presented. Detection of object is performed with the help of both SIFT and ASIFT and then comparison is made based on feature points. Finally, Tracking is performed based on Proximal Gradient Particle filters which will further strengthen the comparison between SIFT and ASIFT once the object that needs to be tracked changes its course of motion or zoom. Experimental results will show which one of the two filters is more efficient.
Osteoporosis is an age-based disease causing skeletal disorder. It is described by the decreased bone mass and weakening of the bone structure thereby resulting in the higher fracture risks. Early identification can help prevent the disease and successfully predict the fracture risks. Automated diagnosis of osteoporosis using X-ray image is a very challenging task because the radiographs from the healthy patients and osteoporotic cases show a great resemblance. The texture representation is done using two type of methods: appearance based methods and feature based methods. This study explains two systems, one based on PCA and one based on LDA. The system contains two stages, first one is PCA or LDA based feature computation and the second is the classification stage. The classification has been done using classifiers i.e. kNN, NB and SVM. The discriminating power of the texture descriptors is assessed using ten-folded cross-validation scheme using different machine learning techniques. The scope of the study is to support therapists in osteoporosis prediction, avoiding unnecessary further testing with bone.
Biometric identification method is used to assess the characteristics of human behavior by identifying their different parameters. Gait recognition is an active biometric research topic which has many security and surveillance applications, and also can help in early diagnosis of different medical conditions such as Parkinson disease. It has been concluded from Psychological studies that people have slight but substantial capability to distinguish individuals by their gait characteristics. There are different techniques to perform gait recognition, and can be achieved by analyzing data from either imagery or radar sensors. This particular research project however will involve correct identification of a person from person’s gait by using images/video taken at different distances, angle of views and walking speeds of the person. CASIA Gait Recognition Dataset used in this project contains gait energy images. These images are extracted from images frame sequence of walking subject with camera positioned relative to subject, with increments of 18 degrees. Lower part of GEI is used in feature extraction, as it has most dynamic information. Gait signatures of a person created from gait energy images will be used to train artificial neural networks model to correctly classify the subject. Two Back propagation algorithms are compared in terms of performance. Cross-entropy and ROC curves are used as performance criteria for both training algorithms. Our system performs very well in terms of minimization of cross-entropy and classification rate.
Object recognition and semantic segmentation have been the two most common problems of traditional scene understanding in the computer vision domain. Major breakthroughs were reported in the last few years because of the increased utilization of deep learning, which offer a convincing alternative by learning the problem specific features on their own. In this paper, a summary of the frequently used framework – convolutional neural networks (CNN) is discussed. Accordingly a categorization scheme has been proposed to analyze the deep networks developed for image segmentation. Under this scheme, thirteen methods from the literature have been reviewed which are classified on the basis on how they perform segmentation operation i.e. semantic segmentation, instance segmentation and hybrid approaches. These method were reviewed from different aspects like their category, the novelty in the architecture of the method, and their special features in contrast with the traditional approaches. Latest review and analysis of these segmentation approaches, which provided outstanding results for image segmentation compared to the ordinary system, reveals that deep learning is increasingly becoming an important part of image segmentation and improvement in deep learning algorithms, which could resolve computer vision problems.
The assessment of osteoporotic subjects from X-ray images poses a significant challenge for pattern recognition and medical diagnostic applications. Textured images of bone micro-architecture of osteoporotic and healthy subjects exhibit high degree of similarity hence amplifying difficulty of classifying such textures. This research is focused on exploring different texture based methods to segregate osteoporotic from healthy controls. We enacted set of well evaluated preprocessing model to enhance the prospects of drawing a distinct line between two classes while exercising diverse texture analysis approaches including Grey Level Co-occurrence Matrix (GLCM), two-dimensional and one-dimensional Local Binary Patterns. Finally we propose a hybrid technique to attain an enhanced class distinction. Consequently experiments were conducted on two populations of osteoporotic patients and controls, with comparative analysis of the results.
Osteoporosis is an age-based disease causing skeletal disorder. It is described by the little bone mass and weakening of the bone structure thereby resulting in the higher fracture risks. Early identification can help prevent the disease and successfully predict the fracture risks. Automated diagnosis of osteoporosis using X-ray image is a very challenging task because the radiographs from the healthy subjects and osteoporotic cases show a high grade of resemblance. This study presents an evaluation of osteoporosis identification using texture descriptor Local Binary Pattern (LBP) and Shift Local Binary Pattern (SLBP). In contrast with the conventional LBP, with the shifted LBP specific number of binary local codes are generated for each pixel place. The distinguishing ability of the texture descriptors is evaluated using ten-fold cross validation and leave-one out scheme using different machine learning techniques. The results prove the SLBP outperforms the traditional LBP for bone texture characterization.
A fully invariant system helps in resolving difficulties in object detection when camera or object orientation and position
are unknown. In this paper, the proposed correlation filter based mechanism provides the capability to suppress noise,
clutter and occlusion. Minimum Average Correlation Energy (MACE) filter yields sharp correlation peaks while
considering the controlled correlation peak value. Difference of Gaussian (DOG) Wavelet has been added at the
preprocessing stage in proposed filter design that facilitates target detection in orientation variant cluttered environment.
Logarithmic transformation is combined with a DOG composite minimum average correlation energy filter (WMACE),
capable of producing sharp correlation peaks despite any kind of geometric distortion of target object. The proposed
filter has shown improved performance over some of the other variant correlation filters which are discussed in the result
section.
Correlation filters are a well established means for target recognition tasks. However, the unintentional effect of circular
correlation has a negative influence on the performance of correlation filters as they are implemented in frequency
domain. The effects of aliasing are minimized by introducing zero aliasing constraints in the template and test image. In
this paper, the comparative analysis of logarithmic zero aliasing optimal trade off correlation filters has been carried out
for different types of target distortions. The zero aliasing Maximum Average Correlation Height (MACH) filter has been
identified as the best choice based on our research for achieving enhanced results in the presence of any type of variance
which are discussed in results section. The reformulation of the MACH expressions with zero aliasing has been made to
demonstrate the achievable enhancement to the logarithmic MACH filter in target detection applications.
Sensitivity to the variations in the reference image is a major concern when recognizing target objects. A combinational framework of correlation filters and logarithmic transformation has been previously reported to resolve this issue alongside catering for scale and rotation changes of the object in the presence of distortion and noise. In this paper, we have extended the work to include the influence of different logarithmic bases on the resultant correlation plane. The meaningful changes in correlation parameters along with contraction/expansion in the correlation plane peak have been identified under different scenarios. Based on our research, we propose some specific log bases to be used in logarithmically transformed correlation filters for achieving suitable tolerance to different variations. The study is based upon testing a range of logarithmic bases for different situations and finding an optimal logarithmic base for each particular set of distortions. Our results show improved correlation and target detection accuracies.
Human detection has gained considerable importance in aggravated security scenarios over recent times. An effective
security application relies strongly on detailed information regarding the scene under consideration. A larger
accumulation of humans than the number of personal authorized to visit a security controlled area must be effectively
detected, amicably alarmed and immediately monitored. A framework involving a novel combination of some existing
techniques allows an immediate detection of an undesirable crowd in a region under observation. Frame differencing
provides a clear visibility of moving objects while highlighting those objects in each frame acquired by a real time
camera. Training of a correlation pattern recognition based filter on desired shapes such as elliptical representations of
human faces (variants of an Omega Shape) yields correct detections. The inherent ability of correlation pattern
recognition filters caters for angular rotations in the target object and renders decision regarding the existence of the
number of persons exceeding an allowed figure in the monitored area.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.