Please use this identifier to cite or link to this item: https://dspace.univ-ouargla.dz/jspui/handle/123456789/35678
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorKHERFI, Mohammed Lamine-
dc.contributor.advisorCHERIET, Abdelhakim-
dc.contributor.authorALLAOUI, Mebarka-
dc.date.accessioned2024-02-22T10:44:00Z-
dc.date.available2024-02-22T10:44:00Z-
dc.date.issued2023-
dc.identifier.urihttps://dspace.univ-ouargla.dz/jspui/handle/123456789/35678-
dc.descriptionComputer Visionen_US
dc.description.abstractDimensionality reduction is a commonly employed technique in the field of machine learning and analytics, as it aids in the examination and representation of expansive data- sets characterized by a multitude of dimensions. This approach is precious for enhancing the effectiveness of tasks such as data clustering and classification. Recently, embedding methods have emerged as a promising direction for improving clustering accuracy. As a result, robust embedding and clustering techniques can be used to resolve real-life pro- blems. The contribution of the present dissertation is fourfold : (1) We explored ways to enhance the performance of several clustering algorithms by employing one of the most effective embedding techniques available. Our central hypothesis posits that the chosen embedding technique can enable the discovery of the optimal clusterable embedding ma- nifold. Consequently, we utilized it as a preprocessing step prior to clustering, thereby enabling the clustering algorithms to enhance their performance. (2) We performed em- bedding and clustering simultaneously through an original formulation, which allows for preserving the data’s original structure in the embedding space and producing a better clustering assignment. The unified manifold embedding and clustering (UEC) algorithm is based on a bi-objective loss function that combines data embedding and clustering, which is optimized using three different ways : 1) Comma Variant, 2) Plus Variant, and 3) Light Plus Variant. (3) The onset of the COVID-19 pandemic posed a significant challenge, making it increasingly arduous for researchers to keep abreast of the latest scientific advancements, given the rapid influx of scientific articles. To address this issue, we introduced an intelligent tool rooted in Machine Learning. This tool automatically organizes a vast repository of scientific literature pertaining to COVID-19 and presents it in a manner that facilitates easy navigation and swift document retrieval. The initial step involves preprocessing and transforming the documents into numerical features. Subse- quently, these features undergo dimensionality reduction into a 2D space through a deep denoising autoencoder followed by the Uniform Manifold Approximation and Projection technique (UMAP). The projected data is then clustered using the Agglomerative Cluste- ring Algorithm. Finally, we employ Latent Dirichlet Allocation (LDA) for topic modeling, assigning a label to each cluster. (4) We propose an innovative deep learning framework designed to structure the extensive collection of scientific literature pertaining to COVID- 19. The fundamental concept underlying this architecture revolves around the training of the autoencoder, which utilizes a two-fold objective function comprising distinct terms. The first term is devoted to assessing the latent representation, while the second is used to achieve the clustering assignments. Afterward, the Latent Dirichlet Allocation (LDA) is used as topic modeling techniques.en_US
dc.language.isoenen_US
dc.publisherUNIVERSITY OF KASDI MERBAH OUARGLAen_US
dc.subjectmachine learningen_US
dc.subjectdimensionality reductionen_US
dc.subjectembeddingen_US
dc.subjectclus- teringen_US
dc.subjectjoint learningen_US
dc.titleEmbedding techniques and their application to Computer visionen_US
dc.typeThesisen_US
Appears in Collections:Département d'informatique et technologie de l'information - Doctorat

Files in This Item:
File Description SizeFormat 
Mebarka ALLAOUI-Doctorat.pdf1,28 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.