Menu
Introduction
Getting Started
Use cases
Technical documentations
Version
Publication date

Jul 10, 2025

Confidentiality
Public
Reactions
0
Share

UMAP analysis

Introduction


UMAPTask performs UMAP (Uniform Manifold Approximation and Projection) to reduce high-dimensional datasets into 2D or 3D representations that preserve the local and global structure of the data. It supports optional data scaling, K-Means clustering, and interactive Plotly visualizations.


This task is ideal for:


  • Visualizing complex datasets
    • Detecting structure or groupings
      • Pre-processing for downstream ML models

        Go to the UMAP task to find out more about the parameters. You can find here the documentation of the UMAP package.


        We will apply UMAP to the Iris and Digits datasets here. These are basic datasets that are used in Plotly documentation.



        Examples


        UMAP on the Iris Dataset


        The Iris dataset contains 4 numeric features and 3 species. We will plot each feature and colour each point according to the corresponding species. This will enable us to test whether these feature values are species-specific.


        In your scenario,  add the Iris dataset and the UMAP Dimensionality Reduction task. Fill in the parameters (see the screenshot below) and run the task. This will produce 2D and 3D plots, along with their respective tables.




        Text editor image

          


        Text editor image

        We obtain the following plots: 




        In both 2D and 3D plots, three compact clusters corresponding to the three species of Iris can be seen.


        We have reproduced Plotly's example UMAP projection.


        UMAP on the Digits Dataset


         The Digits dataset contains 64 pixel values per image (8×8 images).


        Text editor image
        Text editor image

        We obtain the following plots: 





        UMAP reveals the manifold structure that preserves digit separations.  


        We have reproduced Plotly's example UMAP projection.


        Conclusion


        UMAP Task provides a flexible and intuitive way to perform dimensionality reduction, visualize high-dimensional datasets, and uncover underlying structure in the data. With built-in options for feature scaling, clustering, and interactive Plotly visualizations, it adapts to both exploratory data analysis and production workflows. By applying UMAP to well-known datasets such as Iris and Digits, users can easily reproduce reference results from the Plotly gallery and gain confidence in the task’s behavior. Overall, UMAPTask is a powerful component for simplifying complex feature spaces and producing rich, interpretable visual insights.  



        Technical bricks to reuse or customize

        Have you developed a brick?

        Share it to accelerate projects for the entire community.