An Introduction to polarisR

Introduction

Welcome to the polarisR…..

This document provides a comprehensive guide to use the polarisR. We will walk through each tab of the application, explaining the features and functionalities available to help you explore and understand your high-dimensional data.

What does polarisR stand for?

projective output layouts and reduced interactive surfaces in R

Application Overview

The polarisR interface is organized into five main tabs, each designed for specific aspects of your data analysis workflow:

  1. Dataset Preview - Load and explore your data, select columns, and manage datasets
  2. Non-linear dimension reduction (NLDR) - Apply NLDR methods (t-SNE/UMAP) with parameter configuration and visualization
  3. Dynamic Tour - Explore high-dimensional structure through animated projections
  4. Diagnosing - Assess embedding quality using quantitative methods
  5. 2-D Layout Comparison - Compare different NLDR configurations and results

Each tab builds upon the previous ones, creating a comprehensive workflow from data loading to advanced comparative analysis. Let’s explore each tab in detail.

Dataset Preview Tab

The Dataset Preview tab is the starting point of your analysis. Here, you can load your data, select relevant columns, and get a quick overview of your dataset.

Dataset Preview Tab

Features:

# Access the datasets directly
data(four_clusters, package = "polarisR")
data(pdfsense, package = "polarisR") 

# View dataset information
?four_clusters
?pdfsense

Dataset Descriptions:

Additional Features:

Non-linear dimension reduction (NLDR) Tab

The Non-linear dimension reduction (NLDR) tab is where the main NLDR analysis happens. You can choose between t-SNE and UMAP, configure their parameters, and visualize the results.

Dataset Visualization Tab

Features:

Dynamic Tour Tab

The Dynamic Tour tab offers an interactive way to explore the high-dimensional space of your data. It provides a dynamic projection of the data, which can be viewed as a scatter plot, sage plot, or slice plot.

Dynamic Tour Tab

Features:

Diagnosing Tab

The Diagnosing tab provides tools to assess the selected NLDR layout. It uses the [quollr package] (https://github.com/JayaniLakshika/quollr) to perform a quantitative analysis of the NLDR layout and helps you to find the optimal binwidth for the model fitting.

Features:

2-D Layout Comparison Tab

The 2-D Layout Comparison tab allows you to compare the results of different NLDR layouts. You can compare different methods (t-SNE vs. UMAP) or the same method with different hyper-parameters.

Features: