Documentation — StatInsight

Installation

StatInsight is available for Windows and Linux. Choose your platform below.

Windows

1

Go to the Download page and click the Windows icon to save StatInsight_installer.exe.

2

If Windows SmartScreen shows a warning, click "More info" then "Run anyway" to proceed with the installation.

3

Read the License Agreement, select "I accept the agreement", then click Next and follow the remaining steps.

4

Leave "Launch StatInsight" checked and click Finish. The app opens and asks you to Accept the EULA.

5

Enter your License Key from the Pricing page. You are ready to start.

Linux

1

Go to the Download page and click the Linux icon to save StatInsight.tar.gz.

2

Extract the archive: tar -xzf StatInsight.tar.gz

3

Make the binary executable: chmod +x StatInsight/StatInsight

4

Launch the application: ./StatInsight/StatInsight — or double-click the binary in your file manager.

5

Accept the EULA, then enter your License Key from the Pricing page. You are ready to start.

Need a license key? Visit the Pricing page to get a free 14-day demo license — no credit card required.

Loading Data

Open a dataset by clicking File → Open File or dragging a file onto the application window. StatInsight automatically detects variable types upon loading.

Supported formats

Format	Extension	Notes
CSV	.csv	Delimiter auto-detected (comma, semicolon, pipe, tab)
Excel	.xls, .xlsx	Both legacy and modern Excel formats supported
RTF	.rtf	Rich Text Format with tabular data

Variable classification

Each column is automatically classified into one of the following types. You can change the type manually at any time using the variable type selector in the Descriptives tab.

Type	Description
Continuous	Numeric data with many unique values — measurements, weights, lab values
Categorical	Limited set of distinct groups — blood type, treatment group, study site
Binary	Exactly two distinct values — yes/no, 0/1, male/female
Date	Date or time values; common formats auto-recognized
Label	High-cardinality text columns (IDs, names) — excluded from statistical analysis

Variable types can be changed manually after loading if the automatic classification does not match your data.

Saving and reopening projects

Save your entire working session — including loaded data, variable types, and analysis results — as a .stati project file. Use File → Save Project to save, and File → Open Project to resume where you left off.

Descriptive Statistics

The Descriptives tab provides an at-a-glance summary of every variable in your dataset. Results update instantly when variable types are changed or outliers are removed.

Continuous variables

For numeric variables with many unique values, StatInsight calculates: mean, standard deviation, median, interquartile range (IQR), minimum, maximum, and sample size. A normality assessment is run automatically (composite of Shapiro-Wilk, Anderson-Darling, skewness, and kurtosis). Each variable is accompanied by an interactive histogram with outlier markers.

Categorical & binary variables

For categorical and binary columns, StatInsight displays frequency counts and percentages for each group, alongside a bar plot. This allows immediate detection of imbalanced groups or data entry errors.

Date variables

Date columns show the earliest and latest values, total span, and a time-area distribution plot to visualize data collection over time — useful for identifying recruitment gaps or data quality issues.

Outlier removal

Outliers are identified using the IQR method: a value is flagged as an outlier if it falls below Q1 − 1.5 × IQR or above Q3 + 1.5 × IQR. An Remove Outliers button is available for each continuous variable, allowing targeted removal without affecting other variables. Removed values are tracked and can be restored.

Quick Statistics

Quick Statistics runs all applicable statistical tests on your entire dataset in one click. StatInsight automatically selects the appropriate test for each variable combination based on data type and normality assessment, saving you the effort of choosing tests manually.

What's included in each result

p-value Effect size Power analysis Confidence intervals Plain-language interpretation Visualization

Each result card displays the test name, the variables tested, the key statistics, and a plain-English summary of the finding. Significant results (p < 0.05) are visually highlighted. All results can be filtered and exported — see the Export section for details.

Custom Statistics

The Custom Statistics tab lets you choose a specific test and configure exactly which variables to compare. This is the right choice when you have a defined hypothesis or need a test not covered by Quick Statistics.

Assumption-aware guidance

Before you run a test, StatInsight evaluates whether your selected data meets the test's assumptions and displays a colour-coded indicator:

Green — test recommended for your data Yellow — test possible, check assumptions carefully Red — test not appropriate for this data

Available tests

Category	Tests
Comparison	T-Test, Paired T-Test, Mann-Whitney U, Wilcoxon, ANOVA, Kruskal-Wallis, Repeated Measures ANOVA, Two-Way ANOVA, Friedman, ANCOVA, MANOVA
Correlation	Pearson, Spearman, Partial Pearson, Partial Spearman
Categorical	Chi-Squared Test
Survival	Kaplan-Meier Analysis
Regression	Simple Linear, Multiple Linear, Multiple Logistic, Multinomial Logistic

AutoPrediction

AutoPrediction automatically identifies which variables in your dataset are most likely predictors of a chosen outcome variable. Select your outcome and let StatInsight run a battery of machine learning feature selection methods to rank the remaining variables by predictive importance.

Methods used

1

L1-Regularized Regression (Lasso) — penalized regression that forces coefficients of less-relevant predictors toward zero, effectively performing variable selection.

2

ElasticNet Regression — combines L1 and L2 penalties, making it more robust when predictors are correlated with one another.

3

Random Forest Feature Importance — a tree-based ensemble method that measures how much each variable reduces impurity across all trees in the forest.

4

Mutual Information — measures the statistical dependence between each predictor variable and the outcome, capturing both linear and non-linear relationships.

5

Permutation Importance — directly measures the drop in model performance when a variable's values are randomly shuffled, providing a model-agnostic importance estimate.

Results are presented as a ranked list of predictors with a feature importance bar chart, making it straightforward to identify your most influential variables.

Filters & Navigation

The Descriptives pane includes a search bar to filter variables by name, and category filter buttons to show only Continuous, Categorical, Binary, or Date variables. This is particularly useful with wide datasets containing many columns.

The Quick Statistics pane offers several filtering options to focus on the most relevant results:

Search by test name — type any part of a test name to filter the results list.
Filter by variable type — buttons for All, Continuous, Categorical, and Binary narrow results to tests involving that variable type.
Significant only toggle — enable this to show only results where p < 0.05, so you can focus on statistically significant findings without scrolling through the full output.

Plots

Every result in StatInsight is accompanied by a chart. Plots are generated automatically and can be customized using the built-in plot editor.

Descriptive plots

Histogram (continuous variables)
Bar plot (categorical and binary variables)
Pie chart (categorical variables)
Box plot (continuous variables)
Time-area plot (date variables)

Statistical test plots

Scatter plot with regression trendline (correlation tests)
Mean bar chart with error bars (T-Test, ANOVA)
Mean line plot (Paired T-Test, Repeated Measures ANOVA)
Box plot (Mann-Whitney U, Wilcoxon, Kruskal-Wallis)
Violin plot (distribution comparisons)
Stacked bar plot (Chi-Squared test)
Survival curves with confidence bands (Kaplan-Meier)
Regression scatter with fit line (linear and logistic regression)
Feature importance bar chart (AutoPrediction)

Customization

Click the edit icon on any plot to open the Plot Editor. You can modify the chart title, axis labels, color scheme, line styles, and plot dimensions. Changes are applied in real time and are saved with the project file.

Export

StatInsight exports results to Microsoft Word (.docx) format — ready to paste directly into a manuscript or report. Use File → Export to open the export dialog.

What's exported

Statistical tables with all test metrics (test statistic, degrees of freedom, p-value, effect size)
Embedded chart images for each result
Pairwise comparison tables (post-hoc tests)
Plain-language result interpretation paragraphs
Regression summary tables with coefficients and confidence intervals
Kaplan-Meier event tables

Export options

Export All — exports every result currently loaded in the session.
Export Filtered — exports only the results currently visible after applying search or filter criteria.

Exports are formatted Word .docx files. Each result includes headings, tables, a chart, and a summary paragraph — ready to paste into a manuscript.

StatInsight Documentation