Statistical and Predictive Analytics - Data360_Analyze - Latest

Data360 Analyze Server Help

Product type: Software
Portfolio: Verify
Product family: Data360
Product: Data360 Analyze
Version: Latest
Language: English
Product name: Data360 Analyze
Title: Data360 Analyze Server Help
Copyright: 2024
First publish date: 2016
Last updated: 2024-11-28
Published on: 2024-11-28T15:26:57.181000

The Statistical and Predictive Analytics node pack provides nodes that let you apply statistical and analytic techniques without writing R code. These nodes are licensed separately and use the embedded TIBCO® Enterprise Runtime for R software.

Prerequisites

Ensure that you have the required Statistical and Predictive Analytics node pack license. You can apply a new license at any time; see Applying a new license.

Nodes in the Statistical and Predictive Analytics node pack depend on functionality provided by open-source CRAN packages.

Because official CRAN updates can introduce incompatibilities, Precisely hosts a repository of tried and tested package versions. The repository contains only packages known to be compatible with the Statistical and Predictive Analytics node pack (powered by TIBCO®). If you install other packages, you do so on the understanding that their latest versions may not be compatible.

On Windows, you can download and install the required CRAN R library packages by using the R Library Package Download node.

To install the packages on Linux, follow these steps:

  1. Install the open-source R product as described at: https://www.r-project.org/. The supported version of R required by the Precisely repository is 4.3.0. Precisely cannot guarantee compatibility with later versions.
  2. Open the R console and run the following command:
    install.packages(
      c("arules", "boot", "cluster", "lattice", "Matrix", "MASS", "NbClust",
        "quantreg", "randomForest", "SparseM", "survival", "xts", "zoo"),
      destdir = "<site-dir>/tmp/TERR/downloads",
      contriburl = "http://cran.lavastorm.com/src/contrib_4.3",
      type = "source",
      lib = "<site-dir>/tmp/TERR/libs",
      method = "libcurl"
    )

    Replace the <site-dir> placeholder in the destdir and lib arguments with the path to your site configuration location.

    Ensure that the folders referenced by the destdir and lib arguments exist before you run the command.
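
The two folders can be created from the same R console before running the install command. The following is a minimal sketch, not taken from the product documentation: the environment variable name D360_SITE_DIR is hypothetical, and the fallback to a temporary directory exists only so that the snippet runs as written. Point site_dir at your actual site configuration location.

```r
# Assumption: D360_SITE_DIR is a hypothetical variable name used for illustration.
# Replace the value of site_dir with your real <site-dir> path.
site_dir <- Sys.getenv("D360_SITE_DIR", unset = file.path(tempdir(), "site"))

download_dir <- file.path(site_dir, "tmp", "TERR", "downloads")  # used as destdir
lib_dir      <- file.path(site_dir, "tmp", "TERR", "libs")       # used as lib

# Create each folder, including any missing parent folders.
for (d in c(download_dir, lib_dir)) {
  if (!dir.exists(d)) {
    dir.create(d, recursive = TRUE)
  }
}

# Package names copied from the install command in step 2.
required_pkgs <- c("arules", "boot", "cluster", "lattice", "Matrix", "MASS",
                   "NbClust", "quantreg", "randomForest", "SparseM",
                   "survival", "xts", "zoo")

# List the required packages that are not yet present in lib_dir.
missing <- setdiff(required_pkgs, rownames(installed.packages(lib.loc = lib_dir)))
print(missing)
```

Before the install command has been run, missing lists every required package. Any name that still appears when you rerun the last two statements after the install is a package that was not installed into lib_dir.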

Nodes

The following table gives an overview of the nodes that are available in the Statistical and Predictive Analytics node pack:

| Nodes | Description | Example |
| --- | --- | --- |
| Decision Forest, Predict Decision Forest | Used to predict outcomes by repeatedly identifying patterns from an existing data set. | Is a particular customer with a certain set of attributes likely to switch to a competitor? |
| Hierarchical Clustering | Used to form a user-specified number of clusters out of data sets using user-defined criteria based on an iterative process of cluster merging. | Segment insurance policy holders into groups based on expected claims costs. |
| K-Means Advisor, K-Means Clustering | Used to form a user-specified number of clusters out of data sets using user-defined criteria based on proximity. | Segment markets for differentiated pricing. |
| Linear Regression, Predict Linear Regression | Used to calculate a line or curve of best fit to estimate values. | How much revenue can be generated for each dollar spent on advertising? |
| Logistic Regression, Predict Logistic Regression | Used to calculate probabilities of binary outcomes. | What is the probability that a customer will make a purchase, given certain customer attributes? |
| Market Basket Analysis, Market Basket Miner | Used to discover co-occurrence relationships between transaction items, activities, or characteristics. | Based on the TV shows that a viewer has watched, what other shows might the viewer also enjoy? |
| Predict Quantile Regression, Quantile Regression | Used to calculate an outcome of interest given a specific quantile (or percentile) and a specific attribute value. | How responsive are students to different education factors, given a certain standardized test score? |
| Time Series Forecast | A method that predicts future values of interest using weighted past values. | What is the expected value of retail sales for December? |