sparksee.AttributeStatistics Class Reference

Attribute statistics class. More...

List of all members.

Public Member Functions

def get_min
 Gets the minimum existing value (BASIC statistics).
def get_min_length_string
 Gets the minimum length.
def get_variance
 Gets the variance.
def get_mode
 Gets the mode.
def get_null
 Gets the number of objects NULL a Value (BASIC statistics).
def get_distinct
 Gets the number of distinct values (BASIC statistics).
def get_mean
 Gets the mean or average.
def get_max
 Gets the maximum existing value (BASIC statistics).
def get_median
 Gets the median.
def get_total
 Gets the number of objects with a non-NULL Value (BASIC statistic).
def get_max_length_string
 Gets the maximum length.
def get_avg_length_string
 Gets the average length.
def get_mode_count
 Gets the number of objects with a Value equal to the mode.


Detailed Description

Attribute statistics class.

It contains statistic data about an attribute.

Some fields are valid just for numerical attributes and others just for string attributes. Also, some statistics are considered BASIC because computing them do not require to traverse all the different values of the attribute. For each getter method the documentation tells if the statistic is BASIC or not. See the Graph class method getAttributeStatistics or check out the SPARKSEE User Manual for more details on this.

Author:
Sparsity Technologies http://www.sparsity-technologies.com

Member Function Documentation

def sparksee.AttributeStatistics.get_min (   self  ) 

Gets the minimum existing value (BASIC statistics).

Returns:
The minimum existing value.

def sparksee.AttributeStatistics.get_min_length_string (   self  ) 

Gets the minimum length.

If the attribute is not an string attribute, it just returns 0.

Returns:
The minimum length.

def sparksee.AttributeStatistics.get_variance (   self  ) 

Gets the variance.

It is computed just for numerical attributes.

Returns:
The variance.

def sparksee.AttributeStatistics.get_mode (   self  ) 

Gets the mode.

Mode: Most frequent Value.

Returns:
The mode.

def sparksee.AttributeStatistics.get_null (   self  ) 

Gets the number of objects NULL a Value (BASIC statistics).

Returns:
The number of objects NULL a Value.

def sparksee.AttributeStatistics.get_distinct (   self  ) 

Gets the number of distinct values (BASIC statistics).

Returns:
The number of distinct values.

def sparksee.AttributeStatistics.get_mean (   self  ) 

Gets the mean or average.

Mean or average: Sum of all Values divided by the number of observations.

It is computed just for numerical attributes.

Returns:
The mean.

def sparksee.AttributeStatistics.get_max (   self  ) 

Gets the maximum existing value (BASIC statistics).

Returns:
The maximum existing value.

def sparksee.AttributeStatistics.get_median (   self  ) 

Gets the median.

Median: Middle value that separates the higher half from the lower.

If a < b < c, then the median of the list {a, b, c} is b, and if a < b < c < d, then the median of the list {a, b, c, d} is the mean of b and c, i.e. it is (b + c)/2

It is computed just for numerical attributes.

Returns:
The median.

def sparksee.AttributeStatistics.get_total (   self  ) 

Gets the number of objects with a non-NULL Value (BASIC statistic).

Returns:
The number of objects with a non-NULL Value.

def sparksee.AttributeStatistics.get_max_length_string (   self  ) 

Gets the maximum length.

If the attribute is not an string attribute, it just returns 0.

Returns:
The maximum length.

def sparksee.AttributeStatistics.get_avg_length_string (   self  ) 

Gets the average length.

If the attribute is not an string attribute, it just returns 0.

Returns:
The average length.

def sparksee.AttributeStatistics.get_mode_count (   self  ) 

Gets the number of objects with a Value equal to the mode.

Returns:
The number of objects with a Value equal to the mode.


Generated on Fri Sep 12 13:39:42 2014 for SparkseePython by  doxygen 1.5.6