SparkseePython
6.0.2
|
Attribute statistics class. More...
Public Member Functions | |
def | get_min_length_string (self) |
Gets the minimum length. More... | |
def | get_mode_count (self) |
Gets the number of objects with a Value equal to the mode. More... | |
def | get_variance (self) |
Gets the variance. More... | |
def | get_mode (self) |
Gets the mode. More... | |
def | get_min (self) |
Gets the minimum existing value (BASIC statistics). More... | |
def | get_median (self) |
Gets the median. More... | |
def | get_total (self) |
Gets the number of objects with a non-NULL Value (BASIC statistic). More... | |
def | get_max_length_string (self) |
Gets the maximum length. More... | |
def | get_mean (self) |
Gets the mean or average. More... | |
def | get_null (self) |
Gets the number of objects NULL a Value (BASIC statistics). More... | |
def | get_distinct (self) |
Gets the number of distinct values (BASIC statistics). More... | |
def | get_avg_length_string (self) |
Gets the average length. More... | |
def | get_max (self) |
Gets the maximum existing value (BASIC statistics). More... | |
Attribute statistics class.
It contains statistic data about an attribute.
Some fields are valid just for numerical attributes and others just for string attributes. Also, some statistics are considered BASIC because computing them do not require to traverse all the different values of the attribute. For each getter method the documentation tells if the statistic is BASIC or not. See the Graph class method getAttributeStatistics or check out the SPARKSEE User Manual for more details on this.
def sparksee.AttributeStatistics.get_avg_length_string | ( | self | ) |
Gets the average length.
If the attribute is not an string attribute, it just returns 0.
def sparksee.AttributeStatistics.get_distinct | ( | self | ) |
Gets the number of distinct values (BASIC statistics).
def sparksee.AttributeStatistics.get_max | ( | self | ) |
Gets the maximum existing value (BASIC statistics).
def sparksee.AttributeStatistics.get_max_length_string | ( | self | ) |
Gets the maximum length.
If the attribute is not an string attribute, it just returns 0.
def sparksee.AttributeStatistics.get_mean | ( | self | ) |
Gets the mean or average.
Mean or average: Sum of all Values divided by the number of observations.
It is computed just for numerical attributes.
def sparksee.AttributeStatistics.get_median | ( | self | ) |
Gets the median.
Median: Middle value that separates the higher half from the lower.
If a < b < c, then the median of the list {a, b, c} is b, and if a < b < c < d, then the median of the list {a, b, c, d} is the mean of b and c, i.e. it is (b + c)/2
It is computed just for numerical attributes.
def sparksee.AttributeStatistics.get_min | ( | self | ) |
Gets the minimum existing value (BASIC statistics).
def sparksee.AttributeStatistics.get_min_length_string | ( | self | ) |
Gets the minimum length.
If the attribute is not an string attribute, it just returns 0.
def sparksee.AttributeStatistics.get_mode | ( | self | ) |
def sparksee.AttributeStatistics.get_mode_count | ( | self | ) |
def sparksee.AttributeStatistics.get_null | ( | self | ) |
def sparksee.AttributeStatistics.get_total | ( | self | ) |
def sparksee.AttributeStatistics.get_variance | ( | self | ) |
Gets the variance.
It is computed just for numerical attributes.