Probability

Given probability vectors px and qx, compute the "cross entropy". May also pass in response vectors: y (actual) and yp (predicted).

Value parameters

base_e: whether to use base e or base 2 logarithms (defaults to e)
px: the first probability vector
qx: the second probability vector (requires qx.dim >= px.dim)

Attributes

Given a joint probability matrix pxy, compute the "conditional probability" for random variable X given random variable Y, i.e, P(X = x_i|Y = y_j).

Value parameters

pxy: the joint probability matrix
py_: the marginal probability vector for Y

Attributes

Given a joint probability matrix pxy, compute the "conditional probability" for random variable Y given random variable X, i.e, P(Y = y_j|X = x_i).

Value parameters

px_: the marginal probability vector for X
pxy: the joint probability matrix

Attributes

Count the total number of occurrence in vector x of value vl, e.g., x is column 2 (Humidity), vl is 1 (High) matches 7 rows. This method works for vectors with integer or continuous values.

Value parameters

cont: whether feature/variable x is to be treated as continuous
k: the maximum value of y + 1
thres: the splitting threshold for features/variables treated as continuous
vl: one of the possible branch values for feature x (e.g., 1 (High))
x: the feature/column vector (e.g., column j of matrix)
y: the response/classification vector

Attributes

Given a probability vector px, compute the "entropy" of random variable X.

Value parameters

px: the probability vector

Attributes

See also: http://en.wikipedia.org/wiki/Entropy_%28information_theory%29

Given a frequency vector nu, compute the "entropy" of random variable X.

Value parameters

nu: the frequency vector

Attributes

See also: http://en.wikipedia.org/wiki/Entropy_%28information_theory%29

Given a probability vector px, compute the " base-k entropy" of random variable X.

Value parameters

b: the base for the logarithm
px: the probability vector

Attributes

See also: http://en.wikipedia.org/wiki/Entropy_%28information_theory%29

Given a joint probability matrix pxy, compute the "joint entropy" of random variables X and Y.

Value parameters

pxy: the joint probability matrix

Attributes

Given a joint probability matrix pxy and a conditional probability matrix py_x, compute the "conditional entropy" of random variable X given random variable Y.

Value parameters

px_y: the conditional probability matrix
pxy: the joint probability matrix

Attributes

Compute the Joint Frequency Table (JFT) for vector x and vector y. Count the number of cases where x(i) = v and y(i) = c.

Value parameters

k: the maximum value of y + 1 (number of classes)
vc: the number of disctinct values in vector x (value count)
x: the variable/feature vector
y: the response/classification vector

Attributes

Count the frequency of occurrence in vector x of value vl for each of y's classification values, e.g., x is column 2 (Humidity), vl is 1 (High)) and y can be 0 (no) or 1 (yes). Also, determine the fraction of training cases where the feature has this value (e.g., fraction where Humidity is High = 7/14).

Value parameters

k: the maximum value of y + 1
vl: one of the possible branch values for feature x (e.g., 1 (High))
x: the feature/column vector (e.g., column j of matrix)
y: the response/classification vector

Attributes

Count the frequency of occurrence in vector x of value vl for each of y's classification values, e.g., x is column 2 (Humidity), vl is 1 (High)) and y can be 0 (no) or 1 (yes). Also, determine the fraction of training cases where the feature has this value (e.g., fraction where Humidity is High = 7/14).

Value parameters

idx_: the index positions within x (if null, use all index positions)
k: the maximum value of y + 1
vl: one of the possible branch values for feature x (e.g., 1 (High))
x: the feature/column vector (e.g., column j of matrix)
y: the response/classification vector

Attributes

Count the frequency of occurrence in vector x of value vl for each of y's classification values, e.g., x is column 2 (Humidity), vl is 1 (High)) and y can be 0 (no) or 1 (yes). Also, determine the fraction of training cases where the feature has this value (e.g., fraction where Humidity is High = 7/14). This method works for vectors with integer or continuous values.

Value parameters

cont: whether feature/variable x is to be treated as continuous
idx_: the index positions within x (if null, use all index positions)
k: the maximum value of y + 1
thres: the splitting threshold for features/variables treated as continuous
vl: one of the possible branch values for feature x (e.g., 1 (High))
x: the feature/column vector (e.g., column j of matrix)
y: the response/classification vector

Attributes

Determine whether the vector px is a legitimate "probability vector". The elements of the vector must be non-negative and add to one.

Value parameters

px: the probability vector

Attributes

Determine whether the matrix pxy is a legitimate joint "probability matrix". The elements of the matrix must be non-negative and add to one.

Value parameters

pxy: the probability matrix

Attributes

Given two independent random variables X and Y, compute their "joint probability", which is the outer product of their probability vectors px and py, i.e., P(X = x_i, Y = y_j).

Attributes

Compute the Joint Probability Table (JPT) for vector x and vector y. Count the number of cases where x(i) = v and y(i) = c and divide by the number of instances/datapoints.

Value parameters

k: the maximum value of y + 1 (number of classes)
vc: the number of disctinct values in vector x (value count)
x: the variable/feature vector
y: the response/classification vector

Attributes

Given a joint probability matrix pxy, compute the "marginal probability" for random variable X, i.e, P(X = x_i).

Value parameters

pxy: the probability matrix

Attributes

Given a joint probability matrix pxy, compute the "marginal probability" for random variable Y, i.e, P(Y = y_j).

Value parameters

pxy: the probability matrix

Attributes

Given a joint probability matrix pxy, compute the mutual information for random variables X and Y.

Value parameters

px: the marginal probability vector for X
pxy: the probability matrix
py: the marginal probability vector for Y

Attributes

Given a joint probability matrix pxy, compute the mutual information for random variables X and Y.

Value parameters

pxy: the probability matrix

Attributes

Given a probability vector px, compute the "normalized entropy" of random variable X.

Value parameters

px: the probability vector

Attributes

See also: http://en.wikipedia.org/wiki/Entropy_%28information_theory%29

Given a probability p, compute the "positive log-probability". Requires the probability to be non-zero.

Value parameters

p: the given probability

Attributes

Given a probability vector px, compute the "positive log-probability". Requires each probability to be non-zero.

Value parameters

px: the probability vector

Attributes

Return the probability of discrete random variable y taking on any of k values

Value parameters

k: the maximum value of y + 1, e.g., { 0, 1, 2} => k = 3
y: the feature/column vector of integer values whose frequency counts are sought

Attributes

Given probability vectors px and qx, compute the "relative entropy".

Value parameters

px: the first probability vector
qx: the second probability vector (requires qx.dim >= px.dim)

Attributes

Given a frequency vector, convert it to a probability vector.

Value parameters

nu: the frequency vector

Attributes

Given a frequency vector, convert it to a probability vector.

Value parameters

n: the total number of instances/trials collected
nu: the frequency vector

Attributes

Given a frequency matrix, convert it to a probability matrix.

Value parameters

nu: the frequency matrix

Attributes

Given a frequency matrix, convert it to a probability matrix.

Value parameters

n: the total number of instances/trials collected
nu: the frequency matrix

Probability

Attributes

Members list

Value members

Concrete methods

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes

Value parameters

Attributes