DecisionTreeID3

Instance Constructors

new DecisionTreeID3(x: MatrixI, y: VectorI, fn: Array[String], k: Int, cn: Array[String], vc: VectorI = null)

x
the data vectors stored as rows of a matrix
y
the class array, where y_i = class for row i of the matrix x
fn
the names for all features/variables
k
the number of classes
cn
the names for all classes
vc
the value count array indicating number of distinct values per feature

Type Members

case class FeatureNode(f: Int, branches: HashMap[Int, Node]) extends Node with Product with Serializable
case class LeafNode(y: Int) extends Node with Product with Serializable
abstract class Node extends AnyRef
type Path = List[(Int, Int)]

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
def buildTree(path: Path): Node

Extend the tree given a path e.
Extend the tree given a path e.g. ((outlook, sunny), ...).
path
an existing path in the tree ((feature, value), ...)
def classify(z: VectorI): (Int, String)

Given a data vector z, classify it returning the class number (0, .
Given a data vector z, classify it returning the class number (0, ..., k-1) by following a decision path from the root to a leaf.
z
the data vector to classify

Definition Classes
DecisionTreeID3 → Classifier
def classify(z: VectorD): (Int, String)

Given a new continuous data vector 'z', determine which class it belongs to, by first rounding it to an integer-valued vector.
Given a new continuous data vector 'z', determine which class it belongs to, by first rounding it to an integer-valued vector.
z
the vector to classify

Definition Classes
ClassifierInt → Classifier
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def dataset(f: Int, path: Path): Array[(Int, Int)]

Extract column from matrix, filtering out values rows that are not on path.
Extract column from matrix, filtering out values rows that are not on path.
f
the feature to consider (e.g., 2 (Humidity))
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
def flaw(method: String, message: String): Unit

Show the flaw by printing the error message.
Show the flaw by printing the error message.
method
the method where the error occurred
message
the error message

Definition Classes
Error
def frequency(dset: Array[(Int, Int)], value: Int): (Double, VectorD)

Given a feature column (e.
Given a feature column (e.g., 2 (Humidity)) and a value (e.g., 1 (High)) use the frequency of ocurrence of the value for each classification (e.g., 0 (no), 1 (yes)) to estimate k probabilities. Also, determine the fraction of training cases where the feature has this value (e.g., fraction where Humidity is High = 7/14).
dset
the list of data set tuples to consider (e.g. value, row index)
value
one of the possible values for this feature (e.g., 1 (High))
def gain(f: Int, path: Path): Double

Compute the information gain due to using the values of a feature/attribute to distinguish the training cases (e.
Compute the information gain due to using the values of a feature/attribute to distinguish the training cases (e.g., how well does Humidity with its values Normal and High indicate whether one will play tennis).
f
the feature to consider (e.g., 2 (Humidity))
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
val m: Int

the number of data vectors in training-set (# rows)
the number of data vectors in training-set (# rows)

Attributes
protected
Definition Classes
ClassifierInt
val md: Double

the training-set size as a Double
the training-set size as a Double

Attributes
protected
Definition Classes
ClassifierInt
def mode(a: Array[Int]): Int

Find the most frequent classification.
Find the most frequent classification.
a
array of discrete classifications
val n: Int

the number of features/variables (# columns)
the number of features/variables (# columns)

Attributes
protected
Definition Classes
ClassifierInt
val nd: Double

the feature-set size as a Double
the feature-set size as a Double

Attributes
protected
Definition Classes
ClassifierInt
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def test(xx: MatrixI, yy: VectorI): Double

Test the quality of the training with a test-set and return the fraction of correct classifications.
Test the quality of the training with a test-set and return the fraction of correct classifications.
xx
the integer-valued test vectors stored as rows of a matrix
yy
the test classification vector, where yy_i = class for row i of xx

Definition Classes
ClassifierInt
def toString(): String

Definition Classes
AnyRef → Any
def train(): Unit

Train the decsion tree.
Train the decsion tree.

Definition Classes
DecisionTreeID3 → Classifier
def vc_default: VectorI

Return default values for binary input data (value count (vc) set to 2).
Return default values for binary input data (value count (vc) set to 2).

Definition Classes
ClassifierInt
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

class DecisionTreeID3 extends ClassifierInt

Instance Constructors

new DecisionTreeID3(x: MatrixI, y: VectorI, fn: Array[String], k: Int, cn: Array[String], vc: VectorI = null)

Type Members

case class FeatureNode(f: Int, branches: HashMap[Int, Node]) extends Node with Product with Serializable

case class LeafNode(y: Int) extends Node with Product with Serializable

abstract class Node extends AnyRef

type Path = List[(Int, Int)]

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

final def asInstanceOf[T0]: T0

def buildTree(path: Path): Node

def classify(z: VectorI): (Int, String)

def classify(z: VectorD): (Int, String)

def clone(): AnyRef

def dataset(f: Int, path: Path): Array[(Int, Int)]

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

def flaw(method: String, message: String): Unit

def frequency(dset: Array[(Int, Int)], value: Int): (Double, VectorD)

def gain(f: Int, path: Path): Double

final def getClass(): Class[_]

def hashCode(): Int

final def isInstanceOf[T0]: Boolean

val m: Int

val md: Double

def mode(a: Array[Int]): Int

val n: Int

val nd: Double

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

final def synchronized[T0](arg0: ⇒ T0): T0

def test(xx: MatrixI, yy: VectorI): Double

def toString(): String

def train(): Unit

def vc_default: VectorI

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from ClassifierInt

Inherited from Error

Inherited from Classifier

Inherited from AnyRef

Inherited from Any

Ungrouped