Packages

package clusterer

The clusterer package contains classes, traits and objects for clustering algorithms.

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. clusterer
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. Protected

Type Members

  1. case class Cluster(c: Int = Cluster.next (), np: Int = 0) extends Error with Product with Serializable

    The Cluster case class maintains information about clusters, the cluster id, center/centroid, cluster size, and measure of error.

    The Cluster case class maintains information about clusters, the cluster id, center/centroid, cluster size, and measure of error. Note: the cluster assignment function as an array 'to_c' indicates how points are assigned to clusters.

    c

    the cluster id

    np

    the number of points in the cluster (size)

    See also

    package.scala for the definition of the 'distance' method

  2. trait Clusterer extends AnyRef

    The Clusterer trait provides a common framework for several clustering algorithms.

    The Clusterer trait provides a common framework for several clustering algorithms.

    See also

    package.scala for 'distance' function

  3. class ClusteringPredictor extends PredictorMat

    The ClusteringPredictor class is used to predict a response value for new vector 'z'.

    The ClusteringPredictor class is used to predict a response value for new vector 'z'. It works by finding the cluster that the point 'z' would belong to. The recorded response value for 'y' is then given as the predicted response. The per cluster recorded reponse value is the consensus (e.g., average) of the individual predictions for 'z' from the members of the cluster. Training involves clustering the points in data matrix 'x' and then computing each clusters reponse.

  4. class HierClusterer extends Clusterer with Error

    Cluster several vectors/points using hierarchical clustering.

    Cluster several vectors/points using hierarchical clustering. Start with each point forming its own cluster and merge clusters until there are only 'k'.

  5. class KMeansClusterer extends Clusterer with Error

    The KMeansClusterer class cluster several vectors/points using k-means clustering.

    The KMeansClusterer class cluster several vectors/points using k-means clustering. Randomly assign points to 'k' clusters (primary technique). Iteratively, reassign each point to the cluster containing the closest centroid. Stop when there are no changes to the clusters.

    See also

    KMeansClusterer2 for secondary technique. -----------------------------------------------------------------------------

  6. class KMeansClusterer2 extends KMeansClusterer

    The KMeansClusterer2 class cluster several vectors/points using k-means clustering.

    The KMeansClusterer2 class cluster several vectors/points using k-means clustering. Randomly pick 'k' points as initial centroids (secondary technique). Iteratively, reassign each point to the cluster containing the closest centroid. Stop when there are no changes to the clusters.

    See also

    KMeansClusterer for primary technique. -----------------------------------------------------------------------------

  7. class KMeansClustererHW extends KMeansClusterer

    The KMeansClustererHW class cluster several vectors/points using the Hartigan-Wong algorithm.

  8. class KMeansClustererPP extends KMeansClustererHW

    The KMeansClustererPP class cluster several vectors/points using the Hartigan-Wong algorithm.

  9. class KMeansClustererSSE extends Clusterer with Error

    The KMeansClustererSSE class cluster several vectors/points using k-means clustering.

    The KMeansClustererSSE class cluster several vectors/points using k-means clustering. Randomly assign points to 'k' clusters (primary technique). Iteratively, reassign each point to the cluster containing the closest centroid. Stop when there are no changes to the clusters.

    See also

    KMeansClusterer2 for secondary technique. -----------------------------------------------------------------------------

  10. class KMeansPPClusterer extends KMeansClusterer

    The KMeansPPClusterer class cluster several vectors/points using the k-means++ clustering technique.

    The KMeansPPClusterer class cluster several vectors/points using the k-means++ clustering technique. -----------------------------------------------------------------------------

    See also

    ilpubs.stanford.edu:8090/778/1/2006-13.pdf -----------------------------------------------------------------------------

  11. trait KMeansPPClustererTester extends AnyRef

    The KMeansPPClustererTester trait includes a test function to aid in the testing of the KMeansPPClusterer class.

  12. class MarkovClusterer extends Clusterer with Error

    The MarkovClusterer class implements a Markov Clustering Algorithm 'MCL' and is used to cluster nodes in a graph.

    The MarkovClusterer class implements a Markov Clustering Algorithm 'MCL' and is used to cluster nodes in a graph. The graph is represented as an edge-weighted adjacency matrix (a non-zero cell indicates nodes i and j are connected).

    The primary constructor takes either a graph (adjacency matrix) or a Markov transition matrix as input. If a graph is passed in, the normalize method must be called to convert it into a Markov transition matrix. Before normalizing, it may be helpful to add self loops to the graph. The matrix (graph or transition) may be either dense or sparse. See the MarkovClustererTest object at the bottom of the file for examples.

  13. class RandomGraph extends AnyRef

    The RandomGraph class generates random undirected graphs with clusters (as adjacency matrices).

  14. class TightClusterer extends AnyRef

    The TightClusterer class uses tight clustering to eliminate points that do not not fit well in any cluster.

Value Members

  1. def dist(u: VectoD, v: VectoD): Double

    Compute a distance metric (e.g., distance squared) between vectors/points 'u' and 'v'.

    Compute a distance metric (e.g., distance squared) between vectors/points 'u' and 'v'. Override this methods to use a different metric, e.g., 'norm' - the Euclidean distance, 2-norm 'norm1' - the Manhattan distance, 1-norm

    u

    the first vector/point

    v

    the second vector/point

  2. object Algorithm extends Enumeration

    The Algorithm object specifies which algorithm to use.

  3. object Cluster extends Serializable

    The Cluster object is used for creating auto-increment identifiers for cluster ids.

  4. object Clusterer

    The Clusterer provides a simple dataset (matrix of data points) for initial testing of clustering algorithms.

  5. object ClusteringPredictor

    The ClusteringPredictor companion object provides a factory functions.

  6. object ClusteringPredictorTest extends App

    The ClusteringPredictorTest object is used to test the ClusteringPredictor class.

    The ClusteringPredictorTest object is used to test the ClusteringPredictor class. > runMain scalation.analytics.clusterer.ClusteringPredictorTest

  7. object ClusteringPredictorTest2 extends App

    The ClusteringPredictorTest2 object is used to test the ClusteringPredictor class.

    The ClusteringPredictorTest2 object is used to test the ClusteringPredictor class. > runMain scalation.analytics.clusterer.ClusteringPredictorTest2

  8. object ClusteringPredictorTest3 extends App

    The ClusteringPredictorTest3 object is used to test the ClusteringPredictor class.

    The ClusteringPredictorTest3 object is used to test the ClusteringPredictor class. Test on AutoMPG dataset and compare with KNN_Predictor. > runMain scalation.analytics.clusterer.ClusteringPredictorTest3

  9. object GapStatistic

    The GapStatistic object is used to help determine the optimal number of clusters for a clusterer by comparing results to a reference distribution.

    The GapStatistic object is used to help determine the optimal number of clusters for a clusterer by comparing results to a reference distribution. -----------------------------------------------------------------------------

    See also

    web.stanford.edu/~hastie/Papers/gap.pdf

  10. object GapStatisticTest extends App

    The GapStatisticTest object is used to test the GapStatistic object.

    The GapStatisticTest object is used to test the GapStatistic object. > runMain scalation.analytics.clusterer.GapStatisticTest

  11. object GapStatisticTest2 extends App

    The GapStatisticTest2 object is used to test the GapStatistic object.

    The GapStatisticTest2 object is used to test the GapStatistic object. > runMain scalation.analytics.clusterer.GapStatisticTest2

  12. object HierClustererTest extends App

    The HierClusterer object is used to test the HierClusterer class.

    The HierClusterer object is used to test the HierClusterer class. > runMain scalation.analytics.clusterer.HierClustererTest

  13. object HierClustererTest2 extends App

    The HierClustererTest2 object is used to test the HierClusterer class.

    The HierClustererTest2 object is used to test the HierClusterer class. > runMain scalation.analytics.clusterer.HierClustererTest2

  14. object KMeansClusterer2Test extends App

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class.

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class. > runMain scalation.analytics.clusterer.KMeansClusterer2Test

  15. object KMeansClusterer2Test2 extends App

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class.

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class. > runMain scalation.analytics.clusterer.KMeansClusterer2Test2

  16. object KMeansClusterer2Test3 extends App

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class.

    The KMeansClusterer2Test2 object is used to test the KMeansClusterer2 class. > runMain scalation.analytics.clusterer.KMeansClusterer2Test3

  17. object KMeansClusterer2Test4 extends App

    The KMeansClusterer2Test4 object is used to test the KMeansClusterer2 class.

    The KMeansClusterer2Test4 object is used to test the KMeansClusterer2 class. > runMain scalation.analytics.clusterer.KMeansClusterer2Test4

  18. object KMeansClustererHWTest extends App

    The KMeansClustererTestHW object is used to test the KMeansClustererHW class.

    The KMeansClustererTestHW object is used to test the KMeansClustererHW class. > runMain scalation.analytics.clusterer.KMeansClustererHWTest

  19. object KMeansClustererHWTest2 extends App

    The KMeansClustererHWTest2 object is used to test the KMeansClustererHW class.

    The KMeansClustererHWTest2 object is used to test the KMeansClustererHW class. > runMain scalation.analytics.clusterer.KMeansClustererHWTest2

  20. object KMeansClustererHWTest3 extends App

    The KMeansClustererHWTest3 object is used to test the KMeansClustererHW class.

    The KMeansClustererHWTest3 object is used to test the KMeansClustererHW class. > runMain scalation.analytics.clusterer.KMeansClustererHWTest3

  21. object KMeansClustererPPTest extends App

    The KMeansClustererTestPP object is used to test the KMeansClustererPP class.

    The KMeansClustererTestPP object is used to test the KMeansClustererPP class. > runMain scalation.analytics.clusterer.KMeansClustererPPTest

  22. object KMeansClustererPPTest2 extends App

    The KMeansClustererPPTest object is used to test the KMeansClustererPP class.

    The KMeansClustererPPTest object is used to test the KMeansClustererPP class. > runMain scalation.analytics.clusterer.KMeansClustererPPTest2

  23. object KMeansClustererPPTest3 extends App

    The KMeansClustererPPTest3 object is used to test the KMeansClustererPP class.

    The KMeansClustererPPTest3 object is used to test the KMeansClustererPP class. > runMain scalation.analytics.clusterer.KMeansClustererPPTest3

  24. object KMeansClustererSSETest extends App

    The KMeansClustererSSETest object is used to test the KMeansClustererSSE class.

    The KMeansClustererSSETest object is used to test the KMeansClustererSSE class. > runMain scalation.analytics.clusterer.KMeansClustererSSETest

  25. object KMeansClustererSSETest2 extends App

    The KMeansClustererSSETest2 object is used to test the KMeansClustererSSE class.

    The KMeansClustererSSETest2 object is used to test the KMeansClustererSSE class. > runMain scalation.analytics.clusterer.KMeansClustererSSETest2

  26. object KMeansClustererSSETest3 extends App

    The KMeansClustererSSETest2 object is used to test the KMeansClustererSSE class.

    The KMeansClustererSSETest2 object is used to test the KMeansClustererSSE class. > runMain scalation.analytics.clusterer.KMeansClustererSSETest3

  27. object KMeansClustererSSETest4 extends App

    The KMeansClustererSSETest4 object is used to test the KMeansClustererSSE class.

    The KMeansClustererSSETest4 object is used to test the KMeansClustererSSE class. > runMain scalation.analytics.clusterer.KMeansClustererSSETest4

  28. object KMeansClustererTest extends App

    The KMeansClustererTest object is used to test the KMeansClusterer class.

    The KMeansClustererTest object is used to test the KMeansClusterer class. > runMain scalation.analytics.clusterer.KMeansClustererTest

  29. object KMeansClustererTest2 extends App

    The KMeansClustererTest2 object is used to test the KMeansClusterer class.

    The KMeansClustererTest2 object is used to test the KMeansClusterer class. > runMain scalation.analytics.clusterer.KMeansClustererTest2

  30. object KMeansClustererTest3 extends App

    The KMeansClustererTest2 object is used to test the KMeansClusterer class.

    The KMeansClustererTest2 object is used to test the KMeansClusterer class. > runMain scalation.analytics.clusterer.KMeansClustererTest3

  31. object KMeansClustererTest4 extends App

    The KMeansClustererTest4 object is used to test the KMeansClusterer class.

    The KMeansClustererTest4 object is used to test the KMeansClusterer class. > runMain scalation.analytics.clusterer.KMeansClustererTest4

  32. object KMeansPPClusterer

    The KMeansPPClusterer companion object supplies a factory function.

  33. object KMeansPPClustererTest extends App with KMeansPPClustererTester

    The KMeansPPClustererTest object is used to test the KMeansPlusPlusClusterer class.

    The KMeansPPClustererTest object is used to test the KMeansPlusPlusClusterer class. > runMain scalation.analytics.clusterer.KMeansPPClustererTest

  34. object KMeansPPClustererTest2 extends App with KMeansPPClustererTester

    The KMeansPPClustererTest2 object is used to test the KMeansPlusPlusClusterer class.

    The KMeansPPClustererTest2 object is used to test the KMeansPlusPlusClusterer class. > runMain scalation.analytics.clusterer.KMeansPPClustererTest2

  35. object KMeansPPClustererTest3 extends App with KMeansPPClustererTester

    The KMeansPPClustererTest3 object is used to test the KMeansPlusPlusClusterer class.

    The KMeansPPClustererTest3 object is used to test the KMeansPlusPlusClusterer class. > runMain scalation.analytics.clusterer.KMeansPPClustererTest3

  36. object KMeansPPClustererTest4 extends App with KMeansPPClustererTester

    The KMeansPPClustererTest4 object is used to test the KMeansPlusPlusClusterer class.

    The KMeansPPClustererTest4 object is used to test the KMeansPlusPlusClusterer class. > runMain scalation.analytics.clusterer.KMeansPPClustererTest4

  37. object MarkovClustererTest extends App

    The MarkovClustererTest object is used to test the MarkovClusterer class.

    The MarkovClustererTest object is used to test the MarkovClusterer class.

    See also

    www.cs.ucsb.edu/~xyan/classes/CS595D-2009winter/MCL_Presentation2.pdf ^ > runMain scalation.analytics.clusterer.MarkovClustererTest

  38. object MarkovClustererTest2 extends App

    The MarkovClustererTest2 object is used to test the MarkovClusterer class.

    The MarkovClustererTest2 object is used to test the MarkovClusterer class. ^ > runMain scalation.analytics.clusterer.MarkovClustererTest2

  39. object RandomGraphTest extends App

    The RandomGraphTest object is used to test the RandomGraph class.

    The RandomGraphTest object is used to test the RandomGraph class. > runMain scalation.analytics.cluterer.RandomGraphTest

  40. object TightClustererTest extends App

    The TightClustererTest is used to test the TightClusterer class.

    The TightClustererTest is used to test the TightClusterer class. > runMain scalation.analytics.clusterer.TightClustererTest

Inherited from AnyRef

Inherited from Any

Ungrouped