Package | Description |
---|---|
com.cloudera.oryx.app.batch.mllib.als |
Contains an app tier implementation of a Batch Layer, for ALS recommendations, using Spark MLlib.
|
com.cloudera.oryx.app.kmeans |
Support code for the k-means application functionality common to several layers.
|
com.cloudera.oryx.app.rdf |
Support code for random decision forest application functionality common to several layers,
mostly the scoring support.
|
com.cloudera.oryx.app.serving.als.model |
Data model support for the ALS recommender Serving Layer app.
|
com.cloudera.oryx.app.serving.kmeans.model |
Data model support for the k-means clustering Serving Layer app.
|
com.cloudera.oryx.common.collection |
Collection-related utility methods and support classes.
|
com.cloudera.oryx.kafka.util |
Kafka-related utility code and command-line applications.
|
com.cloudera.oryx.ml |
Contains the core support for the ML tier of the Batch and Speed layers.
|
Modifier and Type | Method and Description |
---|---|
protected Pair<org.apache.spark.api.java.JavaRDD<String>,org.apache.spark.api.java.JavaRDD<String>> |
ALSUpdate.splitNewDataToTrainTest(org.apache.spark.api.java.JavaRDD<String> newData)
Implementation which splits based solely on time.
|
Modifier and Type | Method and Description |
---|---|
static Pair<ClusterInfo,Double> |
KMeansUtils.closestCluster(List<ClusterInfo> clusters,
DistanceFn<double[]> distanceFn,
double[] vector) |
Modifier and Type | Method and Description |
---|---|
static Pair<DecisionForest,CategoricalValueEncodings> |
RDFPMMLUtils.read(org.dmg.pmml.PMML pmml) |
Modifier and Type | Method and Description |
---|---|
List<Pair<String,float[]>> |
ALSServingModel.getKnownItemVectorsForUser(String user) |
Stream<Pair<String,Double>> |
ALSServingModel.topN(CosineDistanceSensitiveFunction scoreFn,
ToDoubleObjDoubleBiFunction<String> rescoreFn,
int howMany,
Predicate<String> allowedPredicate) |
Modifier and Type | Method and Description |
---|---|
Pair<ClusterInfo,Double> |
KMeansServingModel.closestCluster(double[] vector) |
Modifier and Type | Method and Description |
---|---|
static <C extends Comparable<C>,D> |
Pairs.orderByFirst(Pairs.SortOrder order) |
static <C,D extends Comparable<D>> |
Pairs.orderBySecond(Pairs.SortOrder order) |
Modifier and Type | Method and Description |
---|---|
static void |
KafkaUtils.setOffsets(String zkServers,
String groupID,
Map<Pair<String,Integer>,Long> offsets) |
Modifier and Type | Method and Description |
---|---|
protected Pair<org.apache.spark.api.java.JavaRDD<M>,org.apache.spark.api.java.JavaRDD<M>> |
MLUpdate.splitNewDataToTrainTest(org.apache.spark.api.java.JavaRDD<M> newData)
Default implementation which randomly splits new data into train/test sets.
|
Copyright © 2014–2018. All rights reserved.