mahout
- n. 象夫;驯象人;骑象人 (elephant keeper; elephant trainer; elephant rider)
-
This is the first step in enabling Mahout's learning algorithms to process a database.
这也是采用Mahout机器学习算法来处理数据的首要任务。
-
The output from such a test in Mahout is a data structure called a confusion matrix.
这种测试在Mahout中输出的数据结构是混淆矩阵。
-
The relationship between elephant and mahout is the key to successful training.
象和驯象员之间的关系是驯象成功与否的关键。
-
Mahout currently supports two related approaches to categorizing/classifying content based on Bayesian statistics.
Mahout目前支持两种根据贝氏统计来实现内容分类的方法。
-
Early one morning, Ladyface's mahout came to see him as usual.
一天清晨,“菩萨相”的看象人如常来看他。
-
You'll find more information on choosing a similarity measure at the Mahout Web site (see Resources).
有关如何选择相似度度量的更多信息,请访问Mahout网站(见参考资料)。
-
Run the clustering algorithm of choice using one of the many Hadoop-ready driver programs available in Mahout.
使用Mahout中提供的众多Hadoop就绪驱动程序之一来运行所选的聚类算法。
-
Another important aspect of the Mahout solution is the set of tools for creating vector representations of textual data.
Mahout另一个重点是,它提供一系列工具把文本数据表示成向量形式。
-
Mahout contains implementations for clustering, categorization, CF, and evolutionary programming.
Mahout包含许多实现,包括聚类、分类、CF和进化编程。
-
Thus, to run Mahout's classifier, you need to first train the model and then use that model to classify new content.
因此,要运行Mahout的分类器,您首先需要训练模型,然后再使用该模型对新内容进行分类。
-
The existing distributed machine learning algorithm library, Mahout, offers some classic classification algorithms, such as Bayes and decision trees.
现有的分布式机器学习算法库Mahout,提供了一些经典的分类挖掘算法,如贝叶斯、决策树等。
-
So the mahout had to report this to the king, although he said nothing about selling the friendly dog.
看象人不得不去报告国王,但是他只字不提卖掉那只狗的事。
-
Mahout currently provides tools for building a recommendation engine through the Taste library, a fast and flexible engine for CF.
Mahout目前提供了一些工具,可用于通过Taste库(一个针对CF的快速且灵活的引擎)建立推荐引擎。
-
I'll take a deeper look at each of these tasks at the conceptual level before exploring their implementations in Mahout.
在研究它们在Mahout中的实现之前,我将从概念的层面上更加深入地讨论这些任务。
-
I'll focus on the two most commonly used ones, supervised and unsupervised learning, because they are the main ones supported by Mahout.
我将重点讨论其中最常用的两个:监督学习和无监督学习,因为它们是Mahout支持的主要功能。
-
There are open source frameworks available, such as Apache Mahout, that provide useful implementations of these techniques (see Resources).
有一些开源的框架,比如Apache Mahout,提供了这些技术的有用实现(参见参考资料)。
-
Next up, I'll take a look at how to find similar articles by leveraging some of Mahout's clustering capabilities.
接下来,我将讨论如何通过利用Mahout的聚类功能来查找相似文章。
-
Then I'll show you how to use Mahout to do some interesting machine-learning tasks using the freely available Wikipedia data set.
然后,我将演示如何使用Mahout完成一些有趣的机器学习任务,这需要使用免费的Wikipedia数据集。
-
The first essential in elephant training is to assign to the animal a single mahout who will be entirely responsible for the job.
驯象中至关重要的是指派一名专门的驯象员,全面负责这项工作。
-
In your use of Mahout, you will likely want to try creating vectors in a variety of ways to see which yields the best results.
在使用Mahout时,您可能希望尝试采用不同的方法来创建矢量,以确定哪种方法的效果最好。
-
The next time you have a need to cluster, categorize, or recommend content, especially at large scale, give Apache Mahout a look.
下次在需要对内容进行聚类、分类或推荐时,特别是规模很大时,一定要考虑使用Apache Mahout。
-
To do so, you need the Mahout Job JAR, which is located in the hadoop directory in the sample code.
为此,您需要Mahout Job JAR,它位于示例代码的hadoop目录中。
-
Therein lies the premise and the promise of the field of machine learning and the project this article introduces: Apache Mahout (see Resources).
这其中就蕴含着机器学习领域以及本文章所介绍项目的前景:Apache Mahout(见参考资料)。
-
The elephant, his mind filled with the night's robber-talk, suddenly attacked his mahout.
这头大象的脑里满是强盗们在晚上的谈话,突然就攻击了自己的看象人。
-
Mahout provides driver programs for all of the clustering algorithms, including the k-Means algorithm, aptly named the KMeansDriver.
Mahout为所有聚类算法都提供了驱动程序,包括k-Means算法,其驱动程序被恰如其分地命名为KMeansDriver。
-
However, for this article, I'll show only the Naive Bayes approach, because it demonstrates the overall problem and inputs in Mahout.
但在本文中,我只会演示Naive Bayes方法,因为这能让您看到总体问题和Mahout中的输入。
-
The final results are stored in a single file labeled with the category name, one document per line, which is the input format that Mahout expects.
最终结果将存储在一个特定的文件中(该文件名包含类别名),并采用每行一个文档的格式,这是Mahout所需的输入格式。
-
Mahout supports several clustering-algorithm implementations, all written in Map-Reduce, each with its own set of goals and criteria.
Mahout支持一些聚类算法实现(都是使用Map-Reduce编写的),它们都有一组各自的目标和标准。
-
After giving a brief overview of machine-learning concepts, I'll introduce you to the Apache Mahout project's features, history, and goals.
在简要概述机器学习的概念之后,我将介绍Apache Mahout项目的特性、历史和目标。
-
The first essential in elephant training is to assign to the animal a single mahout. The next stage is to get the elephant to the training establishment.
驯象中至关重要的是指派一名专门的驯象员,下一步就是把象送到驯象基地。