Search results: data sets
Description: Data sets used by the application above, stored in MATLAB data file format.
Platform: |
Size: 508928 |
Author: zql |
Hits:
Description: A prediction method based on a self-generating neural network, written in MATLAB (data set included).
Platform: |
Size: 28672 |
Author: 邱大山 |
Hits:
Description: Uses 3,022 cases obtained from a hospital medical records office as the sample. After designing the sample database and a multidimensional data set for diabetic complications, the work focuses on knowledge discovery in the epidemiology of diabetic complications. It studies the design and implementation of a mining model and algorithm engine that quantifies qualitative data: association models are introduced into the epidemiological study of diabetic complications, and the Apriori property from set theory is applied to design an association-rule mining engine.
Platform: |
Size: 313344 |
Author: Eric Cheng |
Hits:
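The Apriori property named in the entry above says that every subset of a frequent itemset must itself be frequent, so candidates can be pruned level by level. The following is a minimal Python sketch of that idea, not the code in the listed archive; the function name `apriori`, the `min_support` parameter, and the toy records are all hypothetical.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {itemset: count} for every itemset seen in >= min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)
    k = 2
    while current:
        # Join step: unions of frequent (k-1)-itemsets that contain exactly k items.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step (Apriori property): every (k-1)-subset must already be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        # Count the surviving candidates against the transactions.
        current = {}
        for c in candidates:
            n = sum(1 for t in transactions if c <= t)
            if n >= min_support:
                current[c] = n
        frequent.update(current)
        k += 1
    return frequent

# Hypothetical toy records, not data from the listed archive.
records = [{"a", "b", "c"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
result = apriori(records, min_support=3)
```

With these records the pair `{a, b}` appears only twice, so the triple `{a, b, c}` is pruned without ever being counted.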
Description: Uses 3,022 cases obtained from a hospital medical records office as the sample. After designing the sample database and a multidimensional data set for diabetic complications, the work focuses on knowledge discovery in the epidemiology of diabetic complications. It studies the design and implementation of a mining model and algorithm engine that quantifies qualitative data, introducing association models into the epidemiological study of diabetic complications. Decision-tree techniques are used to analyze the data samples, with C4.5 used to find the optimal decision tree.
Platform: |
Size: 308224 |
Author: Eric Cheng |
Hits:
Description: A program to find frequent itemsets with the relim algorithm (recursive elimination), which is inspired by the FP-growth algorithm, but does its work without prefix trees or any other complicated data structures. The main strength of this algorithm is not its speed (although it is not slow, and even outperforms apriori and eclat on some data sets!), but the simplicity of its structure. Basically all the work is done in one recursive function of about 60-70 lines of code. The current version can only find free item sets. An extension to closed and maximal item sets is possible and may be available in the future.
Platform: |
Size: 30720 |
Author: clark |
Hits:
Description: Test data set for clustering; opens in MATLAB. 3 clusters, 10,000 test points, two-dimensional.
Platform: |
Size: 70656 |
Author: adrian |
Hits:
Description: In this demo, I use the EM algorithm with a Rauch-Tung-Striebel smoother and an M step, which I've recently derived, to train a two-layer perceptron, so as to classify medical data (kindly provided by Steve Roberts and Will Penny from EE, Imperial College). The data and simulations are described in: Nando de Freitas, Mahesan Niranjan and Andrew Gee, "Nonlinear State Space Estimation with Neural Networks and the EM algorithm". After downloading the file, type "tar -xf EMdemo.tar" to uncompress it. This creates the directory EMdemo containing the required m files. Go to this directory, load matlab5 and type "EMtremor". The figures will then show you the simulation results, including ROC curves, likelihood plots, decision boundaries with error bars, etc. WARNING: Do make sure that you monitor the log-likelihood and check that it is increasing. Due to numerical errors, it might show glitches for some data sets.
Platform: |
Size: 197632 |
Author: 晨间 |
Hits:
Description: A well-known data mining test data set, maintained and provided by UCI.
adult.data has 15 attributes and 32,561 samples.
Platform: |
Size: 369664 |
Author: laishuguang |
Hits:
Description: % Example 15-1: NaN data in an analysis
a = magic(3)
a(2,2) = NaN % use NaN to mark missing data
sum(a) % sum over the data set
Platform: |
Size: 3072 |
Author: 任国栋 |
Hits:
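For readers without MATLAB, the NaN example in the entry above can be mirrored in plain Python. This is only an illustrative sketch: the matrix is `magic(3)` written out by hand, and the names `col_sums` and `col_sums_skip` are made up for this example.

```python
import math

# magic(3) as MATLAB returns it, written out by hand.
a = [[8, 1, 6],
     [3, 5, 7],
     [4, 9, 2]]
a[1][1] = math.nan  # MATLAB's a(2,2); Python indices start at 0

# Column sums, like MATLAB's sum(a): NaN propagates into the affected column.
col_sums = [sum(col) for col in zip(*a)]

# Column sums that skip the missing value instead of propagating it.
col_sums_skip = [sum(x for x in col if not math.isnan(x)) for col in zip(*a)]
```

The middle column sums to NaN in the first case and to 10 in the second, which is the point of the original example: a single missing value contaminates any plain sum that touches it.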
Description: Source code for a network intrusion detection system that detects the presence of network intrusions. The data source is a collected dump data set.
Platform: |
Size: 13312 |
Author: 王佳妮 |
Hits:
Description: A PCA feature extraction experiment on one of the UCI data sets; produces a scatter plot of the data in the two-dimensional PCA feature space.
Platform: |
Size: 1024 |
Author: chris |
Hits:
Description: Clusters 150 data sets using the k-means algorithm.
Platform: |
Size: 14336 |
Author: 朱东阁 |
Hits:
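The k-means clustering named in the entry above alternates between assigning each point to its nearest centroid and moving each centroid to the mean of its points. Below is a minimal Python sketch of that loop (Lloyd's algorithm), not the code in the listed archive; the function name, the choice of the first `k` points as initial centroids, and the sample points are assumptions made for illustration.

```python
import math

def kmeans(points, k, iterations=100):
    """Plain Lloyd's algorithm; the first k points serve as initial centroids."""
    centroids = [tuple(p) for p in points[:k]]
    labels = []
    for _ in range(iterations):
        # Assignment step: index of the nearest centroid for every point.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(c) / len(members) for c in zip(*members))
                )
            else:
                new_centroids.append(centroids[j])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return centroids, labels

# Hypothetical two-dimensional points forming two well-separated groups.
points = [(0.0, 0.0), (10.0, 10.0), (0.0, 1.0),
          (1.0, 0.0), (10.0, 11.0), (11.0, 10.0)]
centroids, labels = kmeans(points, k=2)
```

Taking the first `k` points as starting centroids keeps the sketch deterministic; real implementations usually pick random or k-means++ starting points.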
Description: Matlab code for Dimensionality Reduction of Clustered Data Sets
Platform: |
Size: 4096 |
Author: Chetan J |
Hits:
Description: A source data set in ARFF format that can be used with the WEKA mining software.
Platform: |
Size: 153600 |
Author: 李历 |
Hits:
Description: Partial least squares, as described here, means applying principal component analysis to reduce the dimensionality of the data set before performing least-squares linear regression. The source code below is unabridged and provided free of charge by the GreenSim team; please credit the GreenSim team (http://blog.sina.com.cn/greensim) when reposting.
Platform: |
Size: 2048 |
Author: biebietuo |
Hits:
Description: This program analyzes and processes bearing fault data in the time domain. It extracts a range of time-domain feature parameters, including mean, RMS, kurtosis, margin factor, and waveform factor, and is suitable for beginners extracting parameters for bearing fault diagnosis. (Includes 21 sets of bearing fault data in 3 classes: normal, inner race, outer race.)
Platform: |
Size: 38912 |
Author: 荣誉 |
Hits:
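The time-domain features listed in the entry above can be sketched in a few lines of Python. The formulas below are the commonly used textbook definitions (waveform factor as RMS over mean absolute value, margin factor as peak over squared mean root amplitude); whether the listed archive uses exactly these definitions is an assumption, and the function name and test signal are hypothetical.

```python
import math

def time_domain_features(x):
    """Common time-domain indicators for a vibration signal given as a list of floats."""
    n = len(x)
    mean = sum(x) / n
    rms = math.sqrt(sum(v * v for v in x) / n)
    variance = sum((v - mean) ** 2 for v in x) / n
    # Kurtosis: fourth central moment normalized by the squared variance.
    kurtosis = (sum((v - mean) ** 4 for v in x) / n) / variance ** 2
    abs_mean = sum(abs(v) for v in x) / n
    peak = max(abs(v) for v in x)
    # Squared mean of the square roots of the absolute values.
    root_amplitude = (sum(math.sqrt(abs(v)) for v in x) / n) ** 2
    return {
        "mean": mean,
        "rms": rms,
        "kurtosis": kurtosis,
        "waveform_factor": rms / abs_mean,
        "margin_factor": peak / root_amplitude,
    }

# Hypothetical unit square wave, not data from the listed archive.
features = time_domain_features([1.0, -1.0, 1.0, -1.0])
```

A unit square wave is a convenient sanity check because every indicator except the mean comes out as exactly 1.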
Description: Upload data sets for BIG DATA
Platform: |
Size: 147456 |
Author: mvharish |
Hits:
Description: SIMPLE ALBP DATA SETS COLLECTED FROM FACTORIES
Platform: |
Size: 79872 |
Author: YONAS |
Hits:
Description: Data Mining: Concepts and Techniques. This book is an introduction to what data mining is and what knowledge discovery in databases is. The material is presented from a database perspective, with particular emphasis on the basic data mining concepts and techniques for discovering interesting data patterns hidden in large data sets. The implementation methods discussed are aimed mainly at developing scalable, effective data mining tools.
Platform: |
Size: 6644736 |
Author: 猫熊 |
Hits:
Description: Geolife GPS trajectory data set: user guide
This GPS trajectory data set was collected in the (Microsoft Research Asia) Geolife project by 178 users over a period of four years (April 2007 to October 2011). A GPS trajectory in this data set is represented by a sequence of time-stamped points, each containing latitude, longitude, and altitude information. The data set contains 17,621 trajectories with a total distance of 1,251,654 km and a total duration of 48,203 hours. The trajectories were recorded by different GPS loggers and GPS phones and have a variety of sampling rates; 91% of the trajectories are logged in a dense representation, for example every 1 to 5 seconds or every 5 to 10 meters per point. The data set can be used in many research fields, such as mobility pattern mining, user activity recognition, location-based social networks, location privacy, and location recommendation.
Platform: |
Size: 22576128 |
Author: 李白43 |
Hits:
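Since each trajectory in the entry above is a sequence of time-stamped points with latitude and longitude, a per-trajectory distance like the totals quoted there can be approximated by summing great-circle distances between consecutive points. The sketch below uses the haversine formula; the `(timestamp, latitude, longitude, altitude)` tuple layout, the function names, and the sample points are assumptions for illustration and do not reflect the actual Geolife file format.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    h = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def trajectory_length_km(points):
    """Sum of distances between consecutive points.

    Each point is assumed to be (timestamp, latitude, longitude, altitude);
    altitude is ignored here.
    """
    return sum(
        haversine_km(a[1], a[2], b[1], b[2])
        for a, b in zip(points, points[1:])
    )

# Hypothetical three-point trajectory along the equator.
track = [(0, 0.0, 0.0, 0.0), (5, 0.0, 1.0, 0.0), (10, 0.0, 2.0, 0.0)]
length = trajectory_length_km(track)
```

Ignoring altitude and treating the Earth as a sphere is a simplification that is usually acceptable for points a few seconds or meters apart.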