rminer - A open-source library that facilitates the use of Data Mining
techniques in R
This package was used in several Data Mining applications: intensive care medicine, meat and wine quality assessment, civil engineering, forest fires prediction, modeling student performance, time series forecasting, spam e-mail detection, ... Also available at Comprehensive R Archive Network (CRAN).
Springer Book: Modern Optimization with R (R code, data, ...)
Free public datasets what I studied:
-
-WineQuality (regression/classification) donated to the UCI ML repository).
-
-S-Enron corpus (personalized spam e-mail classification).
-
-Bank Marketing (classification, donated to UCI ML repository).
-
-Internet Traffic Time Series Datasets (also available at tsdl R package - index 643 to 648).
-
-Student Performance (regression/classification, donated to the UCI ML repository).
-
-Stock Market Lexicon (with more than 20.000 microblog terms associated with positive or negative scores, available at GitHub).
-
-Online News Popularity (regression/classification, donated to the UCI ML repository).
-
-CS Abstracts Dataset (sequential classification).
-
-Twitter-country-geolocation (classification).
-
-Cross-source cross-domain sentiment analysis (classification).