rminer - A open-source library that facilitates the use of Data Mining

techniques in R


This package was used in several Data Mining applications: intensive care medicine, meat and wine quality assessment, civil engineering, forest fires prediction, modeling student performance, time series forecasting, spam e-mail detection, ... Also available at Comprehensive R Archive Network (CRAN).


Springer Book: Modern Optimization with R (R code, data, ...)


Free public datasets what I studied:

  1. -Forest Fires (regression, donated to the UCI Machine Learning (ML) repository, top ten dataset).     

  2. -WineQuality (regression/classification, donated to the UCI ML repository, top ten dataset).

  3. -S-Enron corpus (personalized spam e-mail classification).

  4. -Bank Marketing (classification, donated to the UCI ML repository).

  5. -Internet Traffic Time Series Datasets (donated to TSDL and datamarket.com).

  6. -Input importance synthetic datasets.

  7. -Student Performance (regression/classification, donated to the UCI ML repository).

  8. -Online News Popularity (regression/classification, donated to the UCI ML repository).

  9. -Stock Market Lexicon (with more than 20.000 microblog terms associated with positive or negative scores, available at GitHub).