University of Pisa - Department of Computer Science

subglobal1 link | subglobal1 link | subglobal1 link | subglobal1 link | subglobal1 link | subglobal1 link | subglobal1 link
subglobal2 link | subglobal2 link | subglobal2 link | subglobal2 link | subglobal2 link | subglobal2 link | subglobal2 link
subglobal3 link | subglobal3 link | subglobal3 link | subglobal3 link | subglobal3 link | subglobal3 link | subglobal3 link
subglobal4 link | subglobal4 link | subglobal4 link | subglobal4 link | subglobal4 link | subglobal4 link | subglobal4 link
subglobal5 link | subglobal5 link | subglobal5 link | subglobal5 link | subglobal5 link | subglobal5 link | subglobal5 link
subglobal6 link | subglobal6 link | subglobal6 link | subglobal6 link | subglobal6 link | subglobal6 link | subglobal6 link
subglobal7 link | subglobal7 link | subglobal7 link | subglobal7 link | subglobal7 link | subglobal7 link | subglobal7 link
subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link

PhD Thesis

    Università di Pisa - Dottorato di Ricerca in Informatica, XXI ciclo

    A New Framework for Data Streams Classification



  • Supervisor: Prof. Franco Turini


  • International Reviewers: Prof. J. Gama (Univ. of Porto) – Dr. A. McGregor (Univ. of Massachusetts)


  • Abstract: Mining data streams has recently become an important and challenging task for a wide range of applications, including sensor networks and web applications. The massive quantity of streaming data coupled with concept drifting are two crucial issues in mining data streams. This thesis proposes a new framework for data streams classification, introducing two distinct structures to face the problem of data management and mining. On the one hand, our approach provides a synthetic structure which maximizes data availability, guaranteeing a single data access. On the other, given the synthetic structure, a selective ensemble of classifiers is managed through time to provide a good prediction accuracy. Both components are designed to maximize data usage and accuracy even in the presence of concept drifting, providing a good trade-off between data access management and quality of the model.


  • Keyword: Data Mining, Knowledge Discovery in Databases, Mining Data Streams, Complex Data Analisys


  • Available in: http://etd.adm.unipi.it/theses/available/etd-11242009-124601//
©2007 Department of Computer Science