Analysis of large data sets 

 The upcoming surveys are so large that their analysis has to be automated. Using this motto, I have been exploring the use of the k-means clustering algorithm to organize (classify and process) larger data sets. 

  1. Pre-processing of raw data sets (in the solar-physics context).
  2. Classification of galaxy spectra. ASK classication of all the galaxies with spectra in SDSS DR7
  3. Classification of SEGUE stellar spectra
  4. A single pass k-mean ... faster than the traditional algorithm by Ordovas and SA (2104)