Modern algorithms for large data sets in general and for the world-wide web in particular.
Topics: a. Web search and web data mining, information retrieval, search engine
ranking methods, spectral analysis of text, web measurements, structure
of the webgraph, generative models for the web, and rank aggregation. b.
General algorithmic paradigms for dealing with large data sets, like the web:
random sampling, data streams, lossy compression, dimensionality
reduction, low distortion embeddings, and nearest-neighbor schemes.