Discovering the topics of a data source: A statistical approach?