Cao Y, Jiang T and Girke T (2010). “Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing.” Bioinformatics, 26(7), pp. 953–959.