I do go on a lot about the importance of having modern access to data, and so the appearance of this article[cite]10.1038/sdata.2014.22[/cite] immediately struck me as important. It is published, appropriately enough, in the new journal Scientific Data. The data comprise computed properties at the B3LYP/6-31G(2df,p) level for 133,885 species with up to nine heavy atoms, and the entire data set has its own DOI[cite]10.6084/m9.figshare.978904[/cite]. The data were generated by subjecting a set of molecules to a number of validation protocols, including obtaining relaxed (optimised) geometries at that same level. It would be good to replicate this set using a functional that also accounts for dispersion, and of course having all the coordinates available in this manner greatly facilitates doing so. The collection also includes data for e.g. 6095 constitutional isomers of C7H10O2, which reminds me of an early, delightfully entitled, article adopting such an approach in quantum chemistry[cite]10.1021/jp057107z[/cite]. Such collections are an important part of the process of validating computational methods[cite]10.1007/s00894-005-0278-1[/cite]. This way of publishing data does raise some interesting discussion points.